Omnio: First AI model that can natively reason over audio
13 points by lukax 8 months ago | 8 comments
- lukax 8 months ago: We built Omnio to address the limitations we kept running into with existing audio AI models (we previously built an automatic speech recognition product). Most of them rely heavily on speech-to-text, which strips out things like speaker roles, emotions, and non-verbal cues that are critical in more complex scenarios. Omnio processes audio signals directly to capture that kind of context. It's designed to understand conversations in a way that feels more "human."
- barrenko 8 months ago: So this is speech-to-what? You're kinda missing that in your info. Are you guys based in the USA or Ljubljana?
- LukaFurlan 8 months ago: insane release
- sharpshadow 8 months ago: Can this get me the lyrics of rap songs or not?
- overlord_tm 8 months ago: Try it out, you get free credits on signup. Worked surprisingly well for me on this example: https://www.youtube.com/watch?v=DxkeOkaVRLo
- sharpshadow 8 months ago: Yes, I saw it, thanks, and will try it out. The introduction blog post said the beta was only for paid developers, but it seems to be free for all.
- lukax 8 months ago: Whoops. We've updated the website. Omnio is available to all developers and all accounts receive $5.00 in free credits.
- easwee 8 months ago: I tried Eminem - Rap God with a "transcribe word by word" prompt and it did quite well. I just had to set the temperature to 0, otherwise it can get creative.