Anthropic launches a voice mode for Claude
121 points by kordlessagain 4 days ago | 37 comments- simonw 1 day agoFrom that article:
> According to the report, Anthropic was holding talks with Amazon, the company’s major investor and partner, and voice-focused AI startup ElevenLabs, to possibly drive future voice features for Claude.
> It’s unclear which of those partnerships, if any, came to fruition.
Here's an easy way to confirm that: check Anthropic's "Trust Center" and review any recent updates. https://trust.anthropic.com/updates
Sure enough, on May 29th they have a subprocessor change:
> As of May 29th, 2025, we have added ElevenLabs, which supports text to speech functionality in Claude for Work mobile apps.
I wonder what they're using for speech-to-text?
- zaptrem 1 day agoMaybe also 11L’s Scribe model?
- zaptrem 1 day ago
- owenpalmer 1 day agoThings I love:
1. Start and stop button. I love this explicit control over who is talking when.
2. Ability to upload files while the voice chat is going. Great idea. Often times I use gpt voice chat for studying, and it's annoying when I need to add another PDF to the context, since I need to stop the chat, upload, and then restart the voice session.
3. Real-time text display during voice chat. I asked you to take the derivative of a function I described, and it outlined its steps, but it wasn't just the transcription of what it was saying.
Things I hate:
1. The transcription is terrible. It took me 10 tries during the conversation to describe f(x) = x^2. Looking back on the transcriptions, it's literally nonsense.
2. There was a buggy moment when the voice conversation started but it was still demoing all the voice options simultaneously. Need some polishing.
- wkat4242 13 hours agoI thought transcription was a solved problem now. I run whisper at home and it's blazing fast and accurate with the large model <3. If anthropic is much worse they need to up their game. Or just use Whisper until they do.
- Fairburn 17 hours agoYet, using Abacus.AIs mobile app, you do not need a.. talk.. no talk UI control. It detects when you interject. Would be a nice feature for Claude as well.
- jazzyjackson 13 hours agoBut does the bot know not to interject if I pause to think?
- jazzyjackson 13 hours ago
- wkat4242 13 hours ago
- grg0 1 day agoDoes it say "y'all"?
- esafak 1 day agoNo, it says youse.
- eru 1 day agoAlas, English used to have a perfectly fine 'thou', but then people abandoned it. And now they are re-inventing the same distinction.
Now just wait until people address a single other person with youse, and then have to make up yous'all to address groups.
(Evolution of language is fascinating. I'm just pretending to be upset.)
- anton-c 5 hours agoA user named eru likes language, I'm not surprised!
Big fan of linguistics and philology myself too.
Edit: Also 'youse guys' is for groups I thought, but maybe you're keeping it to one word contractions, haha
- JumpCrisscross 1 day ago> English used to have a perfectly fine 'thou'
Thou was second-person singular. Y’all is second-person plural.
- thfuran 1 day agoYe is really the missing piece.
- 1 day ago
- anton-c 5 hours ago
- mattnewton 1 day ago^yinz
- eru 1 day ago
- esafak 1 day ago
- refulgentis 1 day agoThere was a seemingly odd quick sequence of announcements from elevenlabs the last 24 hours, makes me think it's them - notably, I believe they launched 2.0 of their conversational AI today.
- ecocentrik 1 day agoThe Feynman voice would be great. I've been using it for non-fiction audio books and it works so well.
- ecocentrik 1 day ago
- andrewstuart 1 day agoI really wish Anthropic would focus all of their developer resources on implementing “download all files”.
I know it’s a massive challenge and might take years to get right but the endless copy and paste is wearing me down.
- rahilsheikh 1 day agoYou know you could just use the filesystem mcp server and give it access to your project/downloads folder.
- bdangubic 1 day agouse claude code
- andrewstuart 1 day agoI can’t afford it.
- mceachen 1 day agoTheir new MAX 5x plan is flat rate $3/day but IME it's enough to drive all-day multi-concurrent-sessions if you stay on sonnet.
Their MAX 20x is double the cost $~6/day for quadruple the quota.
Keep in mind that Opus chows quota at 5x+ the rate of sonnet.
- danw1979 1 day agoUse Claude Desktop with MCP attached to your IDE (if you’re coding)
- mceachen 1 day ago
- andrewstuart 1 day ago
- rahilsheikh 1 day ago
- diamondfist25 1 day agoHn people are too poor to pay for max?
- rudedogg 14 hours agoOr some people aren’t seeing the value at $100/mo
- rudedogg 14 hours ago
- nprateem 1 day agoMeh, Anthropic are dead to me until they have structured output.
- kashunstva 1 day ago> Anthropic are dead to me…
They’re dead to me until they fix their over-aggressive auto-ban. Having done nothing more than traveling frequently, rarely using VPN and only using it for coding, I was caught up in a random inexplicable auto-ban. Zero customer service. Appeal process that leads to a black hole. Whatever their technical advances, their user experience when something goes awry is terrible.
- revicon 1 day agoThe prefil method works pretty well...
https://docs.anthropic.com/en/docs/test-and-evaluate/strengt...
- nprateem 1 day agoYeah but it's XML not pydantic which means it doesn't play well with failovers to other providers. It would be tolerable if Anthropic didn't have such abysmal API uptime but at this point no way will I use them for my SaaS.
- nprateem 1 day ago
- kashunstva 1 day ago
- bariswheel 1 day agoI really want to like Claude, but I hit their limit WAY too early when I PAID for it, 9 months ago, WAY before I hit any type of limit on gippity. (gippity - gpt , gimminy - gemini).
- ChadNauseam 1 day agoHaha, I respect calling it gippity. It reminds me of "I call patrick subaru"
- eru 1 day agoI call her gippity, but I abbreviate the name as GPT when typing.
Just like world-wide-web and www.
- eru 1 day ago
- ChadNauseam 1 day ago
- jsnider3 1 day agoI like it, but giving Claude a "Deep Research" mode would be better.
- curtisszmania 1 day ago[dead]