by kordlessagain on 5/28/25, 2:41 PM with 42 comments
by simonw on 5/31/25, 3:20 AM
> According to the report, Anthropic was holding talks with Amazon, the company’s major investor and partner, and voice-focused AI startup ElevenLabs, to possibly drive future voice features for Claude.
> It’s unclear which of those partnerships, if any, came to fruition.
Here's an easy way to confirm that: check Anthropic's "Trust Center" and review any recent updates. https://trust.anthropic.com/updates
Sure enough, on May 29th they have a subprocessor change:
> As of May 29th, 2025, we have added ElevenLabs, which supports text to speech functionality in Claude for Work mobile apps.
I wonder what they're using for speech-to-text?
by owenpalmer on 5/31/25, 2:33 AM
1. Start and stop button. I love this explicit control over who is talking when.
2. Ability to upload files while the voice chat is going. Great idea. Often times I use gpt voice chat for studying, and it's annoying when I need to add another PDF to the context, since I need to stop the chat, upload, and then restart the voice session.
3. Real-time text display during voice chat. I asked you to take the derivative of a function I described, and it outlined its steps, but it wasn't just the transcription of what it was saying.
Things I hate:
1. The transcription is terrible. It took me 10 tries during the conversation to describe f(x) = x^2. Looking back on the transcriptions, it's literally nonsense.
2. There was a buggy moment when the voice conversation started but it was still demoing all the voice options simultaneously. Need some polishing.
by refulgentis on 5/31/25, 2:23 AM
by grg0 on 5/31/25, 2:25 AM
by andrewstuart on 5/31/25, 3:11 AM
I know it’s a massive challenge and might take years to get right but the endless copy and paste is wearing me down.
by diamondfist25 on 5/31/25, 2:31 PM
by nprateem on 5/31/25, 3:54 AM
by bariswheel on 5/31/25, 2:51 AM
by jsnider3 on 5/31/25, 2:46 AM