from Hacker News

Creates hyper-realistic voice clones from just 3 seconds of audio

by blacktechnology on 1/10/25, 6:16 PM with 46 comments

  • by xnx on 1/10/25, 9:15 PM

    What model is this using? I've had good results with e2-ft-tts running locally via Pinokio. You can also run it online for free https://huggingface.co/spaces/mrfakename/E2-F5-TTS
  • by bugglebeetle on 1/10/25, 8:53 PM

    Sure, just let me submit my voice for cloning to a closed sourced, online service of unknown provenance. What could ever go wrong?
  • by delgaudm on 1/11/25, 1:13 AM

    Hey there /u/blacktechnology, could you email me a few seconds of your voice so I can upload it to this site and see how the cloning goes? I'd love to see what I could do with a copy of your voice. Kthxbye.
  • by superkuh on 1/10/25, 9:01 PM

    I submitted an 8 second clip of speech and the resulting synthesized speech did not sound like the same voice. Too bad.
  • by esperent on 1/10/25, 11:20 PM

    We've been advertising to get someone to take over the lease on a commercial building. Surprisingly, we've had several of what seem like very obvious scam attempts - people stringing us along, not trying to bargain (we are in a haggling country, people always try to bargain), asking us to wait unreasonable amounts of time, and finally when pressed breaking down into logical inconsistencies. So, not even good scam attempts.

    I was wondering, what's the point? I mean, it's a building. You pay money, you sign the lease (in person), you get the use of the building. No money, no building. Where's the scam opportunity?

    The only thing I can think of is that they're trying to get enough data and personal info to clone our voices and use that to try and gain access to bank accounts or to scam our relatives. Even if I'm wrong in this case, this seems like a major new vulnerability in society. I mean, if someone who sounded (and with video AI, perhaps even looked) exactly like me called up my mother and pretended I'd been violently robbed or had an accident, she'd transfer money in a heartbeat.

    I'm considering that I should set up some kind of code system with my family for this. As in, if I ever end up in a situation where I need help, I'll use a particular code phrase. If I don't know it, assume it's an AI clone.

  • by croemer on 1/10/25, 10:04 PM

    Getting error: Failed to generate voice
  • by krainboltgreene on 1/10/25, 9:37 PM

    Getting a 500 from the HTTP API and also there's an `debugger` in the javascript.
  • by ge96 on 1/10/25, 8:52 PM

    3 seconds? That's crazy

    "Huuhhhhhhhhhhh"

    I wonder what their "fox jump" sentence is

  • by croemer on 1/11/25, 2:41 PM

  • by croemer on 1/10/25, 11:02 PM

    The title is editorialized, it should be something like: "Anyvoice - AI Voice Cloning"
  • by croemer on 1/10/25, 11:44 PM

    This is almost definitely against GDPR, there's no indication whatsoever of which legal entity is holding the data and how long it is stored on which servers where.
  • by xqcgrek2 on 1/10/25, 10:09 PM

    Has anyone tried multiple iterations? That is, upload a real voice, get its synthesized version, upload synthesized version 1 to get synthesized version 2, rinse and repeat...
  • by 0_____0 on 1/11/25, 5:49 AM

    I'm surprised you were able to repost this so quickly.

    To reiterate, among my friends, if you use a tool like this to clone my voice for any reason, you are dead to me.

  • by nwroot on 1/11/25, 3:25 PM

    Failed to generate over and over
  • by gamblor956 on 1/10/25, 10:49 PM

    This was a great way for them to collect a lot of free voice data to train their model.
  • by clueless on 1/10/25, 10:19 PM

    anybody try this and have a good result?
  • by mxuribe on 1/10/25, 9:43 PM

    Immediately, i thought that cybersecurity is now ruined for the distant future. Imagine if you will, a starship captain ready with a plot to overcome the evil plaguing their crew...and all they need to do is over-ride the starship computer's safety controls with the captain';s own voice override authorization...but, alas, early in 2025 a tech company developed the means by which said evil entity could re-override the captain's voice auth....and block the captain's plan...thereby dooming the entire crew of the starship.

    This is why we can not have nice things; not now nor in the far off future! All of our uniqueness will be more easily duplicated. Thankfully, i won';t upload any of my voice recordings, and i will continue to walk around in my faraday cage suit. /s