by vladoh on 5/29/21, 9:02 PM with 110 comments
by LordDragonfang on 5/30/21, 12:11 AM
It's a shame they chose that name, since it was such a great play on words for the midi software (synesthesia is sound into colorful visuals, and midi uses synths) whereas this product has basically no relation.
by stevenicr on 5/29/21, 10:44 PM
Ahh.. the anchor fm problem.. guess I'll need an open source version.
I started toying with libreBot I think it's called - which allows you to do anything you want with these things if you self-host license for a grand I think it was.
This synthesia didn't even get the first sentence I tried. It also requires a 'business email' and agree to terms that includes "I agree to receive occasional product information as per Synthesia Privacy Policy *"
trying hard to keep the genie in the bottle aren't they.
by shannifin on 5/30/21, 1:04 AM
(On a side note, I'm not sure I understand the appeal of emotionally bland fake-smile talking heads in general, even when they're real.)
by question000 on 5/30/21, 1:42 AM
No I'm not asking if you think you can you use this to make money, I'm asking do you personally want to sit through a video of a robot telling you do things? Are we supposed to believe this is preferable to simply reading this or hearing recorded audio? This is flat out consumer hostility, basically telling your customers to talk to a sock puppet instead of a real person, I hope this fails, I would pay money to make this illegal.
by xiphias2 on 5/29/21, 10:52 PM
by erichurkman on 5/29/21, 10:59 PM
by istorical on 5/29/21, 10:30 PM
by firefoxd on 5/29/21, 10:16 PM
by anonytrary on 5/29/21, 11:18 PM
Animations are pretty good. Pronunciation could use some work. There also does not seem to be a way to influence the inflection, which is an absolutely crucial component for sales pitches. It's not so much what you say, but how you say it. Also, the right people have to sell the right things. Words coming from Elon's mouth in regards to cryptocurrency have a far greater effect on market behavior than the exact same words coming from this AI person's mouth.
by K0balt on 5/30/21, 12:27 PM
The incoherent facial expressions actually manage to confuse the message more than the dissociated pronunciation.... "witch is know small feet".
This tech is a neat trick at this stage but is less useful than just leaving the text as text, in fact adding negative value to an already fully functional process.
Fiver is a better option, and I would not recommend that.
For an interesting and highly unethical experiment, someone should raise a thousand infants with this drivel and see what happens...I’m going to posit that the result is not good. Children’s narrations is exactly where this is headed though, I can see this as a multimillion view no effort YouTube babysitter.
Children find a pleasant, smiling female face soothing...so this is going to be another way that the dollar and human laziness will use AI to make the world a slightly worse place.
by going_to_800 on 5/30/21, 9:19 AM
by nemothekid on 5/30/21, 6:17 AM
by Swizec on 5/29/21, 10:31 PM
But what’s the point?
If you’re gonna send someone a soulless corporate drone video, is that really better than a soulless corporate email? I thought the goal of doing video was that it’s more personable and human ... an AI video doesn’t quite hit those goals does it?
by geuis on 5/30/21, 3:49 AM
by cs702 on 5/29/21, 10:30 PM
The lips, eyes, and facial features move in natural ways, but the head remains frozen in a somewhat unnatural manner. It's just inside the uncanny valley, with barely perceptible creepiness.
I would hope to see improvements to make face/neck movements look more natural, to overcome these issues over time!
by 2bitencryption on 5/29/21, 10:27 PM
I mean, combine it with GPT-3 and you've got something that's nearly science fiction. Really interested to see where this goes.
by Cyril_HN on 5/29/21, 10:37 PM
by artur_makly on 5/30/21, 1:00 AM
by andersco on 5/30/21, 1:11 AM
by hyperpallium2 on 5/30/21, 4:59 AM
https://youtube.com/watch?v=DFM5zbekZ7c hour-long dev talk (GDC)
by codeulike on 5/29/21, 11:02 PM
by cupcake-unicorn on 5/30/21, 2:15 AM
by p-sharma on 5/30/21, 9:18 AM
by bredren on 5/30/21, 4:31 AM
https://share.synthesia.io/2761933d-4ec7-48c7-b67e-85fc9d686...
by herval on 5/30/21, 8:01 PM
by ilaksh on 5/29/21, 11:31 PM
Reminds me of the movie The Congress.
Obviously this technology has a long way to go, but it seems that that actors should feel less secure about their jobs being resistant to automation.
by FraserGreenlee on 5/29/21, 10:15 PM
by MarkMc on 5/30/21, 1:59 AM
by aishwaryaashok on 5/31/21, 9:47 AM
by andrewmcwatters on 5/30/21, 3:54 AM
by YeGoblynQueenne on 5/30/21, 7:42 PM
Yay! At last! And when we've automated away everyone's work, also say goodbye to synthesia and every other automation service, because there's no business left to use it. Woo-hoo, future world, here I come!
by system2 on 5/30/21, 4:21 AM
What a great sample.
by evan_ on 5/30/21, 2:04 AM
Again, super creepy and not really clear if it would drive engagement.
by dalmo3 on 6/1/21, 1:11 AM
by pedalpete on 5/30/21, 12:15 AM
Anybody can stand blankly in front of a camera without emotion. But this is an impressive start.
by Meph504 on 5/30/21, 12:11 AM
by jordhy on 5/30/21, 1:25 PM
by mensetmanusman on 5/29/21, 10:32 PM
by smusamashah on 5/30/21, 12:56 AM
by boboche on 5/30/21, 1:13 AM
by ravenstine on 5/29/21, 11:54 PM
by anotheryou on 5/30/21, 7:39 PM
Does anyone know what's under the hood for the text to speech?
by junon on 5/30/21, 7:14 AM
by 0xx on 5/30/21, 1:58 PM
To answer a few recurring questions in the thread
---> Use case.
Video is a way more effective way to communicate than text. Not for the HN crowd, but if you're a blue collar worker a 2 minute video in your native language is much preferred to a 5 page pdf for training.
Anyone who has tried to record a simple corporate video know the pain of cameras, film crews, 25 takes to get one that works and post production. Cumbersome, slow and multidisciplinary. By the time the video is done the content is out of date.
Synthetic video is not yet at the quality of real video. Eventually it will be. But the mistake many are making here is comparing it to real video; it should be compared with text.
In X years we'll be able to make Hollywood films on a laptop without needing anything but time and imagination. Just like we can digitally compose music in Ableton, create images in Photoshop and type novels on keyboards rather than with pen and paper.
My (obviously biased;)) belief is that synthetic media will eventually become foundational technology that will move media production from cameras/microphones to API's. We'll be able to do all kind of things we couldn't do before.
Eg. personalized and interactive rich media, video-driven chatbots and eventually Hollywood blockbusters made by your favourite YouTuber from his or her bedroom.
---> Uncanny valley
Simulating real video is incredibly hard. We're constantly improving and launching more expressive synthesis soon.
From our tests with some of our largest clients 8/10 people don't realise it's a synthetic video (unless they are asked to look for it).
---> Tech
Has been developed over the last 3 yrs. Origins/team from Stanford/UCL/TUM.
Learning: Going from research to working, scaleable product is hard and takes time. But very rewarding when it works.
[1] https://www.youtube.com/watch?v=ohmajJTcpNk [2] https://www.youtube.com/watch?v=qc5P2bvfl44
---> Bad uses
Bad actors will do bad things with synthetic media. Like with any other technology from smartphones to cars. We're moderating all content and building safeguards and verification + working with FAANG and others on detection and provenance technology.
Recommended read - deepfakes perfectly follow the story arc of any new, powerful technology: https://journals.sagepub.com/doi/full/10.1177/17456916209193...
---> Actors
Real actors getting rev share + upfront free from every video generated with their likeness. Like being a stock photo actor.
by jelling on 5/30/21, 2:39 AM
by devops000 on 5/29/21, 10:44 PM
by darepublic on 5/30/21, 3:41 AM
by lxe on 5/30/21, 2:27 AM
by Gualdrapo on 5/29/21, 11:05 PM
by doener on 5/30/21, 11:41 AM
by joshribakoff on 5/31/21, 12:54 AM
by alexfromapex on 5/30/21, 4:29 PM
by Exuma on 5/30/21, 12:48 AM
by flemhans on 5/29/21, 10:48 PM
by cush on 5/30/21, 2:02 AM
by rkagerer on 5/30/21, 6:04 AM
by ratsimihah on 5/30/21, 4:49 AM
by aalfson on 5/30/21, 12:17 AM
by gibba999 on 5/30/21, 12:20 AM