by diminikolaou on 8/11/23, 12:46 PM with 117 comments
“Hacker News Recap” (https://www.wondercraft.ai/podcasts/hacker-news-recap) a podcast produced using our platform, has been running for 4 months and currently gets close to 23k listens per month. We’ve made its analytics publicly available: https://op3.dev/show/f77aea62-97e5-5cce-92c6-9464e51c30c6.
Having previously attempted to start a podcast, we were well aware of the difficulties. Figuring out what equipment and software you need to buy is a daunting start. Editing is a lengthy and tedious process, technical difficulties often occur during recording, and planning logistics around recording is a hassle. As a result, content release is infrequent, which leads to lackluster growth.
At the same time, podcast consumption is experiencing exponential growth. There are 500M podcast listeners around the world, double in size compared to 5 years ago. Apart from the growth in listeners, podcasts are the medium that is most likely to influence behavior, which is the reason why the number of businesses having podcasts has grown 5x over the past 5 years. Finally, the last piece that led to the creation of Wondercraft is that text-to-speech models saw a big improvement about 6 months ago, with ElevenLabs releasing models with an output that is almost indistinguishable to humans (see HN thread here: https://news.ycombinator.com/item?id=34361651).
Wondercraft integrates realistic text-to-speech with an infrastructure that simplifies podcast creation. For example, you can integrate music, publish your podcast / create an RSS feed, generate a video for your episode, get assistance in the script generation, auto generate show notes and transcript and translate your podcast all together. All text based tasks (e.g. script assistance, show note generation, etc) are completed using a chain of custom prompts to LLM models. All text-to-speech is done through custom voices that are either synthetically generated or professionally cloned from Voice Actors, using the ElevenLabs platform. Tasks such as episode translation involve the use of both LLMs and ElevenLabs. Video generation runs using Remotion and the RSS feed is an XML creation and updating routine.
Since launching, we’ve had more than 13k users sign up to create their podcast. Use cases that we’re seeing include: businesses repurposing their blogs and generating video content for their socials; writers/bloggers/newsletters reaching audience through another medium; news outlets and publications adding a news rundown podcast in their lineup; businesses creating internal educational/cultural material; and podcast studios using Wondercraft to serve client needs faster.
Wondercraft is not a tool for fully AI generated content. Rather, we save people time by transferring content they’ve created (e.g. an article they’ve written) to another medium. This technology is best suited for news rundowns and narrational format podcasts (often used by businesses talking about a niche topic). And while interview and conversational formats will sound better person-to-person, the logistical and (often) sound quality issues remain, so we’re testing out an “Async Podcasts” feature, where an interviewee can respond to questions async in writing, share a photo and (optionally) a clip of their voice, and a podcast will be created out of it.
We’d love to hear any thoughts, comments or experiences you may have had in relation to leveraging text to speech for podcast creation. Thank you for taking the time to read!
by Kwpolska on 8/11/23, 5:58 PM
by zurfer on 8/11/23, 1:52 PM
[1] https://podcasts.google.com/feed/aHR0cHM6Ly9hcGkyLndvbmRlcmN...
by vishnuharidas on 8/11/23, 2:02 PM
Also I played around with their podcast generation tool, where it neatly built a podcast from my blog posts. This is a good example of what Generative AI can do in the media domain. Congrats on the production launch! Keep up!
by jedberg on 8/11/23, 3:51 PM
> podcast consumption is experiencing exponential growth
I find this so interesting! I know my personal podcast consumption has fallen off a cliff since the pandemic started. I pretty much only listen to podcasts when I commute, and I stopped commuting then. I assumed that everyone did that but I guess I was wrong.
by aloknnikhil on 8/11/23, 10:59 PM
by dutchbrit on 8/12/23, 11:08 AM
by mcpackieh on 8/11/23, 6:22 PM
by RankingMember on 8/11/23, 3:36 PM
by porkbeer on 8/11/23, 6:53 PM
by benzible on 8/11/23, 3:31 PM
by monological on 8/11/23, 5:30 PM
by another-dave on 8/11/23, 6:28 PM
Might be cool to have a feature that read out the source too, like someone would if a human was reading a quote from a book. Hard to control for everyone's different annotation style though I'd imagine.
by Imply8215 on 8/11/23, 1:29 PM
by cca778 on 8/11/23, 7:18 PM
A text-to-speech can help creating english audio tracks for those producing original content in other languages
by ilovetts on 8/12/23, 12:55 AM
by kyriakosel on 8/12/23, 1:13 PM
by rw2 on 8/11/23, 4:20 PM
Someone with speechify: https://speechify.com/
And who wants to write a spotify API write code can do this.
by colesantiago on 8/11/23, 3:19 PM
I am so happy that this exists, I was considering creating a podcast but it was too much effort involved and had to do and redo takes and other priorities.
Will be considering using Wondercraft and others if they exist entirely for this now.
by GordonS on 8/11/23, 2:21 PM
(there's a "start for free" button, but that could mean anything, and it wants me to create an account)
by sakopov on 8/12/23, 6:12 AM
by causi on 8/11/23, 6:49 PM
by swyx on 8/12/23, 3:44 PM
however when i tried signing up for your pod to make my own, i was disappointed that it would only take manually entered content. i want to hook it up to my twitter or rss or discord feed, and have you Do The Thing. please!
by hexage1814 on 8/12/23, 1:39 AM
by snissn on 8/12/23, 6:20 PM
by ksajadi on 8/11/23, 11:42 PM
by jermaustin1 on 8/11/23, 3:21 PM
by magdyks on 8/11/23, 12:50 PM
by oo0shiny on 8/11/23, 4:00 PM
by languagehacker on 8/11/23, 5:25 PM
by hdivider on 8/11/23, 3:25 PM
by rgrieselhuber on 8/11/23, 2:41 PM
by funthree on 8/11/23, 2:12 PM
by 0921kiyo on 8/11/23, 1:51 PM