from Hacker News

Show HN: AI that generates 3blue1brown-style explainer videos

by zan2434 on 1/3/25, 10:44 PM with 46 comments

I've been building prototypes of new AI learning tools for months, but I recently learned that 3blue1brown open sourced his incredible math animation library, Manim, and that LLMs could generate code for it without any fine-tuning.

So I made a tool that automatically generates animated math/science explanations in the style of 3blue1brown using Manim from any text prompt.

Try it yourself at https://TMA.live (no signup required)

or see the demo video here: https://x.com/i/status/1874948287759081608

The UX is pretty simple right now: you write a text prompt and then start watching the video as it's generated. Once it's done generating, you can download it.

I built this because I kept finding myself spending 30+ minutes in AI chats trying to understand very specific concepts that would have clicked instantly if there were a visual explanation on YouTube.

Technical Implementation:

- LLM + prompt to use Manim well. Right now this uses Gemini with grounding to ensure some level of factuality, but it works equally well with Claude

- Manim for animation generation

- OpenAI TTS for the voiceovers

- Fly.io for hosting the web app

- Modal.com for fast serverless GPUs to render the videos

- HLS protocol for streaming the videos as they are rendered
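
To illustrate the last bullet, here is a minimal sketch of how an HLS playlist can grow while rendering is still in progress. It is not the code behind TMA.live; the class name `LivePlaylist`, the segment file names, and the 4-second target duration are all assumptions made for the example. The idea is that the player keeps re-fetching the playlist, each finished segment is appended to it, and the `#EXT-X-ENDLIST` tag is only written once the last segment is done.

```python
from dataclasses import dataclass, field


@dataclass
class LivePlaylist:
    """Hypothetical helper: an append-only HLS media playlist."""

    # Upper bound on segment length in seconds. It is fixed up front because
    # the playlist header should not change between refreshes.
    target_duration: int = 4
    segments: list[tuple[str, float]] = field(default_factory=list)  # (uri, seconds)
    finished: bool = False

    def add_segment(self, uri: str, duration: float) -> None:
        """Register a segment that has finished rendering."""
        if self.finished:
            raise ValueError("playlist already finished")
        if round(duration) > self.target_duration:
            raise ValueError("segment is longer than the target duration")
        self.segments.append((uri, duration))

    def finish(self) -> None:
        """Mark the video as complete so players stop polling."""
        self.finished = True

    def render(self) -> str:
        """Return the current playlist text (the .m3u8 file contents)."""
        lines = [
            "#EXTM3U",
            "#EXT-X-VERSION:3",
            f"#EXT-X-TARGETDURATION:{self.target_duration}",
            "#EXT-X-MEDIA-SEQUENCE:0",
            # EVENT playlists may only be appended to, never rewritten.
            "#EXT-X-PLAYLIST-TYPE:EVENT",
        ]
        for uri, duration in self.segments:
            lines.append(f"#EXTINF:{duration:.3f},")
            lines.append(uri)
        if self.finished:
            # Without this tag the player treats the stream as still growing.
            lines.append("#EXT-X-ENDLIST")
        return "\n".join(lines) + "\n"


if __name__ == "__main__":
    playlist = LivePlaylist()
    playlist.add_segment("seg0.ts", 4.0)  # first chunk rendered
    print(playlist.render())              # playable already, no ENDLIST yet
    playlist.add_segment("seg1.ts", 2.5)  # last chunk rendered
    playlist.finish()
    print(playlist.render())
```

In a real pipeline the renderer would also have to cut the video into segment files (for example with ffmpeg) and serve them next to the playlist; that part is left out here.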

Note: This is focused on STEM education and visualization, and it is particularly good for math, but get creative and try it with anything! I used it recently to teach my partner's parents a new board game in Mandarin (which I don't speak!)

I'll be around to answer questions. Happy learning!

  • by deng on 1/4/25, 1:56 PM

    As usual with these things: it is impressive that stuff like this can be generated so quickly at all, but the content is very superficial and often wrong or at least misleading. It's unusable for learning, but great for spamming platforms, just like NotebookLM for instance.

    As an example, I asked about the Cantor function. It generated a 1:24 video, which is laughably short. It explained correctly how the Cantor set is defined but showed a flawed visual representation, then skipped over how the Cantor function is constructed and simply stated the basic properties. Sorry, but this is garbage content.

  • by mrayycombi on 1/4/25, 3:07 AM

    Those videos are apex quality videos. You might as well ask for Nobel prize literature quality essays from ChatGPT.

    You can probably imitate the structure/scaffolding of a 3b1b video in a cargo cult way, but you are losing domain expert level verification of quality (which is why AI fails, because it's not a domain expert).

    So here's how I'm hearing your question, and it answers itself: "how do I get domain expert quality from a non-domain expert AI?"....

  • by Eddy_Viscosity2 on 1/4/25, 12:33 AM

    Great but wrong answers. I asked how big tree-3 was. After some helpful explaining about exponentials, it tells me it's 27. This is... an underestimate.
  • by philipwhiuk on 1/4/25, 11:55 AM

    Whose voice are you 'borrowing' for this?

    Compared to https://www.youtube.com/watch?v=24GfgNtnjXc this video is absurdly limited https://tma.live/video/9c8e725e-ec21-41a7-984a-317d84216497

  • by tunesmith on 1/4/25, 12:14 AM

    Well, I'm blown away. "Show me how information propagates through a neural net."

    I feel like this is the one thing that's been missing from all the LLMs: actual visual explainers, whether image or video. python plots only get you so far, and all the Diffusion stuff is nonsensical. This is amazing.

  • by beshrkayali on 1/4/25, 10:21 AM

    I have to give you props for not requiring me to sign up. I’ve seen many Show HN posts lately that require me to unnecessarily create an account, which always prompts me to close the tab immediately.
  • by AndrewKemendo on 1/4/25, 12:19 AM

    I tried a few times and this is my experience

    1. Doesn't work at all on Firefox 133.0.3 (64-bit)

    2. Works on Chrome 131.0.6778.205 (Official Build) (64-bit)

    3. No existing links do anything but a sub second "Generating" which disappears quickly

    4. Does not work in Incognito on Chrome 131.0.6778.205 (Official Build) (64-bit)

    My prompt kind of worked but ended at 48 seconds

    Prompt: "Describe a honeybee learning which flower to land on in a field using Markov Decision Process with Bellman updating":

    https://tma.live/video/88f535b5-0e5f-41ca-9bd8-e35e7aa8a95a

    I ran it a second time and got a longer video of 1:55 but it primarily just created Text. It also didn't explain Bellman's equation and wrote it incorrectly:

    https://tma.live/video/88f535b5-0e5f-41ca-9bd8-e35e7aa8a95a

    The second prompt kind of worked but ends at 47 seconds and then loops the final 4 seconds forever.

    Prompt: "Describe how the OODA Loop, MDP and SPA learning approaches are the same"

    https://tma.live/video/ee7b5048-3fde-4f1a-8ec1-c8bb48883c75

    Overall this worked as described. It's more than fast enough, but fails to deliver on consistency and graphics.

    A few more iterations and fine tuning and you'll have a solid Beta. I can see this being very useful after another year or so of use and tuning.

    Great work and congrats on shipping.

  • by Tiberium on 1/4/25, 2:21 PM

    MathGPT also has this (exactly in the same 3blue1brown style, so I guess they also use manim), and in my experience it does actually work better and tries to explain math and write the equations.

    I think they use some extremely cheap model for writing the code, probably 4o-mini or similar.

  • by neumann on 1/3/25, 11:59 PM

    Whether I click an existing example or type in my own it doesn't seem to work. A dialog pops up for a second saying 'generating video' and then disappears.
  • by zkmon on 1/5/25, 2:52 PM

    Most hilarious is https://tma.live/video/8eb2d318-3217-4c09-a8aa-3fc7e8bb7cca

    I asked a history question - tell me about Reddy kings rule. It made up a physics rule and started talking about electrons.

  • by cchance on 1/5/25, 12:56 AM

    The Manim is not great: words overlapping, etc. The page itself needs a lot of work: it says "generating" and then nothing happens. It also seems that in a lot of cases your backend workflows have issues, with descriptions starting and then ending after, say, 30s when they need to go on for another 1-2 minutes at least.
  • by drewolbrich on 1/5/25, 12:59 AM

    Thank you for sharing. This is a wonderful experiment and I think this concept has a lot of potential.
  • by csomar on 1/4/25, 12:45 PM

    Pure Garbage: https://tma.live/video/5fe506ca-3831-4ba3-b9c6-ff899c571bf1

    Although it is pretty impressive for what an LLM can generate these days.

  • by cr4zy on 1/3/25, 11:11 PM

    Wow, this is awesome! Thanks for building. I didn't realize there was a protocol for streaming while rendering, though I noticed sumo.ai doing something similar for audio. Gemini with grounding is new to me also, very nice!
  • by h_tbob on 1/4/25, 8:19 PM

    Haha that’s funny, the beehive one lays out the hexagons as if they were squares - so they overlap and have empty space lol! But still it’s a promising concept.

    Btw for some reason on iOS I had to download to view the video

  • by cstigler on 1/3/25, 11:48 PM

    This is so cool! Streaming videos while the Manim code is still generating is super impressive - sure that took a lot of hacking.
  • by htk on 1/4/25, 1:36 PM

    Pretty incredible how fast those videos are being generated. Excellent work! You can now spam YT shorts all day.
  • by StefanBatory on 1/4/25, 11:43 AM

    Can I ask, out of curiosity, how long it took you to program and design this entire project?
  • by indigodaddy on 1/10/25, 7:38 AM

    Kudos to you, this is really impressive
  • by MidhaelBollox on 1/4/25, 7:32 AM

    Not working
  • by Snoeprol on 1/4/25, 9:34 AM

    Awesome!