from Hacker News

Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)

by rasbt on 6/14/24, 12:29 PM with 12 comments

  • by htrp on 6/14/24, 1:07 PM

    Not Sebastian (who I assume is the OP), but his blog/substack is also a great resource

    https://magazine.sebastianraschka.com/

  • by mdp2021 on 6/14/24, 12:42 PM

    Seems very good, thank you.

    The channel: https://www.youtube.com/@SebastianRaschka/videos

    contains hundreds of video lessons, originally seemingly originating from Sebastian Raschka teaching at Wisconsin-Madison Uni (before he went full-time entrepreneur).

  • by yoouareperfect on 6/14/24, 1:11 PM

    Is anyone training LLMs outside of Meta, OpenAI, etc... ?

    I don't much get the point. For huge models, it's impossible to outcompete them. For smaller models, isn't mistral or LLaMa good enough?

    What are other startups finetuning LLMs for?

  • by oneshtein on 6/14/24, 1:47 PM

    Can someone train an AI to perform all that?