from Hacker News

Ask HN: How do you distill a frontier model?

by npollock on 1/29/25, 9:01 PM with 0 comments

Is it just a matter of obtaining the model's next-token prediction distributions, or is it more complex?
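
To make the question concrete, here is a minimal sketch of what "matching the next-token distribution" could look like for a single token position. It assumes you can read the teacher's full logits, which may not be the case for a model behind an API. The names `softmax`, `distillation_loss`, and `temperature`, and the example logits, are illustrative assumptions rather than any particular library's API.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn raw logits into a probability distribution (numerically stable)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) over the vocabulary at one token position.

    Assumption: both models share a vocabulary, so the logits line up index by index.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the teacher
    q = softmax(student_logits, temperature)  # student's current prediction
    return sum(pi * (math.log(pi) - math.log(qi)) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical logits over a 3-token vocabulary.
teacher = [2.0, 1.0, 0.1]
student = [1.5, 1.2, 0.3]

loss = distillation_loss(teacher, student)   # positive when the distributions differ
same = distillation_loss(teacher, teacher)   # zero when they match exactly
print(loss, same)
```

In a training loop this term would be averaged over positions and minimized with respect to the student's parameters. Whether that alone is enough for a frontier model is exactly what the question is asking.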