from Hacker News

Let's reproduce GPT-2 (124M)

by Multiset on 6/9/24, 11:44 PM with 0 comments