from Hacker News

Muon Is Scalable for LLM Training

by renonce on 2/25/25, 4:50 AM with 1 comment