from Hacker News

DiLoCo: Distributed Low-Communication Training of Language Models

by panabee on 7/26/24, 5:41 PM with 0 comments