from Hacker News

Toward Inference-Optimal Mixture-of-Expert Large Language Models

by zhiQ on 4/10/24, 8:47 PM with 0 comments