from Hacker News

Top
New

Squeeze more out of your GPU for LLM inference–Accelerate and DeepSpeed

by EntICOnc on 11/2/23, 6:03 PM with 0 comments