from Hacker News

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

by limoce on 6/11/24, 2:19 PM with 0 comments