from Hacker News

Dissecting Batching Effects in GPT Inference

by yecomb on 5/18/23, 10:06 PM with 0 comments