from Hacker News

Loading Llama-2 70B 20x faster with Anyscale Endpoints

by robertnishihara on 10/12/23, 3:11 AM with 0 comments