from Hacker News

Petals runs Llama 2 (70B) from Colab at 5 tokens/sec

by borzunov on 7/19/23, 8:52 PM with 3 comments