from Hacker News

New Recipe: Serving Llama-2 with VLLM's OpenAI-Compatible API Server

by zhwu on 8/22/23, 4:20 PM with 0 comments