from Hacker News

xmo

joined 4/19/18, 8:49 PM has 95 karma

vLLM V1: A Major Upgrade to vLLM's Core Architecture
by xmo on 1/27/25, 6:24 PM, with 0 comments
vLLM v0.6.0: 2.7x Throughput Improvement and 5x Latency Reduction
by xmo on 9/5/24, 5:17 PM, with 0 comments
ML Serving Is Broken
by xmo on 7/23/20, 6:41 PM, with 0 comments
Products Are Functions
by xmo on 7/6/20, 3:22 AM, with 0 comments
Governments vs. Big Tech: Resolving Differences in Contact Tracing
by xmo on 5/25/20, 8:29 PM, with 1 comments
Evolving the Databricks Brand
by xmo on 5/5/20, 8:11 PM, with 0 comments
Scaling Python Asyncio with Ray
by xmo on 3/2/20, 2:07 PM, with 0 comments
Preventing the Death of the Dataframe
by xmo on 1/14/20, 6:13 PM, with 0 comments
YouTube algorithm recommend everyone les miserable video
by xmo on 11/28/19, 6:05 AM, with 0 comments
Demo of OpenAI's (Smaller) GPT2 Model
by xmo on 2/20/19, 5:32 AM, with 0 comments
Looking Back at Postgres
by xmo on 1/9/19, 6:43 PM, with 0 comments
Faster pandas, even on your laptop
by xmo on 10/30/18, 9:18 PM, with 0 comments
Modin: Speed up your pandas by changing one line of code
by xmo on 10/25/18, 6:12 PM, with 0 comments
Architecting Applications for Kubernetes
by xmo on 7/11/18, 9:01 PM, with 0 comments
Pandas on Ray – Early Lessons from Parallelizing Pandas
by xmo on 7/7/18, 12:04 PM, with 19 comments
A Short History of Prediction-Serving Systems
by xmo on 7/7/18, 11:41 AM, with 0 comments