from Hacker News

Benchmarking Triton (TensorRT) Inference Server for Transformer Models

by julien_c on 4/20/20, 10:50 PM with 0 comments