from Hacker News

EfficientQAT: LLM quantization that lets a 2-bit llama2-70B outperform a regular 13B

by jackbravo on 7/18/24, 12:44 AM with 0 comments