from Hacker News

Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

by heyitsguay on 6/7/25, 7:04 PM with 0 comments