from Hacker News

  • Top
  • New

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

by EvgeniyZh on 5/5/25, 2:26 AM with 0 comments