from
Hacker News
Top
New
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
by
EvgeniyZh
on 5/5/25, 2:26 AM with 0 comments