from
Hacker News
Top
New
Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure
by
zone411
on 1/22/25, 4:17 PM with 1 comments
by
celerrimus
on 1/22/25, 6:41 PM
interesting results, thank you!