from Hacker News

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure

by zone411 on 1/22/25, 4:17 PM with 1 comment

  • by celerrimus on 1/22/25, 6:41 PM

    interesting results, thank you!