from Hacker News

Multi-Agent Step Race Benchmark: LLM Collaboration and Deception Under Pressure

by zone411 on 1/22/25, 4:17 PM with 1 comment