from Hacker News

MafiaBench: LLM eval for the social deduction game of Mafia

by __NSL__ on 4/8/25, 3:47 PM with 0 comments