from Hacker News

Can LLMs Follow Simple Rules?

by armcat on 11/15/23, 2:13 PM with 1 comment

  • by armcat on 11/15/23, 2:13 PM

    Framework for evaluating LLMs' sensitivity to jailbreaks.