from Hacker News

  • Top
  • New

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

by DavidPP on 2/26/25, 3:35 AM with 0 comments