from Hacker News
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
by DavidPP on 2/26/25, 3:35 AM with 0 comments