from Hacker News

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

by smaddox on 5/22/24, 2:14 PM with 1 comment