from Hacker News

  • Top
  • New

Reinforcement Learning from Human Feedback: When the Math Ain't Enough

by scoresmoke on 8/11/23, 3:02 PM with 0 comments