from Hacker News
Reinforcement Learning from Human Feedback: When the Math Ain't Enough
by scoresmoke on 8/11/23, 3:02 PM with 0 comments