From Hacker News
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
by s-macke on 5/21/25, 11:31 AM with 0 comments