from Hacker News

  • Top
  • New

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

by s-macke on 5/21/25, 11:31 AM with 0 comments