from Hacker News

Mixture of Tunable Experts - DeepSeek R1 Behavior Modification at Inference Time

by pr337h4m on 5/1/25, 10:05 AM with 1 comment