by desideratum on 3/24/25, 8:39 PM with 1 comment
by reissbaker on 3/24/25, 9:07 PM
I'm sure they're re-RL-training an R1-[minor bump] on top of this model, or perhaps even an R2; it'll be extremely strong when it comes out. For now I've swapped most of my usage to this new V3, since for my use cases it's basically on par with R1 and doesn't require waiting for thinking tokens.