from Hacker News

Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning

by summarity on 6/5/25, 5:23 PM with 1 comments

by summarity on 6/5/25, 5:23 PM
Models: https://huggingface.co/Gen-Verse/ReasonFlux-Coder-14B
Paper: https://arxiv.org/abs/2506.03136