from Hacker News

Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning

by summarity on 6/5/25, 5:23 PM with 1 comment

  • by summarity on 6/5/25, 5:23 PM

    Models: https://huggingface.co/Gen-Verse/ReasonFlux-Coder-14B

    Paper: https://arxiv.org/abs/2506.03136