from Hacker News

2x faster Gemma 2 finetuning and 63% less VRAM

by ricopags on 7/4/24, 1:27 AM with 1 comment

  • by ricopags on 7/4/24, 1:27 AM

    Gemma 2 27B is currently the best-performing 'open' model [license is non-commercial].

    The Unsloth team has a blog post up describing how they've made fine-tuning Gemma 2 require less VRAM and extended the context window.

    They've also updated their 'mistralified' Phi-3 models to Microsoft's June update of Phi-3, which brings some performance increases as well.