from Hacker News

Following LLM Manufacturer's Instructions

by amarble on 10/22/24, 9:59 AM with 1 comments

  • by ilidur on 10/22/24, 11:07 AM

    Review: The article covers 5 models used in a RAG setup and evaluates their performance according to tutorials given by the respective platforms. The results are overall close but larger models show small improvements. It then evaluates the models on safety categories where some models perform better than others, with one performing overall better. The article presents it's methodology well so it felt the results are useful to understand for specific applications. I liked the safety methods discussion. Likely an article that I'll refer to later when making architecture decisions