by scoresmoke on 10/11/23, 8:08 PM with 2 comments
I also wrote a detailed post describing the methodology and analysis: https://evalovernite.substack.com/p/llmfao-human-ranking
[1]: https://twitter.com/_jasonwei/status/1707104739346043143
[2]: https://benchmarks.llmonitor.com/
Unfortunately, I ran my analysis before the Mistral AI model was released but published it afterward. I’d be happy to add it to the comparison if I had its completions.
by maxrmk on 10/11/23, 9:15 PM