by Ambix on 4/12/23, 5:42 PM with 5 comments
It's written in #Go and runs #LLaMA #GPT inference on a regular PC, so no monster GPU cluster is needed to start experimenting:
https://github.com/gotzmann/llama.go
V1 uses FP32 math only, but AVX2 SIMD support and INT8 quantisation are coming soon.
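
For anyone wondering what INT8 quantisation buys over FP32: each weight shrinks from 4 bytes to 1, at the cost of some rounding error. Here is a minimal sketch of one common scheme (symmetric absmax with a single scale per tensor). This is not taken from llama.go and may differ from whatever the project ends up shipping; the names `quantizeInt8` and `dequantizeInt8` are hypothetical and exist only for illustration.

```go
package main

import (
	"fmt"
	"math"
)

// quantizeInt8 maps FP32 values to INT8 using one shared scale
// (symmetric absmax scheme). Hypothetical helper, not llama.go code.
func quantizeInt8(xs []float32) ([]int8, float32) {
	// Find the largest magnitude so it can be mapped to +/-127.
	var maxAbs float32
	for _, x := range xs {
		if a := float32(math.Abs(float64(x))); a > maxAbs {
			maxAbs = a
		}
	}
	q := make([]int8, len(xs))
	if maxAbs == 0 {
		// All zeros: any non-zero scale works, use 1.
		return q, 1
	}
	scale := maxAbs / 127
	for i, x := range xs {
		q[i] = int8(math.Round(float64(x / scale)))
	}
	return q, scale
}

// dequantizeInt8 recovers approximate FP32 values from INT8 codes.
func dequantizeInt8(q []int8, scale float32) []float32 {
	out := make([]float32, len(q))
	for i, v := range q {
		out[i] = float32(v) * scale
	}
	return out
}

func main() {
	weights := []float32{0.5, -1.0, 0.25, 0.0}
	q, scale := quantizeInt8(weights)
	fmt.Println("int8 codes:", q, "scale:", scale)
	fmt.Println("round trip:", dequantizeInt8(q, scale))
}
```

The round-trip values come back close to the originals but not identical, and that gap is the precision you trade for the 4x smaller memory footprint.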
* The first man in space was Yuri Gagarin from USSR