from Hacker News

Resources to deepen LLMs understanding for software engineers

by frabia on 1/16/24, 9:19 AM with 8 comments

I am a FE developer and am overall interested in LLMs (although who isn't atm?). I have a broad understanding of how LLMs work and a decent knowledge of software, but I clearly don't have the same knowledge an ML engineer would have to understand such a topic.

Therefore I find myself wanting to learn more, but I either find basic resources that already cover what I know, or too advanced stuff that is meant for another audience.

I was wondering if there is somebody producing content for my segment instead. Ideally somebody is very technical on the topic, but that explains it for a broader audience. To be clear, I don't want to become an ML engineer, but I'm interested in learning about updates and breakthroughs (and why they are such), as well as deeply understanding the challenges, limits, applications, and potential of LLMs.

by davidajackson on 1/18/24, 5:27 AM
LlamaIndex docs, LangChain docs, Youtube vids on intro to neural networks, tutorials on RAG, cross encoders, etc. should keep you busy for 20-40 hours at least
by tikkun on 1/16/24, 1:55 PM
https://news.ycombinator.com/item?id=36195527
by coolThingsFirst on 1/16/24, 4:10 PM
The problem with LLMs is that even if you learn them you can't train them. Not enough power in consumer GPUs
by jyu on 1/16/24, 10:29 PM
this has been a great resource. approachable and great for practitioners. it's frequently updated with new papers and techniques https://www.promptingguide.ai/
by duckkg5 on 1/16/24, 1:18 PM
sentdex on YouTube has a good library of videos on the subject.