- Show HN: New LLM Pre-Training and Post-Training Paradigms
by rasbt on 8/21/24, 2:45 PM, with comments
- Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)
by rasbt on 6/14/24, 12:29 PM, with comments
- Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama
by rasbt on 6/13/24, 3:22 PM, with comments
- Understanding the LLM Development Cycle: Building, Training, Finetuning
by rasbt on 6/8/24, 1:27 PM, with comments
- The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM
by rasbt on 5/12/24, 12:13 PM, with comments
- Finetuning an LLM-Based Spam Classifier with LoRA from Scratch
by rasbt on 5/11/24, 2:50 PM, with comments
- Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes
by rasbt on 5/3/24, 3:02 PM, with comments
- Insights from Finetuning LLMs for Classification Tasks
by rasbt on 4/28/24, 3:29 PM, with comments
- Tips for LLM Pretraining and Evaluating Reward Models
by rasbt on 3/31/24, 12:37 PM, with comments
- Comparing 5 ways to implement Multihead Attention in PyTorch
by rasbt on 3/8/24, 3:33 PM, with comments
- AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research
by rasbt on 3/3/24, 2:20 PM, with comments
- Understanding, using, and finetuning Gemma
by rasbt on 2/24/24, 2:18 PM, with comments
- Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch
by rasbt on 2/18/24, 6:50 PM, with comments
- AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs
by rasbt on 2/3/24, 1:47 PM, with comments