from Hacker News

rasbt

joined 6/6/14, 5:23 PM has 1717 karma

AI researcher and statistics professor

Show HN: New LLM Pre-Training and Post-Training Paradigms
by rasbt on 8/21/24, 2:45 PM, with 0 comments
Developing an LLM: Building, Training, Finetuning (A 1h Video Explainer)
by rasbt on 6/14/24, 12:29 PM, with 12 comments
Evaluating LLMs locally, on a laptop, with Llama 3 and Ollama
by rasbt on 6/13/24, 3:22 PM, with 0 comments
Understanding the LLM Development Cycle: Building, Training, Finetuning
by rasbt on 6/8/24, 1:27 PM, with 0 comments
The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM
by rasbt on 5/12/24, 12:13 PM, with 0 comments
Finetuning an LLM-Based Spam Classifier with LoRA from Scratch
by rasbt on 5/11/24, 2:50 PM, with 0 comments
Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes
by rasbt on 5/3/24, 3:02 PM, with 0 comments
Insights from Finetuning LLMs for Classification Tasks
by rasbt on 4/28/24, 3:29 PM, with 0 comments
Tips for LLM Pretraining and Evaluating Reward Models
by rasbt on 3/31/24, 12:37 PM, with 0 comments
Comparing 5 ways to implement Multihead Attention in PyTorch
by rasbt on 3/8/24, 3:33 PM, with 0 comments
AI Research in Feb 2024 – LoRA Successor, "Small" LLMs, Transparent LLM Research
by rasbt on 3/3/24, 2:20 PM, with 0 comments
Understanding, using, and finetuning Gemma
by rasbt on 2/24/24, 2:18 PM, with 48 comments
Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch
by rasbt on 2/18/24, 6:50 PM, with 10 comments
AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs
by rasbt on 2/3/24, 1:47 PM, with 0 comments