- Show HN: Multimodal Code Generation for Web Data Extraction
by KhoomeiK on 7/9/24, 6:19 PM, with comments
- Show HN: Chinchilla Scaling Laws Are Not Universal
by KhoomeiK on 5/28/24, 6:48 PM, with comments
- Scaling Laws Depend on Data Compressibility
by KhoomeiK on 5/28/24, 5:35 PM, with comments
- Gzip Predicts Data-Dependent Scaling Laws
by KhoomeiK on 5/28/24, 5:28 PM, with comments
- SB-1047 regulates all LLMs after training a suboptimal 1e26 FLOP model
by KhoomeiK on 5/26/24, 11:02 PM, with comments
- How to kindle a fire: solving an 800 year old puzzle in Vedic ritual exegesis
by KhoomeiK on 5/24/24, 7:09 PM, with comments
- How to kindle a fire: solving an 800 year old puzzle in Vedic ritual exegesis
by KhoomeiK on 5/24/24, 8:18 AM, with comments
- Show HN: Tarsier – Vision utilities for web interaction agents
by KhoomeiK on 5/15/24, 4:46 PM, with comments
- Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning
by KhoomeiK on 3/10/24, 12:40 PM, with comments
- De-Redacting Elon's Email with Character-Count Constrained Llama2 Decoding
by KhoomeiK on 3/6/24, 1:49 PM, with comments
- WikiLLM: LLMs as Collaboratively Edited Knowledge Bases
by KhoomeiK on 2/17/24, 5:53 PM, with comments
- Llama2D: 2D Positional Embeddings for Webpage Structural Understanding
by KhoomeiK on 2/2/24, 9:20 PM, with comments
- HoleFill: Automated Data Curation to Fill User-Centric LLM Knowledge Gaps
by KhoomeiK on 1/22/24, 11:06 PM, with comments