- OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
by amilios on 10/11/23, 9:38 PM, with comments
- Emergent and Predictable Memorization in Large Language Models
by amilios on 10/11/23, 7:57 PM, with comments
- LoopQuest: A Production Tool for Embodied AI
by amilios on 10/11/23, 7:53 PM, with comments
- Grokking as Compression: A Nonlinear Complexity Perspective
by amilios on 10/10/23, 8:37 PM, with comments
- Outlier Weighed Layerwise Sparsity: A Missing Secret Sauce for Pruning LLMs
by amilios on 10/10/23, 8:04 PM, with comments
- ToolEmu: Identifying the Risks of LM Agents with an LM-Emulated Sandbox
by amilios on 10/10/23, 7:57 PM, with comments
- Thought Propagation: An analogical approach to complex reasoning with LLMs
by amilios on 10/10/23, 7:55 PM, with comments
- I think about LLM prompt engineering
by amilios on 10/10/23, 7:52 PM, with comments
- The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
by amilios on 10/7/23, 9:19 PM, with comments
- Agent Instructs Large Language Models to Be General Zero-Shot Reasoners
by amilios on 10/7/23, 9:17 PM, with comments
- Low-Resource Languages Jailbreak GPT-4
by amilios on 10/7/23, 9:15 PM, with comments
- Tora: A Tool-Integrated Reasoning Agent
by amilios on 10/7/23, 8:59 PM, with comments
- 41% of French pop in favour of limiting everyone to 4 flights for entire life
by amilios on 9/29/23, 9:02 PM, with comments
- Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
by amilios on 9/28/23, 9:03 PM, with comments
- Mistral 7B Instruct Unsafe
by amilios on 9/28/23, 9:01 PM, with comments