- Fault Tolerant Llama training
by Mougatine on 6/23/25, 9:30 AM, with comments
- MuLoCo: Muon is a practical inner optimizer for DiLoCo
by Mougatine on 5/30/25, 11:48 AM, with comments
- OpenDiLoCo: Open-Source Framework for Distributed Low-Communication Training
by Mougatine on 7/11/24, 10:08 AM, with comments
- Show HN: Deep Learning for Computer Vision course with colabs and Anki cards
by Mougatine on 9/7/21, 1:50 PM, with comments
- Continual Learning at CVPR 2020
by Mougatine on 6/30/20, 8:30 AM, with comments
- Operation Red Falcon (2015)
by Mougatine on 3/30/20, 11:59 AM, with comments
- Lifelong Learning for Deep Neural Networks (2019)
by Mougatine on 12/27/19, 10:44 AM, with comments
- Seeing Is Not Necessarily Believing: Limitations of GANs for Data Augmentation
by Mougatine on 5/26/19, 5:37 PM, with comments
- Cars detection from satellite imagery with RetinaNet
by Mougatine on 6/25/18, 2:19 PM, with comments
- Human or Company
by Mougatine on 6/10/18, 8:45 PM, with comments
- 3 Small but Powerful Convolutional Networks
by Mougatine on 5/14/18, 1:48 PM, with comments
- An Explanation of Densely Connected Convolutional Networks
by Mougatine on 5/8/18, 2:49 PM, with comments
- Amazon launches an Android app in India called “Internet”
by Mougatine on 4/25/18, 9:03 AM, with comments
- Summary of “Deep Learning Scaling Is Predictable, Empirically”
by Mougatine on 4/20/18, 9:58 PM, with comments
- How RoI Pooling and RPN Work in Faster-RCNN
by Mougatine on 3/30/18, 1:41 PM, with comments
- Selective Search Explained
by Mougatine on 3/13/18, 5:02 PM, with comments
- Why Some Clocks Have Been Running Slow in Europe
by Mougatine on 3/9/18, 5:37 PM, with comments
- Efficient Graph-Based Segmentation
by Mougatine on 3/9/18, 5:31 PM, with comments
- A Few Useful Things to Know About Machine Learning
by Mougatine on 2/8/18, 2:22 PM, with comments
- Job One for Quantum Computers: Boost Artificial Intelligence
by Mougatine on 2/2/18, 10:22 PM, with comments
- The case for learned index structures
by Mougatine on 1/23/18, 4:09 PM, with comments
- Data-Intensive Systems for the Next 1000x (2016)
by Mougatine on 1/23/18, 12:34 PM, with comments
- The Morning Paper: CS Papers Explained Every Weekday
by Mougatine on 1/14/18, 10:43 AM, with comments
- Useful Mental Models (2016)
by Mougatine on 1/8/18, 9:43 PM, with comments
- Shut Up and Calculate
by Mougatine on 1/2/18, 4:49 PM, with comments
- Course on Distributed Algorithms
by Mougatine on 1/2/18, 3:33 PM, with comments