by rajko_rad on 12/13/23, 7:24 PM with 9 comments
by dwrodri on 12/13/23, 9:51 PM
I think a large number of people who get interested in implementing deep learning papers end up referencing lucidrains' work.
by reflectored on 12/13/23, 10:59 PM
Volatile GPU-Util 43%
by toppy on 12/13/23, 9:07 PM
by boiler_up800 on 12/13/23, 9:32 PM
by mi_lk on 12/14/23, 3:40 PM
by dontupvoteme on 12/13/23, 10:27 PM
A bit surprised about Common Crawl. Is the financial cost there due to the massively bloated scale of the modern web they're (presumably) still trying to crawl?