from Hacker News

Shallow Feed-Forward Neural Networks as Alternative to Attention in Transformers

by panabee on 11/21/23, 8:52 PM with 0 comments