from Hacker News

  • Top
  • New

Meta scales Language Model to 1.1T parameters using Mixture of Experts

by danielcampos93 on 1/15/22, 2:28 AM with 0 comments