from Hacker News

4M Tokens Context Model

by marvin-hansen on 1/16/25, 2:00 AM with 1 comments

  • by dyl000 on 1/17/25, 1:54 PM

    Apparently performance on larger context kinda sucks. Still impressive we have such large context on open source model.