from Hacker News

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

by cannibalXxx on 12/27/24, 7:37 AM with 1 comments

  • by Alifatisk on 12/28/24, 9:43 PM

    What makes it ultra-large? It being almost 700B params?