by ahmetd on 12/7/24, 9:42 AM with 76 comments
by vunderba on 12/7/24, 4:01 PM
The tables are very similar - though you've added a custom calculator, which is a nice touch.
Also, for the Versus Comparison, it might be nice to have a checkbox that, when clicked, highlights the superlative fields of each LLM at a glance.
by ursaguild on 12/7/24, 12:38 PM
How do you see this differing from or adding to other analyses such as:
https://huggingface.co/spaces/TTS-AGI/TTS-Arena
https://huggingface.co/spaces/hf-audio/open_asr_leaderboard
https://huggingface.co/spaces/TIGER-Lab/GenAI-Arena
Great work on all the aggregation. The website is nice to navigate.
by karpatic on 12/7/24, 3:37 PM
by wslh on 12/7/24, 2:06 PM
But I think this moment mirrors financial markets during times of frenzy. When markets are volatile, one common piece of advice is to “wait and see”. Similarly, in AI, so many brilliant minds and organizations are racing to create groundbreaking innovations. Often, what you're envisioning as your next big project might already be happening, or will soon be, somewhere else in the world.
Adopting a “wait and see” strategy could be surprisingly effective. Instead of rushing in, let the dust settle, observe trends, and focus on leveraging what emerges. In a way, the entire AI ecosystem is working for you: building the foundations for your next big idea.
That said, this doesn't mean you can't integrate the state of the art into your own (working) products and services.
by gtirloni on 12/7/24, 3:42 PM
by politelemon on 12/7/24, 1:13 PM
by ursaguild on 12/7/24, 1:02 PM
by tonetegeatinst on 12/7/24, 11:46 PM
by mcklaw on 12/7/24, 11:53 AM
by xnx on 12/7/24, 11:37 AM
by lolinder on 12/8/24, 12:59 AM
In my own experiments with the chat models, they seem to lose the plot after about 10 replies unless constantly "refreshed", which is a tiny fraction of the supposed 128,000-token input length that 4o has. Does Gemini actually do something dramatically differently, or is their 3 million token context window pure marketing nonsense?
by nikvdp on 12/8/24, 4:17 AM
by robbiemitchell on 12/7/24, 6:16 PM
by alif_ibrahim on 12/7/24, 3:20 PM
by ProofHouse on 12/7/24, 8:52 PM
by Bigie on 12/8/24, 1:06 AM
As far as I know, Volcano Engine in China has impressive text-to-speech capabilities. Many local companies are using this model.
by moralestapia on 12/7/24, 3:56 PM
A small suggestion: a toggle to switch between "free" and hosted models.
The reason is that I'm obv. interested in seeing the cheaper models first, but I'm not interested in self-hosted models, which dominate the first chunk of results because they're "free".
by dangoodmanUT on 12/7/24, 2:40 PM
11labs, deepgram, etc.
by tomp on 12/7/24, 7:53 PM
you're missing a lot
TTS: 11labs, PlayHT, Cartesia, iFLYTEK, AWS Polly, Deepgram Aura
STT: Deepgram (multiple models, including Whisper), Gladia Whisper, Soniox
just off the top of my head (it's my dayjob!)
by wiradikusuma on 12/7/24, 5:13 PM
1. Maybe explain what Chat, Embedding, Image generation, Completion, Audio transcription, and TTS (Text To Speech) mean?
2. Put a running number on the left, or at least just show total?
by mtkd on 12/7/24, 11:47 AM
by 5563221177 on 12/8/24, 3:50 AM
I've got one where "deploying" means updating a few version strings and image references in a different repo. The "build" clones that repo, makes the changes in the necessary spots, and makes a commit. Yes, the side effect I want is that the commit gets pushed--which requires my ssh key, which is not a build input--but I sort of prefer doing that bit by hand.
by shahzaibmushtaq on 12/7/24, 2:47 PM
BTW impressive idea and upvoted on PH as well.
by mentalgear on 12/7/24, 2:52 PM
by ikishorek on 12/10/24, 10:31 AM
by Its_Padar on 12/7/24, 12:38 PM
by victoriawu on 12/9/24, 8:34 AM
I wonder if adding a chatbot might be a good idea. Users could ask specific questions based on their needs, and the bot could recommend the most suitable model. Perhaps this would add more value.
by SubiculumCode on 12/7/24, 9:50 PM
by e-clinton on 12/7/24, 5:39 PM
by amelius on 12/7/24, 4:51 PM
by methou on 12/7/24, 3:06 PM
by NoZZz on 12/7/24, 5:34 PM