from Hacker News

Not Diamond — new SOTA meta-model

by randyzwitch on 7/30/24, 1:09 PM with 13 comments

  • by t5-notdiamond on 7/30/24, 2:52 PM

    Hey HN, glad to see this here—I’m the founder and CEO of Not Diamond. Not Diamond makes it super easy to train your own custom AI model routers on your data to outperform any single model by intelligently routing to the highest-quality model for each query. We beat every foundation model on every major benchmark at a lower cost and latency.

    To train a router, you literally just upload a dataset with your inputs and eval scores for different models. It’s completely agnostic to your choice of scoring metrics, frameworks, or tools. And if you don’t have your own eval data, you can still use Not Diamond’s base router out of the box—it takes <5m to set up.

    Some other features worth noting:

    • Python, TypeScript, and REST API support

    • Option to route to faster/cheaper models when doing so doesn’t impact quality

    • Joint prompt optimization interface

    • Online, real-time personalization to hyperpersonalize model recommendations to individual end users

    • Blazing fast inference speeds (<100ms)

    • Easy deployments to your private infra

    Would love to hear what folks think.

  • by tbarn on 7/30/24, 3:51 PM

    I've been working with the team at Not Diamond and trying out the private beta for a couple of weeks, and the routing experience is great. It makes it really easy to use different models and also route based on different tradeoffs. All I had to do was grab API keys from the different LLM APIs and quickly set it up.
  • by ramly on 7/30/24, 6:15 PM

    Super cool to see this on HN. Saw Tomas demo an early version of it last year. Very neat work and the team behind it is brilliant.
  • by tt2114 on 7/30/24, 4:19 PM

    Very nice, exactly what I have been looking for. Sign up was easy as well and I have it working in my personal project already.
  • by randyzwitch on 7/30/24, 3:05 PM

    Are computer vision models supported?
  • by drewfustin on 7/30/24, 2:26 PM

    Looking into it
  • by pinkbeanz on 7/30/24, 1:42 PM

    this is neat, where can i find more details on how the automatic routing logic works?