from Hacker News

Hermes 3 – Nous Research

by jasim on 11/3/24, 8:44 PM with 3 comments

  • by vintagedave on 11/4/24, 9:06 AM

    It's a fine-tuned model over Llama 3.1, which is the kind of thing I -- a non-expert -- would want to do and have though of doing if I trained a LLM for a specific programming language for private (non-cloud-hosted) use. So this is both interesting and yet I lack to knowledge to really understand its impact in the LLM world.

    > the model displays significant improvements in judgment and reward modeling.

    This seems significant?

    Technical report: https://nousresearch.com/wp-content/uploads/2024/08/Hermes-3...

  • by simonw on 11/4/24, 10:54 AM

    The documentation is missing some details: https://docs.lambdalabs.com/public-cloud/lambda-chat-api/

    When I run a prompt through it I get this back:

        {
          "id": "chat-dea5c8eddcfa4ad08d488f2501f1b3b4",
          "object": "chat.completion",
          "created": 1730717593,
          "model": "hermes3-405b",
          "choices": [
            {
              "index": 0,
              "message": {
                "role": "assistant",
                "content": "The 2020 World Series was played at Globe Life Field in Arlington, Texas. Due to the COVID-19 pandemic, the entire series was held at this neutral site to reduce travel and potential exposure to the virus."
              },
              "finish_reason": "stop",
              "content_filter_results": {
                "hate": {
                  "filtered": false
                },
                "self_harm": {
                  "filtered": false
                },
                "sexual": {
                  "filtered": false
                },
                "violence": {
                  "filtered": false
                },
                "jailbreak": {
                  "filtered": false,
                  "detected": false
                },
                "profanity": {
                  "filtered": false,
                  "detected": false
                }
              }
            }
          ],
          "usage": {
            "prompt_tokens": 65,
            "completion_tokens": 45,
            "total_tokens": 110,
            "prompt_tokens_details": null,
            "completion_tokens_details": null
          },
          "system_fingerprint": ""
        }
    
    Those content_filter_results look interesting - especially if I can turn those options on or off (I'd like to experiment with the jailbreak one for example) - but they aren't mentioned in the documentation at the moment.