Modelmetry Changelog

This page documents the changes to Modelmetry. We are constantly improving the platform, and we want to keep you updated on what's new.

  • September 18, 2024

    • feature

      Test evaluators in app

      We know how difficult it can be to fine-tune an evaluator's configuration. Therefore, we have added a testing screen directly in the app. You can now test your evaluator by providing a payload and configuring the evaluation instance's parameters.

  • September 14, 2024

    • improvement

      Attach secrets to evaluators

      Once you create a secret, you can attach it to an evaluator instance. This will allow you to use the secret when calling an external third-party API.

      Go to an evaluator instance, and in the Secrets area, select the secret(s) you want to attach. Whilst we do allow you to attach multiple secrets for an instance, you can only attach one secret per provider (e.g., one Openai secret and one Google Cloud secret).

  • September 11, 2024

    • feature

      Vault and secrets

      The new Vault is where you can manage your secrets. You can create secrets, update them, and delete them.

      You can use the Vault to store sensitive information, such as API keys, tokens, and Google CLoud service accounts, for defined providers. We cannot show you a secret's value, but we do store a small preview for you to understand what the secret is about.

      Specific secrets can get attached to instances of evaluators so they can be used when calling external third-party APIs (e.g., Openai API, Google Cloud, Azure, Groq).

      Your secrets are stored encrypted. All secrets are encrypted using AES-256 and they are only decrypted when used for a third-party API call.

  • September 1, 2024

    • feature

      View completion span payload

      You can now view beautifully formatted payloads for completion spans.

      For example, you can see the chat thread, system prompt, and the LLM output in a beautiful interface. We also added tabs to view completion options (e.g., temperature, model, tools, provider) as well as context and RAG.

  • August 25, 2024

    • feature

      Explore trace spans

      When you are viewing a trace, you can now click on a span to view its details. Explore each span's payload, findings, metrics, logs, and events.

  • August 17, 2024

    • feature

      View details of a guardrail check

      Click on a guardrail check in the table to view its details including the check's outcome, metrics, and a break down of the different evaluations ran for this check.

  • August 4, 2024

    • improvement

      View billing usage

      You can now view your billing usage. This will help you to understand how much you have used our various features and how much you have remaining.

      This is available to billing viewers in Settings, then Billing.

  • July 25, 2024

    • feature

      Change project or company logo

      Tenant administrators can now change the project or company logo. This will help you to customise the look and feel of your dashboard. Simply go to the Settings page, then General and click on the Change your logo button.

  • July 22, 2024

    • feature

      Users can update their display picture

      All signed in users can now update their display picture. Simply go to the Account page and click on the Change your avatar button.

  • July 21, 2024

    • feature

      Users can now (finally) update their password

      All users can now update their password. Simply go to the Account page and click on the Change your password button.

      If you have not signed in recently, you will be required to sign out and sign in again to be allowed a password change (for security reasons).

  • July 12, 2024

    • feature

      Custom Role-Based Access Control

      Administrators can now create, update, and remove custom roles. Until today, you could only use one of the three standard roles: member, administrator, and owner.

      With custom roles, you can hand-pick which actions you want to explicitly allow or deny, or leave unset (i.e., implicitly deny).

  • June 30, 2024

    • improvement

      View evaluator metrics

      When you are looking to create a new evaluator instance, you can now view the evaluator's metrics. This will help you to understand the evaluator's capabilities and limitations.

  • June 16, 2024

    • improvement

      Quick search for findings charts

      You can now quickly search for metrics charts by their name. This will help you to quickly find the chart you are interested in.

  • June 7, 2024

    • improvement

      Filter guardrail calls

      You can now filter guardrail calls by their date, guardrail, outcome, and metrics (numeric findings). This will help you to quickly find the guardrail calls you are interested in.

  • June 5, 2024

    • improvement

      Filter evaluators by category (+ text search)

      When you are looking to create a new evaluator instance, you can now browse our list of evaluators using categories or search them by text.

      The text search will look at the evaluator's name, description, and metrics, when checking whether it is relevant or not.

  • June 3, 2024

    • improvement

      Filter API keys by status

      In the Settings's API keys page, you can now quickly filter the table by api key status. You can view them all, or filter by enabled, disabled, and revoked statuses.

  • June 1, 2024

    • improvement

      Update display name and email

      You can now update your display name and email address. This will help you to keep your profile up-to-date. We will add further profile controls in due time.