DocumentationAPI ReferenceRelease Notes
DocumentationLog In
Release Notes
These docs are for v0.8.0. Click to read the latest docs for v0.20.0.

0.14.0 Release Notes

This version includes improvements of using the properties (improved properties screen, multiple columns in data screen, improved retrieval relevance property), output text difference highlighting, visibility for the used tokens, and more features, stability and performance improvements that are part of our 0.14.0 release.

Deepchecks LLM Evaluation 0.14.0 Release

  • 🧮 New Properties Screen
  • 🔢 Multiple Columns in Data Screen
  • 👻 Improved Retrieval Relevance Property
  • 🧑‍🤝‍🧑 Text Difference Highlighting
  • 🛤️ Usage Visibility

What's New and Improved?

  • New Properties Screen

    • Following your feedback, the properties experience was now improved, to enable searching and viewing all properties, no matter their type, conveniently.

    • Starred properties are the ones that are viewable in the Overview screen. Remove or add them to overview screen from the properties page itself or from the overview page.

    • Custom properties now support underscores in the property name.

  • Multiple Columns in Data Screen

    • You can now choose multiple properties (or similarity) columns to display simultaneously in the data page, allowing to conveniently view multiple values for the interactions.

  • Improved Retrieval Relevance

    • The “Retrieval Relevance” property is now being calculated using an improved method, leading to better detection of cases in which irrelevant information has been retrieved as part of RAG systems.
  • Text Difference Highlighting

    • You can now turn “on” the toggle in the output view when comparing interactions, to highlight the differences between the outputs of the same interaction across two different versions.

  • Usage Visibility

    • Token usage in the system is now tracked and displayed in the new “Usage” screens in the workspace settings. The token tracking mechanism will replace the existing limits on the number of interactions uploaded or on calculating LLM properties.