0.31.0 Release Notes
3 days ago by Yaron Friedman
We’re excited to introduce new capabilities across translation, production monitoring, version comparison, and property management that give you deeper insight and streamline your evaluation workflows.
Deepchecks LLM Evaluation 0.31.0 Release:
- 🌐 Switched to LLM-based translation
- 📉 Score breakdown comparison in Production
- ⏱️ Latency and token metrics in comparison flows
- 🧪 Improved property page filtering and UX
What's New and Improved?
Switched to LLM-based translation
- We’ve upgraded our translation mechanism to be fully LLM-based. Translations across the platform are now more accurate and context-aware, and they cost less to produce.
Score breakdown comparison in Production
- The score breakdown component is now available in the Production environment. We’ve also introduced a comparison feature that lets you analyze score breakdowns across two different time ranges. This helps you uncover trends, detect potential drift, and identify which properties are causing performance issues or driving improvements, so root-cause analysis is faster.

Score Breakdown Comparison in Production Environment
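To illustrate the idea behind comparing score breakdowns across two time ranges, here is a minimal Python sketch. It is not the Deepchecks API: the record layout (`timestamp`, `properties`), the function names, and the sample values are all hypothetical, and the platform may compute its breakdown differently. The sketch averages each property's score within each range and ranks properties by how much they changed.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def average_scores(interactions, start, end):
    """Average each property's score over interactions with start <= timestamp < end."""
    by_property = defaultdict(list)
    for item in interactions:
        if start <= item["timestamp"] < end:
            for name, score in item["properties"].items():
                by_property[name].append(score)
    return {name: mean(scores) for name, scores in by_property.items()}


def compare_breakdowns(interactions, range_a, range_b):
    """Return (property, avg_a, avg_b, delta) rows, largest absolute change first."""
    avg_a = average_scores(interactions, *range_a)
    avg_b = average_scores(interactions, *range_b)
    shared = avg_a.keys() & avg_b.keys()  # only properties present in both ranges
    rows = [(name, avg_a[name], avg_b[name], avg_b[name] - avg_a[name]) for name in shared]
    return sorted(rows, key=lambda row: abs(row[3]), reverse=True)


# Hypothetical sample data: two interactions in January, two in February.
interactions = [
    {"timestamp": datetime(2025, 1, 5), "properties": {"relevance": 0.9, "fluency": 0.8}},
    {"timestamp": datetime(2025, 1, 6), "properties": {"relevance": 0.7, "fluency": 0.8}},
    {"timestamp": datetime(2025, 2, 5), "properties": {"relevance": 0.5, "fluency": 0.75}},
    {"timestamp": datetime(2025, 2, 6), "properties": {"relevance": 0.5, "fluency": 0.85}},
]

january = (datetime(2025, 1, 1), datetime(2025, 2, 1))
february = (datetime(2025, 2, 1), datetime(2025, 3, 1))

rows = compare_breakdowns(interactions, january, february)
for name, before, after, delta in rows:
    print(f"{name}: {before:.2f} -> {after:.2f} ({delta:+.2f})")
```

In this made-up data, the property with the largest shift between the two ranges sorts to the top, which is the kind of signal the comparison view is meant to surface.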
Latency and token metrics in comparison flows
- We've added latency and token metrics to the version comparison flow. In the multi-version flow, you can now include the averages of these metrics for a high-level overview. In the granular comparison mode, which lets you compare two interactions side by side, the same metrics are shown for each interaction so you can compare them directly.

Average Latency and Tokens in the Version Comparison Screen
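As a rough illustration of what the per-version averages represent, the following Python sketch groups interactions by version and averages their latency and token counts. The field names (`version`, `latency_s`, `tokens`) and the sample values are assumptions made for this example, not the Deepchecks data model or SDK.

```python
from collections import defaultdict
from statistics import mean


def summarize_versions(interactions):
    """Return average latency and token count per version."""
    grouped = defaultdict(list)
    for item in interactions:
        grouped[item["version"]].append(item)
    return {
        version: {
            "avg_latency_s": mean(i["latency_s"] for i in items),
            "avg_tokens": mean(i["tokens"] for i in items),
        }
        for version, items in grouped.items()
    }


# Hypothetical sample data for two versions.
interactions = [
    {"version": "v1", "latency_s": 1.0, "tokens": 100},
    {"version": "v1", "latency_s": 2.0, "tokens": 200},
    {"version": "v2", "latency_s": 0.5, "tokens": 120},
    {"version": "v2", "latency_s": 1.5, "tokens": 80},
]

summary = summarize_versions(interactions)
for version, stats in summary.items():
    print(version, stats)
```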
Improved property page filtering and UX
- We’ve enhanced the Properties page with better filtering and visibility. A new "In auto-annotation" tag marks properties that are included in the YAML-defined auto-annotation flow. You can now filter properties by LLM usage, auto-annotation inclusion, property type, and whether they're pinned to the Overview, making it easier to find and manage relevant properties.

The New Properties Screen Includes the New Tag and Filtering