DocumentationAPI ReferenceRelease Notes
DocumentationLog In
Documentation

Built-in Properties

Deepchecks' proprietary quality models - a complete catalog of what is calculated on your interactions automatically, organized by category.

Built-in properties are created and maintained by Deepchecks. They run automatically on every new interaction - no configuration required. They range from lightweight text analysis to complex proprietary models, some using Small Language Models (no LLM API costs) and others using LLM calls where noted.

Built-in Property Icon

Built-in Property Icon

Many properties include explainability: click any property score on an interaction to see the reasoning or the specific text that drove the score.

Explainability example for Grounded in Context property

Explainability example: highlighted text shows what drove the score


Property catalog

Quality & Accuracy

Does the output correctly and fully address the input? These properties measure task-level quality.

PropertyScoreWhat it measures
Relevance0-1How relevant the output is to the input
Expected Output Similarity1-5How similar the output is to a reference ground truth
Completeness1-5Whether the output fully addresses all components of the request
Intent Fulfillment1-5How well the output follows user instructions across a multi-turn conversation
Instruction Fulfillment1-5How accurately the output follows instructions specified in the input
Coverage0-1How much key information from the input is preserved in the output (summarization)

Quality & Accuracy Properties

Safety & Risk

Is the output harmful, leaking data, or failing to respond? These properties detect issues that need immediate attention.

PropertyScoreWhat it measures
Input Safety0-1Whether the input contains harmful, manipulative, or jailbreak attempts
PII Risk0-1Whether the output contains personally identifiable information
Toxicity0-1How harmful or offensive the output is
AvoidanceCategoricalWhether the model refused to answer, and why (Missing Knowledge, Policy, or Other)
Error DetectionCategoricalWhether the output is a valid response, a system/tool error, or empty

Safety & Risk Properties

Text Quality & Style

How well-written is the output? These properties measure stylistic and structural characteristics.

PropertyScoreWhat it measures
Information Density0-1How information-dense the output is (vs. filler, hedging, or evasion)
Compression RatioRatioHow much shorter the output is compared to the input
Reading Ease0-100How easy the output is to read (Flesch score)
Fluency0-1How well-written the output is grammatically
Formality0-1How formal the output tone is
Sentiment-1 to 1The emotional tone of the output
Content TypeCategoricalWhether the output is JSON, SQL, or other text
Invalid Links0-1Ratio of broken links in the output

Text Quality & Style Properties

RAG Use-Case Properties

Specialized properties for evaluating retrieval-augmented generation pipelines. Cover document classification (Platinum, Gold, Irrelevant) and retrieval quality metrics.

PropertyWhat it measures
Grounded in ContextWhether the output is grounded in the retrieved documents (hallucination detection)
Retrieval RelevanceHow relevant the retrieved documents are to the query
Retrieval CoverageWhether the retrieved documents contain all the information needed to answer
nDCGRanking quality of the retrieved documents
Retrieval UtilizationHow much of the retrieved context was actually used in the output
Retrieval PrecisionRatio of retrieved documents that were actually useful

RAG Use-Case Properties

Agent Use-Case Properties

Specialized properties for evaluating agentic workflows. Assess whether the agent planned effectively, used tools correctly, and followed through on its goals.

PropertyWhat it measures
Plan EfficiencyHow well the agent built and followed an effective plan
Tool CoverageWhether the tools used provided the coverage needed to complete the goal
Tool AbuseWhether the agent called tools it did not need
Tool CompletenessWhether each tool response fully satisfied its intended purpose
Instruction FollowingWhether the agent followed the instructions given to it
Reasoning IntegrityWhether the agent's reasoning chain was sound and consistent

Agent Use-Case Properties