Jump to Content
Documentation
Documentation
API Reference
Release Notes
v0.6.0
v0.7.0
v0.8.0
v0.9.0
v0.10.0
v0.11.0
v0.12.0
v0.13.0
v0.14.0
v0.15.0
v0.16.0
v0.17.0
v0.18.0
v0.19.0
v0.20.0
v0.21.0
v0.22.0
v0.23.0
v0.24.0
v0.25.0
v0.26.0
Documentation
Log In
Documentation
Log In
v0.26.0
Documentation
API Reference
Release Notes
Search
Welcome
Welcome to Deepchecks LLM Evaluation
Getting Started with Deepchecks!
Deepchecks in Action
Q&A Demo: GVHD Data
Uploading the Data
Identify Problems Using Properties, Estimated Annotations and Insights
User-Value Properties and Prompt Properties
Compare Between Versions
Monitor Production Data and Research Degradation
Summarization Demo: E-Commerce Data
Uploading the Data
Configuring the Automatic Annotation
Compare Between Versions
Production Monitoring
Classification Demo: Movie Genre
Uploading the Data
Evaluation Set Analysis
Production Monitoring
User Guide
Deepchecks' SDK
Setup: Python SDK Installation & API Key Retrieval
Main SDK Classes
Data Upload
Data Download
Code Snippets: Full Examples
Hierarchy & Data Upload Format
Supported Use Cases
Features
Automatic Annotations
Customizing the Auto Annotation Configuration
Version Comparison
Root Cause Analysis (RCA)
Production Monitoring
Additional Features
Properties
Built-in Properties
Prompt Properties
User-Value Properties
Deepchecks' UI
Langchain Tracing
USAGE Scenarios
Evaluation Dataset Management
Sending Evaluation Data via Deepchecks SDK
Generating an Initial Evaluation Set (RAG Use Cases Only)
Uploading an Existing Evaluation Set via Drag & Drop UI
Excluding Undesired Interactions from the Evaluation Set
Cloning Interactions from Production into the Evaluation Set
Version Comparison
AI-Assisted Annotations
Hard Sample Mining for Fine-Tuning
Pentesting Your LLM-Based App
Configuring Nvidia's Guardrails with Deepchecks
Integrations
LLMs
OpenAI
Azure OpenAI
Vertex AI
Anthropic
Nvidia NIM
Oracle Cloud (OCI)
AWS Bedrock
Production Monitoring
Datadog Integration
New Relic Integration
Powered by
Suggest