[Old] Q&A Demo: GVHD Data
Evaluating and debugging a Q&A application, step by step
Jump right in
Use Case Background
The data in this tutorial originates from a classic Retrieval Augmented Generation bot that answers questions about the GVHD medical condition. We're evaluating a GPT-3.5 LLM-based app that uses FAISS for the retrieval embedding vectors, with differences in the retrieval strategies and temperatures between the two versions.
The knowledge base is built from a collection of online resources about the condition.
Download the Demo Data
Click here to download the two datasets used in this example.
Structure of this Example
- Start by Creating an Application and Uploading the Data , with Deepchecks' SDK or UI
- Then Explore different flows in the System, to see the automatic annotations, problem identification, version comparison capabilities and more.
Updated 23 days ago
Whatβs Next