Q&A Demo: GVHD Data
Evaluating and debugging a Q&A application, step by step
Jump right in by Creating an Application and Uploading the Data
Use Case Background
The data in this tutorial originates from a classic Retrieval Augmented Generation bot that answers questions about the GVHD medical condition. We're evaluating a GPT-3.5 LLM-based app that uses FAISS for the retrieval embedding vectors.
The knowledge base is built from a collection of online resources about the GVHD condition, and fictional data generated by chatGPT about a GVHD institute that provides medical support and appointments.
The differences between the two evaluated versions arise from the amount of data provided to the RAG application. Version v2 was given more information, resulting in better performance. Therefore, the second version was then deployed to production.
Open the Demo Notebook via Colab or Download it Locally
Click the badge below to open the Google Colab or click here to download the Notebook, set in your API token (see below), and you're ready to go!
Structure of this Example
- Start by Creating an Application and Uploading the Data , with Deepchecks' SDK or UI
- Then Explore different flows in the System, to see the automatic annotations, problem identification, version comparison capabilities and more.
Updated 2 months ago