DocumentationAPI ReferenceRelease Notes
DocumentationLog In
Documentation

Uploading the Data

For this evaluation, we will upload two distinct datasets to the Deepchecks platform:

  • Predictions on Evaluation Data: The predictions made by our candidate pipeline, based on GPT-4o, on the evaluation dataset along with the ground truth.
  • Predictions on Production Data: The predictions made by our candidate pipeline, based on GPT-4o, on the more recent production dataset.

By uploading these two datasets, we create a comprehensive framework for assessing the model's performance on both historical (labeled) data and more recent (unlabeled) production data. Uploading data to the Deepchecks system can be done either via the SDK or via the UI. Both options are explained below.

Use Deepchecks' Python SDK (Option 1)

📘

Open the Demo Notebook via Colab or click here to download the Notebook, set in your API token (see below), and you're ready to go!

Get your API key from the user menu in the app:

If running locally, we recommend the best practice of using a python virtual environment to install the Deepchecks client SDK.

Or: Use the Deepchecks' UI (Option 2)

📘

Click here to Download the Demo Data

You'll see there the three demo datasets used in this example

  1. Click the Upload Data button on the sidebar

  2. Create a New Application, name it "Movie Genre Classification", Default Interaction Type is "Classification", and a New Version named "gpt_4o".

  3. Create Custom Properties for Vote Count, Vote Average, and Label.

  4. Upload the CSV (gpt_4o_evlaution_set.csv) to the Evaluation environment of the new version.

  5. Proceed to upload the gpt_4o_prod CSV to the Production environment of this version.


👍

Success, the Movie Genre Application is now in the Deepchecks App

  • Some properties take a few minutes to calculate, so some of the data - such as properties and estimated annotations will be updated over time.
  • You'll see a ✅ Completed Processing Status in the Applications page, when processing is finished.