AWS Bedrock

This guide outlines how to integrate Deepchecks LLM Evaluation with your AWS Bedrock models to monitor and analyze their performance.

Prerequisites

Before you begin, ensure you have the following:

A Deepchecks LLM Evaluation account.
An AWS account with Bedrock enabled.
Python environment with the deepchecks-llm-client and boto3 packages installed (pip install deepchecks-llm-client boto3).

Integration Steps

Initialize Deepchecks Client:

from deepchecks_llm_client.client import DeepchecksLLMClient  

dc_client = DeepchecksLLMClient(
  api_token="YOUR_API_KEY"
)

Replace the placeholders with your actual API key, application name, and version name.

Log Interaction with AWS Bedrock Models:

Here's an example of how to log interactions with a Bedrock model using boto3:

from deepchecks_llm_client.data_types import LogInteraction, AnnotationType, EnvType
import boto3

# Configure Bedrock runtime client
bedrock_runtime = boto3.client("bedrock-runtime")

def log_bedrock_interaction(user_input, model_id):
    # Make prediction using Bedrock model
    response = bedrock_runtime.invoke_model(
        body=json.dumps({"inputText": user_input}),  # Adjust body for different models
        modelId=model_id,
        accept="application/json",
        contentType="application/json",
    )
    response_body = json.loads(response.get("body").read())
    prediction = response_body.get("results")[0].get("outputText")  # Adjust for different models

    # Log interaction to Deepchecks
    dc_client.log_interaction(
      app_name="YOUR APP NAME",
      version_name="YOUR VERSION NUMBER",
      env_type=EnvType.EVAL,
      interaction=LogInteraction(
        input=user_input,
        output=prediction,
        annotation=AnnotationType.UNKNOWN  # Add annotation if available
      )
    )

# Example usage
user_input = "Write a poem about the beauty of nature."
model_id = "amazon.titan-tg1-large"  # Replace with your desired model ID
log_bedrock_interaction(user_input, model_id)

This code snippet demonstrates how to:

Use the boto3 library to interact with Bedrock models.
Make predictions using the invoke_model method.
Log the interaction data (input, output) to Deepchecks using the log_interaction method.

View Insights in Deepchecks Dashboard:

Once you've logged interactions, head over to the Deepchecks LLM Evaluation dashboard to analyze your model's performance. You can explore various insights, compare versions, and monitor production data.

Note: This example provides a basic integration approach. You might need to adjust the prediction and logging logic based on your specific Bedrock model and use case.