Mar 10, 2026 | 5 min read

Chuck Data Now Runs Natively on AWS

Amperity's open-source AI agent for Identity Resolution now supports Redshift, EMR, and Amazon Bedrock, no platform migration required.

The short version

Most Identity Resolution tools require you to move your data into someone else's platform before you can do anything useful with it. Chuck Data was built to work the other way around: it runs Identity Resolution where your data already lives.

Until now, that meant Databricks. Now it means Amazon Web Services (AWS), too.

Chuck Data is a free, open-source, CLI-based AI agent that orchestrates Amperity's Identity Resolution engine. It accepts natural language prompts, generates SQL, and resolves fragmented customer records into unified profiles. Identity Resolution runs directly inside your own environment at no cost. No platform contract. No data migration. No new line items in your cloud budget.

What's new

Chuck Data now supports three AWS services:

  • Amazon Redshift serves as the data warehouse layer. You can run Identity Resolution directly against your Redshift tables, whether you're on Serverless or provisioned clusters. Customer records stay where they are.

  • Amazon Elastic MapReduce (EMR) handles compute. Identity Resolution workloads can now run on EMR, which integrates with the Identity and Access Management (IAM) policies, networking, and governance your team has already built out.

  • Amazon Bedrock provides Large Language Model access for Chuck's natural language capabilities. During setup, you choose from 60+ foundation models across six providers: Amazon, Anthropic, Mistral, Cohere, Meta, and Writer. Pick the model that fits your cost, performance, and compliance requirements, including older, previous versions no longer accessible anywhere else but AWS Bedrock.

The result is a fully AWS-native path through Chuck Data. If your team runs on Redshift, EMR, and Bedrock, you can add Identity Resolution to your stack without introducing a single external dependency.

How setup works

Run the /setup command, and Chuck walks you through a guided configuration. You'll select your data provider (Databricks or AWS Redshift); enter your AWS profile, region, and account ID; pick your compute layer (Databricks or EMR); and choose an LLM.

An AWS practitioner who knows their account configuration can finish in a few minutes. Chuck Data’s GitHub repo has video tutorials, demos, and step-by-step install guides for anyone who wants a more detailed walkthrough.

Security and data privacy

Chuck runs locally on your machine, and Identity Resolution executes inside your own AWS account. Amperity never accesses your data, your credentials, or your security configuration.

Usage telemetry is opt-in. During setup, Chuck clearly discloses what it shares (prompts you type, tool context, errors you encounter) and what it never shares (your data, credentials, and account security details). The entire codebase is open source and available for review on GitHub.

For teams operating under strict compliance and governance frameworks, this architecture means there's nothing new to vet beyond your existing AWS security posture.

Who this is for

Data engineers and platform teams on AWS who need to unify customer identities across fragmented sources but don't want to adopt a new platform to do it. If you've ever looked at Identity Resolution vendors and thought "I'm not signing a contract to deduplicate my customer table," Chuck Data is worth 15 minutes of your time.

It's also a starting point for teams evaluating Customer Data Platform (CDP) capabilities who aren't ready for a full platform purchase. You get production-grade Identity Resolution, running on infrastructure you already pay for, at no additional cost. From install to your first unified customer table in under 30 minutes.

GitHub


Chuck Data FAQs

Is Chuck Data free?

Yes. Chuck Data is open source and free to install. Identity Resolution runs inside your own Databricks or AWS environment at no cost to you.

What AWS services does Chuck Data support?

Amazon Redshift (both Serverless and provisioned clusters), Amazon EMR for compute, and Amazon Bedrock for LLM access.

What LLMs can I use through Amazon Bedrock?

60+ models from six providers are currently available, including Amazon, Anthropic, Mistral, Cohere, Meta, and Writer. You select your preferred model during the /setup configuration.

Does this replace my Databricks investment?

No. Chuck Data supports both Databricks and AWS. The new AWS integrations give teams a second path, not a replacement. If you're already running on Databricks, nothing changes. If you prefer AWS, you now have a fully native option. You can also evaluate both side by side using the same CLI.

Can I start using this today for free?

Yes. Install Chuck Data, run /setup, choose your AWS configuration, and start resolving identities. No license, no trial period, no sales call required.

GitHub

Can Chuck handle schema changes?

Yes. Data engineers manage their own schemas, and Chuck can be configured to work with whatever catalogs, tables, and schemas you're using. If your data model shifts, you reconfigure Chuck accordingly.

Can I see or edit the SQL that Chuck generates?

Chuck returns results rather than editable queries. But you can ask Chuck to show the underlying SQL, then copy and modify it yourself. You always have visibility into what's running.

Does my data leave my environment?

No. Chuck runs locally and the Identity Resolution process executes inside your own cloud account. Amperity does not access your data, credentials, or security configuration. The codebase is open source and auditable on GitHub.