Datasets

Datasets are central to processing privacy requests. They codify your organization's data architecture using FidesLang data categories and describe the data processing activity related to each of your data stores.

Datasets are defined in YAML and contain the following components:

Fields: the schema for structured or semi-structured data stores.
Data categories: the categories of personal data contained in the data store.
Identity keys: entry points for processing privacy requests.
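
As a rough illustration of how these three components fit together, here is a minimal sketch of a dataset file. The dataset key, collection, and field names are hypothetical and are not taken from the Cookie House project. The overall layout (collections containing fields, `data_categories`, and an `identity` entry under `fides_meta`) is an assumption based on the commonly documented Fides dataset format, so check it against the dataset reference for your Fides version before relying on it.

```yaml
# Hypothetical dataset, for illustration only.
dataset:
  - fides_key: example_app_db        # assumed unique key for this dataset
    name: Example Application Database
    description: Illustrative dataset describing a single users table.
    collections:
      - name: users                  # a table or collection in the data store
        fields:                      # Fields: the schema of the collection
          - name: id
            data_categories: [user.unique_id]
          - name: email
            # Data categories: the kind of personal data this field holds
            data_categories: [user.contact.email]
            fides_meta:
              # Identity key: marks this field as an entry point for
              # looking up a data subject during a privacy request
              identity: email
          - name: created_at
            data_categories: [system.operations]
```

In this sketch, a privacy request that supplies an email address would start at `users.email`, and the annotated categories on the other fields indicate which data is in scope.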

This guide walks through the steps for creating, annotating, and managing datasets for privacy requests. It references datasets from the Cookie House sample project, available on GitHub.