Skip to content
Datasets
Dataset Annotation Reference

Dataset annotation reference

This page serves as a comprehensive reference for all dataset annotations available in Fides. Dataset annotations are defined in YAML and help describe your data architecture for privacy request processing.

Dataset properties

PropertyTypeRequiredDescription
fides_keystringYesA unique identifier for the dataset
organization_fides_keystringNoThe Fides key of the organization that owns the dataset
namestringNoA human-readable name for the dataset
descriptionstringNoA description of what the dataset represents
data_categoriessequenceNoArray of FidesLang data categories that apply to this dataset
collectionssequenceYesArray of collections/tables within the dataset
tagssequenceNoArray of tags for the dataset
fides_metaYAML collectionNoAdditional metadata about the dataset
- fides_key: {{fides_key}}
  organization_fides_key: {{organization_fides_key}}
  tags: {{tags}}
  name: {{dataset_name}}
  description: {{dataset_description}}
  data_categories:
    - {{data_category}}
      {{data_category}}
  fides_meta: ...

Dataset fides_meta properties

PropertyTypeRequiredDescription
resource_idstringNoThe resource ID of the dataset
aftersequenceNoA list of collections that should be processed before this dataset
namespaceYAML collectionNoNamespace configuration for the dataset
namespace.dataset_idstringNoDataset identifier for the namespace
namespace.project_idstringNoProject identifier for the namespace
namespace.connection_typestringNoIntegration type for the namespace (e.g. "bigquery")
fides_meta:
  resource_id: {{resource_id}}
  after:
    - {{collection_id}}
  namespace:
    dataset_id: {{dataset_id}}
    project_id: {{project_id}}
    connection_type: {{connection_type}}

Collection properties

PropertyTypeRequiredDescription
namestringYesName of the collection/table
descriptionstringNoDescription of the collection's purpose
data_categoriessequenceNoArray of FidesLang data categories that apply to this collection
fieldssequenceYesArray of fields within the collection
fides_metaYAML collectionNoAdditional metadata about the collection
collections:
  - name: {{collection_name}}
    description: {{collection_description}}
    data_categories:
      - {{data_category}}
        {{data_category}}
    fields: ...
    fides_meta: ...

Collection fides_meta properties

PropertyTypeRequiredDescription
skip_processingbooleanNoIf true, this collection will be skipped during privacy request processing
aftersequenceNoA list of collections that should be processed before this collection
erase_aftersequenceNoA list of collections that should process erasures before this collection
masking_strategy_overrideYAML collectionNoThe masking strategy to use for the collection
masking_strategy_override.strategystringNoThe masking strategy to use for the collection. Valid values are "delete" or "mask"
partitioningYAML collectionNoThe partitioning strategy to use for the collection
partitioning.where_clausessequenceNoA list of where clauses to use for the collection
fides_meta:
  skip_processing: true|false
  after:
    - {{collection_id}}
      {{collection_id}}
  erase_after:
    - {{collection_id}}
      {{collection_id}}
  masking_strategy_override:
    strategy: mask|delete
  partitioning:
    where_clauses:
      - {{where_clause}}

Field properties

PropertyTypeRequiredDescription
namestringYesName of the field
descriptionstringNoDescription of the field's contents
data_categoriessequenceNoFidesLang data categories that apply to this field
fides_metaYAML collectionNoAdditional metadata used by Fides for privacy operations
fieldssequenceNoFor JSON datasets, nested data are represented as fields of fields
fields:
  - name: {{field_name}}
    description: {{field_description}}
    data_categories:
      - {{data_category}}
        {{data_category}}
    fields: ...

Field references properties

PropertyTypeRequiredDescription
datasetstringYesThe name of the dataset that contains the referenced collection-field relationship
fieldstringYesThe name of the field in the referenced collection/table
directionstringNoThe direction of the relationship (e.g. "to", "from")
references:
  - dataset: {{dataset_id}}
    field: {{collection_id}}.{{field_id}}
    direction: {{direction}}

Field fides_meta properties

PropertyTypeRequiredDescription
identitystringAt least 1 per privacy request pipelineSpecify the field that should be used as the identity key
referencessequenceNoReferences to other fields for joins
primary_keybooleanNoIf true, indicates this field is a primary key
data_typestringNoThe data type of the field (e.g. "string", "integer")
lengthintegerNoMaximum length for string/text fields
read_onlybooleanNoIf true, field cannot be modified
return_all_elementsbooleanNoIf true, field will return all elements in a collection
custom_request_fieldstringNoThe custom field in a privacy request used to associate with this field
automated_processingbooleanNoIf true, field is used in automated decision making
fides_meta:
  identity: {{field_name}}
  references: ...
  primary_key: true|false
  data_type: {{data_type}}
  length: {{length}}
  read_only: true|false
  return_all_elements: true|false
  custom_request_field: {{custom_request_field}}
  automated_processing: true|false