Amazon Redshift
Amazon Redshift (opens in a new tab) is a fully managed cloud data warehousing service that offers fast query performance and scalability for analyzing large volumes of data using an SQL-based approach.
Prerequisites
In order to integrate with Amazon Redshift, you'll need to collect the following information from the database adminstrator:
Host
: The hostname or IP address of the server where the database is running.Port
: The network port number on which the server is listening for incoming connections (default: 3306).Username
: The user account used to authenticate and access the database.Password
: The password used to authenticate and access the database.
You may optionally configure the following information:
Database
: The name of the specific database within the database server that you want to connect to.Schema
: The default schema to be used for the database connection (defaults to public).
Integrating with Amazon Redshift
To integrate an existing system with Amazon Redshift:
- Navigate to Data map → View Systems and choose the system that you want to connect to
- Click on the Integrations tab.
- Pick the vendor or system type that you want to integrate with from the Connection type drop-down menu.
- Complete the required fields for the integration and click Save.
You may confirm your connection is working using the Test connection button, if desired.
To learn more, please see our guide for Managing Integrations.
Configuring a dataset
A dataset is required to configure an integration to this database. A dataset is a configuration file that tell Fides how to query and update data in the database. To learn more about how to create and manage datasets, please see our Managing datasets guide.
An example Amazon Redshift dataset can be viewed in Github (opens in a new tab).