Configure a Google Cloud Storage Data Connection (LiveRamp-Hosted)

LiveRamp Clean Room’s application layer enables companies to securely connect distributed datasets with full control and flexibility while protecting the privacy of consumers and the rights of data owners.

To configure a LiveRamp-hosted Google Cloud Storage (GCS) data connection, see the instructions below.

Note

You can connect GCS to LiveRamp Clean Room from your own GCS account instead of using a LiveRamp-hosted GCS account. For more information, see "Configure a Google Cloud Storage Data Connection (Customer-Hosted)".

Overall Steps

Perform the following overall steps to configure a LiveRamp-hosted GCS data connection:

For information on performing these steps, see the sections below.

Guidelines

Review the following guidelines before starting the setup process:

LiveRamp Clean Room supports CSV and Parquet files, as well as Delta tables and multi-part files. All files should have a file extension. All CSV files must have a header in the first row. Headers should not have any spaces or special characters and should not exceed 50 characters. An underscore can be used in place of a space.
LiveRamp encourages the use of partition columns for optimal question run performance.

Generate a Google Cloud Storage Database in LiveRamp Clean Room

To generate a GCS database in LiveRamp Clean Room:

From the navigation pane, select Data Management → Data Source Locations.
In the row for LiveRamp-Hosted Google Cloud Storage, click Generate Location.
Note
These credentials may also be generated when creating a new data connection.

Add the Credentials

To add credentials:

From the LiveRamp Clean Room navigation pane, select Data Management → Credentials.
In the row for the Habu Google Service Account Credential Source, select "Activate" from the Actions menu
Review the credentials information and then click ACTIVATE CREDENTIALS.
The next screen displays the Google Project ID and the Credential JSON.
Copy and store the credentials in a secure location.

Use the credentials to authorize and send files to the LiveRamp-hosted GCS bucket generated in the previous procedure.

Create the Data Connection

After you've added the credentials to LiveRamp Clean Room, create the data connection:

From the LiveRamp Clean Room navigation pane, select Data Management → Data Connections.
From the Data Connections page, click New Data Connection.
From the New Data Connection screen, select "LiveRamp-Hosted Google Cloud Storage".
If you've already generated credentials, they will automatically populate. Otherwise, you can generate or regenerate credentials from this page.
Configure the data connection:
- Name: Enter a name of your choice.
- Category: Enter a category of your choice.
- Dataset Type: Select Generic.
- File Format: Select CSV, Parquet, or Delta.
  Note
  All files must have a header in the first row. Headers should not have any spaces or special characters and should not exceed 50 characters. An underscore can be used in place of a space.
  If you are uploading a CSV file, avoid double quotes in your data (such as "First Name" or "Country").
- Quote Character: If you're uploading CSV files, enter the quote character you'll be using (if any).
- Field Delimiter: If you're uploading CSV files, select the delimiter to use (comma, semicolon, pipe, or tab).
- Identifier Type
- Data Location: The Data Location will automatically populate with the GCS bucket location generated in the "Generate a Google Cloud Storage Database in LiveRamp Clean Room" section above.
- Identifier Type: You do not need to make a selection for this field.
- Data Refresh Type: Select the method to use to refresh the data:
  - Full: Replaces all existing data with new data
  - Incremental: Uses the new data to extend and modify the existing data
- Data Schema Reference File: If you enabled partitioning above, enter the location of a data schema reference file.
Review the data connection details and click Save Data Connection.
Note
All configured data connections can be seen on the Data Connections page.
Upload your data files to your specified location.

When a connection is initially configured, it will show "Verifying Access" as the configuration status. Once the connection is confirmed and the status has changed to "Mapping Required", map the table's fields.

You will receive file processing notifications via email.

Map the Fields

Once the connection is confirmed and the status has changed to "Mapping Required", map the table's fields and add metadata:

From the row for the newly created data connection, click the More Options menu (the three dots) and then click Edit Mapping.
The Map Fields screen opens, and the file column names auto-populate.
For any columns that you do not want to be queryable, slide the Include toggle to the left.
If needed, update any column labels.
Note
Ignore the field delimiter fields because this was defined in a previous step.
Click Next.
The Add Metadata screen opens.
For any column that contains PII data, slide the PII toggle to the right.
Select the data type for each column.
For columns that you want to partition, slide the Allow Partitions toggle to the right.
If a column contains PII, slide the User Identifiers toggle to the right and then select the user identifier that defines the PII data.
Click Save.

Your data connection configuration is now complete and the status changes to "Completed".

In this section:

Configure a Google Cloud Storage Data Connection (LiveRamp-Hosted)

Note

Overall Steps

Guidelines

Generate a Google Cloud Storage Database in LiveRamp Clean Room

Note

Add the Credentials

Create the Data Connection

Note

Note

Map the Fields

Note

Search results