Getting Started with LiveRamp Clean Room for Brands
LiveRamp Clean Room offers a streamlined, one-time integration that connects brands to their data assets, enabling collaboration across the ecosystem:
Interoperable: Collaborate with walled gardens, publishers, retail, and owned first-party, second-party, and third-party data globally, no matter the cloud
Flexible: Supports code-based analytics for technical users and no-code UX for business users to access insights and outcomes
Actionable: Turn insights into action with identity-powered activation to 350+ destinations
LiveRamp Clean Room Overview
A clean room is a secure environment that enables data collaboration across decentralized datasets to help uncover actionable insights and outcomes.

LiveRamp Clean Room enables the creation and ongoing support of clean room collaborations through a user-friendly interface.
Included in LiveRamp Clean Room's architecture are three core layers:

The Integration Layer, where we integrate with cloud and media ecosystems to access data at its source. This layer provides foundational support for use cases ranging from reporting insights to user list creation, complex modeling, and more.
The Application Layer, which enables teams to automate dataset workflows and define and enforce governance rules and privacy controls for multi-party collaborations.
The Intelligence Layer , where analytics and insights can be templatized, visualized, and automated.
LiveRamp Clean Room's architecture lays the foundation for LiveRamp Clean Room's question framework, which drives insights consumed from clean room outputs.
![]() |
Key Terminology
The following terminology will help you get up to speed on navigating the LiveRamp Clean Room UI:
Clean room: Secure, protected environment that allows multiple parties to bring data together for joint analysis in a privacy-compliant way.
Data connection: Used to access data at its source to minimize data movement.
Question: A business question in the form of an SQL query that can be reused for various runs or run-time parameters.
Question run: Question processing based on run-time parameters, such as dates and attributes.
Insights: Visualizations of question run results based on run-time parameters.
Roles and Responsibilities
As a partner collaborating in a clean room, there are several important steps required to properly onboard your data, often requiring members of key teams across your organization. Collaborations work best when people are aligned to roles across the clean room partner, clean room owner, and LiveRamp.
To get started, identify people in your organization that align best with the following clean room partner roles shown in the table below.
Roles | Responsibilities | |
---|---|---|
Clean Room Partner | Account and Organization Admins |
|
IT / Data Team |
| |
Marketing / Marketing Analytics |
| |
Account and Organization Admins |
| |
LiveRamp | Client Success Manager |
|
Technical Solutions Manager |
|
Overall Implementation Steps
Getting started with LiveRamp Clean Room for brands typically involves the following overall steps:
You sign the required agreements
You work with the LiveRamp Clean Room team on making key implementation decisions
Set up your account and add users
If desired, test in a sandbox environment with synthetic data
Invite clean room partners for any clean rooms you’ve created
For more information on performing these overall steps, see the sections below.
Required Agreements
When adopting LiveRamp Clean Room, LiveRamp requires their brand customers to agree and sign the following contracting documents:
LiveRamp master services agreement (MSA)
LiveRamp Data Processing Addendum (DPA)
LiveRamp statement of work (SOW) with addendum A, which offers a standardized legal framework for collaboration (Quick-Start Insights).
For advanced data collaboration use cases and features, publishers often require their own terms and conditions to be agreed upon with the brands. Those agreements need to be agreed between the parties before the collaboration.
Key Implementation Decisions
After you've signed the required agreements, your LiveRamp representative will work with you on the implementation decisions listed below, based on your situation and business needs.
Identify Partners
To begin, identify the partners (such as publishers) you wish to collaborate with. Your LiveRamp team can assist in prioritizing these partners and developing a strategy that starts small, learns quickly, and scales effectively. Once a partner agrees to collaborate, we will arrange a three-way call with you, LiveRamp, and the partner.
During this call, you will:
Identify if there are any additional agreements that need to be agreed upon between the parties before the collaboration.
Agree on the use case(s)
Align on data schemas
Align on goals and expectations
Discuss datasets and identify what needs to be done to prepare your data to be sent to the LiveRamp Clean Room
Decide where the data will be hosted
Confirm each party can connect on RampID
Choose Clean Room Offerings
Establish your technical integration with LiveRamp Clean Room once to collaborate frictionlessly across the LiveRamp network and use any of our clean room-powered offerings. This includes:
Code-based workflows for data science and technical teams, which connects a brand and an individual partner (such as a publisher, CMN, or data seller) for custom collaboration (sometimes referred to as “1:1 custom collaboration”).
No-code access via LiveRamp’s Intelligence layer, designed for business users with pre-built queries and dashboards to accelerate time to insight. Quick Start offerings, which are part of the Intelligence layer, include:
Media Intelligence: Connects a brand and an individual publisher
Retail Intelligence: Connects a brand and a RMN (plus optional publisher overlay)
With this integration specification, we hope to simplify how you work with LiveRamp and set you up for scalable success - to meet your own business goals and make it easier to collaborate with your partners.
Choose Where Data Will be Hosted
One of the first decisions to make when implementing the LiveRamp Clean Room is identifying where the data will be hosted:
Connecting to data at the source (such as your cloud warehouse)
Connecting to data you’re already sending to LiveRamp
Sending the relevant data to LiveRamp to host
This decision is not mutually exclusive of where identity resolution happens. For some clients, identity resolution may happen in the LiveRamp-hosted environment. However, the data can still be connected at the source. Your account team can assist with tradeoffs between these scenarios.
Option | Details | Benefits |
---|---|---|
Direct connection to the data source | Data connections can be configured to any cloud-based storage location, including AWS, GCS, Azure Blob, Snowflake, Google BigQuery, and Databricks. For more information, see "Cloud-Based Data Connections". | It can provide flexibility, specifically with data updates, deletion of data connections, and in troubleshooting. |
LiveRamp hosting | Sending data to LiveRamp can be executed through SFTP or setting up a connection to your S3 or GCS buckets. For more information about sending data to LiveRamp, see "Getting Your Data Into LiveRamp". | For customers with existing data feeds with LiveRamp, this offers a seamless way to get started with LiveRamp Clean Room. |
If you choose to connect to your data at source, you’ll need to gather the following prerequisite information:
Prerequisite | Notes |
---|---|
Identify where your data is stored and in what region | LiveRamp Clean Room is interoperable and supports all major clouds, including:
|
Identify the credentials associated with your cloud instance | For example:
|
Identify the region, file format, and file path associated with your configuration | For example:
|
Identify the dataset types you will be bringing into LiveRamp Clean Room | For example:
For information on formatting each dataset type, see the “Data Formatting for Collaboration” section below. |
Identify the user identifier being used for matching in LiveRamp Clean Room |
Choose How to Incorporate Identity Resolution
Once you’ve determined how to connect your data to power LiveRamp Clean Room collaboration, the next step is configuring your identity resolution workflow so that your data can be mapped to RampIDs, LiveRamp’s universal, pseudonymous identifiers.
Connecting your data in LiveRamp Clean Room with RampIDs as the join key maximizes the value, ensuring the highest fidelity view can be connected across various identifier types across your partners. For more information on RampIDs, see “RampID” and “RampID Methodology”.
Note
All RampIDs are given a “partner encoding” so that they are unique to the partner using them (this encoding is a 4-digit number that appears as part of each RampID’s value). LiveRamp hybrid clean rooms automatically translate between encodings for each collaboration partner to ensure seamless interoperability and reduce the risk of re-identification.
LiveRamp has two recommended options for identity resolution (depending on your goals) and will guide you through the process of deciding which will work best for you:
LiveRamp SFTP: Send the files you prepared to LiveRamp. Your account team will provide SFTP technical details. There, LiveRamp will apply identity resolution and connect the data to your LiveRamp Clean Room account.
Embedded Identity Resolution in your Environment:, Should you have an existing or preferred data environment, we offer cloud-based identity resolution in Snowflake, AWS, and BigQuery. Contact your account team for more details or see "Embedded Identity in Cloud Environments".
When preparing data for collaboration, consider the following recommended dataset structure for most data owners:
Universe data: This represents your full audience and likely includes all user identifiers (PII touchpoints or online identifiers) that will need to be used to resolve to RampIDs.
Note
When sending PII, it's important that as many PII touchpoints as possible are provided for LiveRamp's identity resolution capabilities to yield the best results.
Event / conversion data: Conversions or other data relevant for clean room collaboration with your partners.
All datasets should include a field for a CID (a unique customer identifier you leverage for deduping individuals across your datasets):
For PII-based universe datasets, we recommend that plaintext CIDs (not hashed) be sent (if you must hash the CIDs, we ask that you use MD5 hashing for interoperability).
For all other datasets (including universe datasets based on online identifiers), CIDs can be plaintext or hashed with any of our allowed hashing types (SHA-256, MD5, or SHA-1) but should be formatted consistently across datasets.
Note
If you use a combination of LiveRamp (SFTP) and Embedded Identity (Cloud Hosting), or if you send different dataset types, you must ensure that any hashed data is hashed using MD5 hashing in all files.
Identity Resolution Option | Datasets | Inputs | Identity Resolution Results | Update Cadence |
---|---|---|---|---|
LiveRamp (SFTP) | Universe data |
|
This resolved view will be applied to exposure data as it comes through | For most brands, this data set will need to be refreshed monthly |
Event / conversion data |
|
| If sending data through SFTP, send frequent updates for use in the LiveRamp Clean Room | |
Embedded identity (cloud hosted) | Universe data |
|
| For most brands, this data set will need to be refreshed monthly |
Event / conversion data |
| N/A, the event data can be used in the data connection directly | If connecting to data at source, ensure the data connection will update with new impression information |
To determine the best data hosting and identity resolution option for your business, you may need to mix and match solutions.
Choose How to Partition Large Datasets
LiveRamp enables you to indicate partition columns for your data connections in order to optimize query performance. Data partitioning is the process of dividing a large dataset into smaller, more manageable subsets. For partitioned data connections, data processing occurs only on relevant data during question runs which leads to faster processing times.
As a best practice, we recommend you partition the following fields for publisher measurement use cases:
Exposure date or impression date (for exposure or impression data)
Date or timestamp (for transaction or conversion data)
Brand (if applicable)
For more information, see "Data Connection Partitioning".
Set Up Your Account
Once you've made the necessary implementation decisions and your LiveRamp Clean Room organization has been created, a user with the Account Admin role can start setting up your account. This involves setting up permission sets via Roles and adding Users. Roles are the different combinations of clean room access permissions you can create and assign individual users to. When you create Users, you must assign them to a role.
Each newly created user will receive a welcome email to log into LiveRamp Clean Room. Their access level will mirror the role you have assigned them.
Note
Adding users to an organization does not mean these users have access to a configured clean room. Adding users to a clean room is a separate step.
Set Up SSO
To enable SAML-based SSO for LiveRamp Clean Room console access to your account, follow the instructions in "How to Setup SSO for LiveRamp Clean Room".
Manage User Roles
As part of LiveRamp Clean Room organization configuration, account and organization administrators can manage and add new user roles. Roles are combinations of Clean Room permission sets that can be assigned to individual users within an organization.
Clean Room provides a set of optional default roles that can be used as a starting point when setting up your permission structure. To access default roles, from the navigation pane select Admin → Roles. Default roles can be edited or deleted.
Note
Suggested roles for consideration include Org Admin, Analytics Creator, Analytics View, Marketing Manager and Data Science. You can set up as many roles as you’d like.
For more information, see “Managing User Roles”.
Add Users
Once your account administrator has access to the LiveRamp Clean Room UI, they can create additional users. There is a separate flow for adding users to the UI versus adding users to a clean room.
For more information, see "Managing Users".
Test in a Sandbox Environment with Synthetic Data
To help users get real hands on keyboard skills within the Clean Room UI, LiveRamp will provide a sandbox environment to serve as a mock environment for experimentation. This clean room will consist of sample data and sample queries to help guide your team to learn the platform.
Your LiveRamp Client Success Manager will work with you on the testing process.
Format Data for Collaboration
LiveRamp offers recommended data schemas for clean room collaboration. The fields listed in the schemas below can be used to produce measurement results for the following clean room use cases (these will have pre-written queries for your use):
Audience Analytics
Overlap of brand data against publisher data
Audience Profiles
Media Delivery
Reach / Frequency
Impressions delivered across audiences, channels, platforms, campaigns, etc.
Attribution
Conversion counts
Lift
See the sections below for information on formatting your data, including the recommended schema and format for each dataset type you might need to connect to LiveRamp Clean Room.
Note
For more information on general file formatting guidelines, see "Data Connection Requirements and Best Practices".
If you’re utilizing LiveRamp Embedded Identity in your cloud environment, the formatting guidelines are somewhat different. For more information, see “Embedded Identity in Cloud Environments” in our Identity docs site.
While not all fields might be required for your current workflows and collaborations, we recommend you include as many of these additional fields now to make it easier for you to expand your collaborations in the future without you (or your IT team) having to perform additional work on your datasets.
The recommended field names shown aren’t mandatory, but using the recommended names can reduce the number of transformations you and your collaborators might need to do during dataset provisioning or query writing.
All data files or tables in a data connection job must have the same schema in order to successfully process. If you have multiple types of schemas, create a new data connection for each one. The order of the columns should match the schema.
Recommended Dataset Types for Brands
When preparing data for collaboration, consider the following recommended dataset types:
Universe dataset: This represents your full audience and likely includes all user identifiers (PII touchpoints or online identifiers) that will be used to resolve your data to RampIDs.
Note
For brands, you might consider your CRM dataset to be your “universe” of consumers.
CRM dataset: Includes attribute (segment) data on consumers.
Conversions dataset: Includes data on conversions, such as transactions.
Product dimensions dataset: For Clean Room users with products, this includes metadata on your products.
Store dimensions dataset: For Clean Room users with stores, this includes metadata on your stores.
With the exception of datasets for product and store dimensions, all datasets should include a field for a CID (a unique customer identifier you leverage for deduplicating individuals across your datasets):
For PII-based universe datasets, we recommend that plaintext CIDs (not hashed) be sent (if you must hash the CIDs, we ask that you consistently use the same hashing for CIDs in other datasets for interoperability).
For all other datasets (including universe datasets based on online identifiers), if you sent plaintext CIDs in your universe dataset, send MD5-hashed CIDs. If you hashed the CIDs in your universe dataset, use the same hashing type.
Format a Universe Dataset
The universe dataset should represent your full audience and should include all user identifiers (PII touchpoints or online identifiers) that will be used during identity resolution to resolve to RampIDs.
LiveRamp uses this dataset to create a mapping between your CIDs and their associated RampIDs. This mapping lives in a linked dataset and allows you to use RampIDs as the join key between the various datasets in queries.
For information on formatting and hashing identifiers, see “Formatting Identifiers”.
Note
When sending PII, it’s important that as many PII touchpoints as possible are provided for LiveRamp’s identity resolution capabilities to yield the best results.
Your CRM dataset might also be able to function as a universe dataset.
Field Contents | Recommended Field Name | Field Type | Values Required? | Description/Notes |
---|---|---|---|---|
A unique user ID | cid | string | Yes |
|
Consumer’s first name | first_name | string | Yes (if Name and Postal is used as an identifier) | |
Consumer’s last name | last_name | string | Yes (if Name and Postal is used as an identifier) | |
Consumer’s address | address_1 | string | Yes (if Name and Postal is used as an identifier) | |
Consumer’s additional address information | address_2 | string | No |
|
Consumer’s city | city | string | Yes (if Name and Postal is used as an identifier) | |
Consumer’s state | state | string | Yes (if Name and Postal is used as an identifier) |
|
Consumer’s ZIP Code or postal code | zip | string | Yes (if Name and Postal is used as an identifier) |
|
Consumer’s best email address | email_1 | string | Yes (if email is used as an identifier) |
|
Consumer’s additional email address | email_2 | string | No |
|
Consumer’s additional email address | email_3 | string | No |
|
Consumer’s additional email address | email_4 | string | No |
|
Consumer’s best phone number | phone_1 | string | Yes (if phone is used as an identifier) |
|
Consumer’s additional phone number | phone_2 | string | No |
|
Consumer's mobile device ID (MAID) | maid | string | Yes (if MAIDs are used as identifiers) |
|
Format a CRM Dataset
Your CRM dataset should contain a CID for each consumer, as well as attribute (segment) data fields.
Field Contents | Recommended Field Name | Field Type | Values Required? | Description/Notes |
---|---|---|---|---|
A unique user ID | cid | string | Yes |
|
Consumer attribute category | <User Attribute 1> | varies | No |
|
Consumer attribute category | <User Attribute 2> | varies | No |
|
Consumer attribute category | <User Attribute 3> | varies | No |
|
Consumer attribute category | <User Attribute 4> | varies | No |
|
Format a Conversions Dataset
The conversions dataset should include information on the desired conversions (such as transactions or downloads).
Note
If you do not have values for any fields in this schema, we recommend that you still include those fields as placeholders.
Field Contents | Recommended Field Name | Field Type | Values Required? | Description/Notes |
---|---|---|---|---|
A unique user ID | cid | string | Yes |
|
Whether the consumer is a loyalty customer | is_card_holder | integer | No |
|
Country where the conversion occurred | country_id | string | No |
|
Unique identifier for the transaction | transaction_id | string | Yes | |
UTC timestamp of the conversion event | conversion_timestamp | timestamp | No |
|
Unique identifier for SKU or product | product_id | string | Yes | (Foreign Key for Product Table) |
Barcode number | barcode | string | No |
|
Unique identifier for the sales channel | store_id | string | No |
|
Unique identifier for a line within a receipt | trans_line_number | string | No | |
Units of a particular SKU/product ID in transaction | trans_line_quantity | int | No |
|
Value of a particular SKU/product ID in transaction | trans_line_value | numeric | No |
|
Line-level discounts | trans_line_disc_value | numeric | No | |
Whether this line represents a promotion product | trans_line_promo | integer | No |
|
Total number of items in an order/basket | trans_total_quantity | integer | No | |
Total value of order/basket | trans_total_value | numeric | No | |
Total discounts | trans_total_disc_value | numeric | No | |
Whether this line represents returned product | return_flag | integer | No |
|
Iso currency code | currency | string | No |
|
USD rate | currency_rate | numeric | No |
|
Units*Unit price of a particular SKU/product ID in transaction | gross_amt | double | Yes | |
Timestamp of the conversion event | conversion_timestamp | timestamp | Yes | |
Name of the SKU or product | product_name | string | No |
|
Where the transaction took place | sales_channel | string | No |
|
Product categorization | division | string | No |
|
Brand name of the product | brand_name | string | No |
|
Format a Product Dimensions Dataset
For Clean Room users with products, a product dimensions dataset includes metadata on your products.
Note
If you do not have values for any fields in this schema, we recommend that you still include those fields as placeholders.
Field Contents | Recommended Field Name | Field Type | Values Required? | Description/Notes |
---|---|---|---|---|
Product ID | product_id | string | Yes | A 1toN relationship between product_id and barcode is accepted (ex: product_id 123 linked to: Barcode ABC). |
Barcode number | barcode | string | Yes |
|
The product name | product_name | string | Yes | |
The product description | product_desc | string | No | |
country_id | string | No |
| |
The name of the manufacturer | supplier_name | string | Yes | Should be the name of the company manufacturing the brand/item, not the potential 3rd party supplier). |
supplier_key | integer | No | ||
The name of brand | brand_name | string | Yes |
|
brand_key | integer | No | i.e. brand_id (unique identifier for product brand) | |
The name of the first-level hierarchy category this product belongs to | hierarchy_level1_name | string | Yes | Such as the business group |
The name of the second-level hierarchy category this product belongs to | hierarchy_level2_name | string | Yes | Such as the sector |
The name of the third-level hierarchy category this product belongs to | hierarchy_level3_name | string | Yes | Such as the department |
The name of the fourth-level hierarchy category this product belongs to | hierarchy_level4_name | string | Yes | Such as the class |
The name of the fifth-level hierarchy category this product belongs to | hierarchy_level5_name | string | Yes | Such as the category (product category) |
The name of the sixth-level hierarchy category this product belongs to | hierarchy_level6_name | string | Yes | Such as the subcategory (product sub category) |
Need unit of product | need_unit | string | No | |
Whether this is a discontinued product | discontinued_flag | integer | No |
|
Whether this is a white-label product | is_white_label | integer | No |
|
Whether this is a promotion product | is_promo | integer | No |
|
Format a Store Dimensions Dataset
For Clean Room users with stores, a store dimensions dataset includes metadata on your stores.
Note
If you do not have values for any fields in this schema, we recommend that you still include those fields as placeholders.
Field Contents | Recommended Field Name | Field Type | Values Required? | Description/Notes |
---|---|---|---|---|
Main identifier for stores | store_main_id | string | No |
|
Unique identifier for sales channel | store_id | string | No |
|
Store name | store | string | No |
|
Country of store | country_id | string | No |
|
Store region | store_region | string | No | |
Default use state, or official region name for store | store_state | string | No | (or store DMA) |
City of store | store_city | string | No | |
Store's postal code | postal_code | string | No | |
Whether the store is closed | discontinued_flag | integer | No |
|
Channel type | channel_type | string | No |
|
Channel name | channel_name | string | No |
|
Delivery type | delivery_type | string | No |
|
Latitude of store location | latitude | decimal | No |
|
Longitude of store location | longitude | decimal | No |
|
Connect Your Data
Once you've confirmed that you have the required datasets ready for use, you're ready to connect your data. If you’re going to be hosting your data at its source in a cloud warehouse, you’ll need to connect that data to your Clean Room account.
Note
If you’ve chosen to have LiveRamp host the data, see the information in “Uploading Data”.
Before configuring your data connections in LiveRamp Clean Room, confirm you have the required datasets ready for use in a clean room. Knowing where your datasets live, the credentials and tables required to access those datasets, and having the proper file and table formatting in place will make the remainder of your setup much more seamless.
Connecting your data involves creating a data connection by performing the overall steps listed below. The type of connection to create will depend on your situation and business needs. Your LiveRamp representative will work with you to determine the type(s) of connections to create.
For more information on the steps to perform for your cloud provider and clean room type, see the articles in “Cloud-Based Data Connections”. Specific instructions for each step will be listed in the appropriate help article for your data connection type.
A data connection needs to be created for each dataset you want to utilize in a clean room.
After you create data connections, these connections appear on the Data Connections page, and you can create and configure clean rooms.

Perform Actions in the Cloud Provider UI
For certain data connections, you’ll have to perform tasks in the cloud provider’s UI. This often involves things like downloading credentials, creating service keys, granting permissions, generating a token, or other similar actions.
Add Credentials
For LiveRamp to be able to access your data at your cloud provider, you’ll need to create a credential within LiveRamp Clean Room for that cloud account. This often involves entering a credential or token you generate from your cloud provider.
Create a Data Connection
Once you've created the appropriate credentials, you can create the data connection itself. During this process, you’ll also specify things like the file path LiveRamp should use to access your data, the file format, the field delimiter, and the data location.
Map the Fields
Once you’ve created the appropriate data connection and LiveRamp has successfully connected to your data, you then map and configure the fields. This includes specifying which fields you want to be queryable, any updates to column labels, which fields contain PII, and which fields should be used as identifiers.
Collaborate on Clean Room Questions
Once you and your partners are up and running in the clean room, you're ready to collaborate. See the information in the "Collaboration Guides" section of our documentation site for articles to help you with collaboration activities.