Formatting File Data

Once you’ve determined what data you’ll be including, and which audience you’ll be uploading the data to (if necessary), review the information in the sections below (including the sections pertaining to your file type) to make sure that the data are formatted in a way that LiveRamp can accept.

Note

If you're formatting one of the file types listed below, keep the following in mind:

For CTV (Connected TV) event files, make sure to follow the guidelines in "Enable CTV Targeting by Sending CTV Event Files".
For conversion data files for one of our attribution programs, make sure to follow the file formatting guidelines for that program. See "Conversions API Programs" for more information and program-specific documentation.
For Simple Onboard files, make sure to follow the additional guidelines in "Required File Format for Simple Onboard Files".
If you're a reseller client (or a reseller uploading data on behalf of a reseller client), follow the additional guidelines in "Activation Guide for Reseller Clients".

Make sure to review the following guidelines for the LiveRamp workflow that you're using (in addition to our data guidelines in "Types of Data That Can Be Included in Files" and "LiveRamp Data Restrictions"):

The Activation workflow, which involves creating fields and segments from your data in Connect so that you can distribute those fields and segments to your desired destination platforms for activation. See "Activation Workflow Overview" for more information.
The Measurement Enablement workflow, which involves replacing the input identifiers (such as PII, cookies, mobile device IDs, CIDs, etc.) in the files you upload with their associated RampIDs and returning the files to the location you specify. See "Measurement Enablement Workflow" for more information.

Once you’ve made sure that all of the data are formatted correctly, finalize the file for uploading.

Note

See "Uploading Data" for an overview of the file creation and formatting process.

File Formatting Guidelines Highlights (Activation Workflow)

Make sure to properly format your files so that we can process them quickly. It is especially important to keep the following key guidelines in mind for files in our Activation workflow:

An audience key field: The audience key in your file is used to deduplicate the rows in your file, so the column values should be unique. A client customer ID, or something similar, is usually the best audience key. If you aren’t able to provide a client customer ID, you can also use an identifier field, such as email address, as an audience key.
Caution
For best results, and to avoid having ingestion processing pause or fail, make sure the audience key has a fill rate of as close to 100% as possible.
Once you decide on a format for your files and we've created an ingestion automation configuration for that audience, the audience key and the column headers for the identifier columns that we use for matching must stay exactly consistent across files. For more information, see "The Ingestion Automation Process for File Uploads".
Columns for identifiers: Identifiers are the data that LiveRamp uses to match the records in your file to RampIDs. For offline data, these are PII identifiers such as name, postal address, email address, and phone number. For online data, these might be mobile device IDs or cookies. The column header of an identifier field contains the identifier name (such as “EMAIL1”).
When using name and postal as an identifier, all required name and postal elements must be included in each record: first name, last name, street address, city, state, and ZIP.
Unlike known identifiers, LiveRamp can accept only one type of pseudonymous identifier in a file: cookies, mobile device IDs, custom IDs (CIDs), or RampID s. Also, unlike PII identifiers, there should only be one column of pseudonymous identifiers in the file.
A given field in a file can have no more than 250 distinct values. Most files uploaded to LiveRamp should not contain raw fields unless you have previously coordinated this with your LiveRamp account manager (see "Raw Field" for more information).
Tip
Example:
If the file contains the field "credit_score", that field can only have up to 250 unique credit score values. For example, the possible field/values pairings below would be 3 out of the 250 unique values:
- credit_score=551
- credit_score=552
- credit_score=553
If there are more than 250 distinct values when listing each credit score individually, grouping them is an option, such as:
- credit_score: 500 - 550
- credit_score: 551 - 600
- credit_score: 601 - 650
Columns for segment data fields: Each segment data field holds a category of data, such as “age” or “favorite color. The column header of a segment data field contains the category name.
Tip
Example:
If the file contains the field "credit_score", each unique credit score value must appear at least 25 times for us to process that segment.
For example, the value "551" would need to appear at least 25 times in the "credit_score" field in order for us to process the segment "credit_score=551".
Keep to LiveRamp's file limits and audience limits. Exceeding these product limits can result in slower or delayed ingestion and processing times, data inaccuracies within Connect, and manual intervention required to troubleshoot issues.

Note

New Customers: Let your Implementation Manager know how many distinct file formats you’ll be sending (i.e., formats with different column headers, different identifier data fields, or different audience keys).

For each distinct file format, let your Implementation Manager know if you want to match at individual or household precision level. See "Precision Levels" for more information.

File Formatting Guidelines Highlights (Measurement Enablement Workflow)

Make sure to properly format your files so that we can process them quickly. It is especially important to keep the following key guidelines in mind for files in our Measurement Enablement workflow:

Ensure your file contains identifiers that LiveRamp can recognize. A list of these identifiers can be found in our content on known identifiers and pseudonymous identifiers.
Once you decide on a format for your initial files, the column headers for the identifier columns that we use for matching must stay exactly consistent in subsequent files.
If you’ll be sending files with different identifier types (for example, if you’ll be sending both files with PII and files with mobile device IDs), let your LiveRamp Implementation Manager know. Otherwise, we will assume that all files contain the same type of identifiers.
LiveRamp's file limits.
For files containing PII, do not include unique values (a value that appears in only one row) in a segment data column. If a unique segment value is found in a row in a PII file, LiveRamp will not be able to provide the unique value next to RampIDs to maintain the pseudonymousness of the RampIDs. In certain cases where unique values must be included in the output file, work with your LiveRamp representative to obtain a privacy approval from our team.
Caution
If more than 5% of the total rows in the file would be dropped due to failing the privacy check outlined above, that file will be rejected.

Note

New Customers: Let your Implementation Manager know how many distinct file formats you’ll be sending (formats with different column headers and different identifier data fields).

Formatting Guidelines for All Files

For both column-based files and key-value files, follow the guidelines listed below (in addition to the appropriate guidelines for formatting column-based files and for formatting key-value files).

Use Quotation Marks When the Data Contains a Delimiter

Data containing punctuation characters are at risk of delimiter collision, and thus data bleed when any delimiter (such as a comma, pipe, or semicolon) also appears as part of the data values. This can cause LiveRamp to interpret data in a particular row as belonging to the wrong field.

To avoid this, enclose each value in a column-based file, and each key and value in a key-value file, in quotation marks ("), following the guidelines listed below:

Equals signs (in key-value files) and delimiters (such as commas or pipes) should be outside the quotation marks.
Make sure that all quotation marks are closed so that there is an even number of quotation mark characters per data row.
If using quotation marks, the best practice (but not required) is to enclose all values with quotation marks rather than only those with delimiter collision potential.
Note
It is not necessary to enclose empty or null fields with quotation marks.
If a value contains a quotation mark character, that value should be enclosed in quotation marks and each quotation mark character in the value should be properly escaped by putting a quotation mark character right before it, as shown in the examples below:
- LCD TV,50" becomes "LCD TV,50"""
- "early-bird" special becomes """early-bird"" special"
- 5'8" becomes "5'8"""

Put All Segment Data Related to a Given Identifier on a Single Row

For example, if you have three different segments, put them all on one row, either as three columns or three key-value pairs, rather than listing the same identifier on three rows with one segment per row. The latter approach will make file processing take significantly longer.

Caution

Multi-valued data are an exception. If you have a column where there are multiple values for the same header, each column value should be on a separate row, along with the identifier. For more information, see "Multi-Value Fields".

Do Not Use Placeholders for Empty Values

If a given field entry for a particular row of data has no value, leave it blank. Do not use a placeholder such as "NULL" or "N/A."

Format Multi-Value Fields Correctly

When sending in files with multi-value fields, make sure that those field values are formatted correctly by sending separate rows for each value a consumer has in a particular field.

Note

When sending files with multi-value fields to a particular audience, make sure to include that information when creating a support case before uploading the first file so those fields get ingested correctly.

For example, to make a consumer with a customer ID of "123" a member of the segments "Pets_Owned=Dog", "Pets_Owned=Cat", and "Pets_Owned=Hamster", format a column-based file like the example below:

Customer_ID	Pets_Owned
123	Dog
123	Cat
123	Hamster

For a key-value file, format the file like the example below:

123,Pets_Owned=Dog
123,Pets_Owned=Cat
123,Pets_Owned=Hamster

For more information, see "Multi-Value Fields".

Use UTF-8 Encoding

UTF-8 is in wide adoption and ensures maximum compatibility across different systems. ASCII encoding is accepted as it is included in the UTF-8 standard. When you're saving your file, make sure not to save it as something other than a UTF-8 encoded file type.

Formatting Dates

You can include segment data fields that contain calendar date values (such as date of most recent purchase) in the files you upload.

If you want to use a date field to create derived segments, use one of our preferred date formats: YYYYMM or YYYYMMDD. Do not include forward slashes (/) or hyphens (-).

This creates a numeric field that LiveRamp can map into segment ID codes that you can later modify into segment ranges using LiveRamp’s Segment Creator.

Note

Privacy issues: Keep in mind that LiveRamp will not distribute any data that could easily identify a group of fewer than 25 individuals (or fewer than two individuals for measurement files). Using the YYYYMM format can help avoid this situation.

Formatting Segment List Data

Not commonly used, these are single-field identifier files, typically device-based identifiers such as cookies tied to single PII identifier.

LiveRamp accepts non-rectangular segment list-based data. These are usually files with a single identifier field (typically a device-based identifier, such as a cookie ID, or PII-based data tied only to a single email address or phone number).

The identifier should be the first field of each row:
<identifier1>,seg1,seg2
<identifier2>,seg2
Separate the identifier and each segment with one of the allowed delimiters.
Segments should be unique per row. That is, do not include the same segment multiple times on a single row.

Recommended File Limits

LiveRamp has developed recommended file limits to minimize delays and ensure maximum performance. When preparing your files, make sure to take into account the recommended file limits listed below.

Caution

Depending on your contract, you might be required to adhere to these limits.

Item	Limit
Rows	Minimum of 25 rows per file
Rows	Maximum of 500,000,000 rows per file (100,000,000 for UK data files and 30,000,000 for French data files)
Segment data fields (segment categories, such as "Gender")	Maximum of 500 per file Caution For Measurement Enablement workflow files, the maximum is 400 columns. Note This does not include identifier data fields.
Distinct field/value pairs for segment data (such as "Gender=Female")	Maximum of 5,000 total per file
Character limits for segment names (which are a combination of the field label and field value label, for example, "FavoriteSport=Baseball")	Maximum 100 characters Note 100 characters is the limit for a segment that might be sent to Facebook. If a segment will not be sent to Facebook, the character limit is 150.
Character limits for each line or row	Maximum of 10,000 characters per line or row (1,000 for EU data files)
Files for Measurement Enablement workflows	Column-based files only (no key-value files)
File size (uncompressed)	Maximum 50 GB

Note

If a file is larger than 50 GB (uncompressed), split the file into smaller files. For more information, see “Splitting Large Files Before Uploading”.
When creating files for upload to a particular , make sure to take into account the recommended limits as well as all recommendations on file creation and formatting.

Allowed File Types

The file types listed below can be accepted, as long as they use one of the allowed file delimiters and meet all other file formatting requirements:

Comma-separated values files (.csv)
Tab-separated values files (.tsv)
Pipe (|)-separated values files (.psv)
Semicolon-separated values files (.scsv)
Text files (.txt)

When creating files for upload, remember that files must be provided in a flat, text-based format. As a general rule of thumb, if the file contents cannot be previewed within a command-line terminal or simple text editor (like WordPad or TextEdit), we cannot accept it.

Files can be column-based with delimiters or in key-value format (except for EU data files, which can only be sent as column-based files). Files can also be compressed, which will result in a different file extension.

Caution

No Excel or Word files: Non-text-based file types, such as Microsoft Excel (.xls or .xlsx) or Word (.doc or .docx) files, cannot be accepted.
For Mac Excel files: If exporting data from Microsoft Excel for Mac, choose the "Windows Comma Separated (.csv)" option (do not use the MS-DOS or Macintosh CSV versions, or any non-UTF-8 option). Make sure that extra columns haven't been added during the export process before uploading.
EU data cannot be sent in TXT files. See "Creating Data Files for EMEA Countries" for complete information on formatting EU data files.

Allowed File Delimiters

Use one of the delimiters listed below to separate values when creating a file for uploading:

Commas (comma-separated values files, or .csv files)
Tabs (tab-separated values files, or .tsv files)
Pipes (|) (pipe-separated values files, or .psv files)
Semicolons (semicolon-separated values files, or .scsv files)

Make sure to follow all other guidelines for allowed file types and for formatting file data.

Note

Enclose values in quotation marks when the data contains a delimiter: Data containing punctuation characters is at risk of delimiter collision , and thus data bleed, when any delimiter (such as a comma, pipe, or semicolon) also appears as part of the data values. This can cause LiveRamp to interpret data in a particular row as belonging to the wrong field. To prevent this, enclose values in quotation marks ("). See "Formatting Guidelines for All Files" for more information.

Avoiding File Ingestion Failure

There are many possible considerations to keep in mind when creating files for upload, but the cause of file ingestion failure or delay often falls into one of the categories listed below.

To be able to automatically ingest files that you upload for processing (which speeds up and streamlines the ingestion process for your files), we create an automation configuration for that particular audience. Often we will use a file from another of your audiences, or the first file you upload to your new audience, as a "seed" or "template" to create this configuration.

Once this configuration has been enabled, it's very important that subsequent files that are uploaded for that audience stay consistent with the "seed" or "template" configuration. If you upload a file that is not consistent (such as a file with different headers or a different audience key), the file will be paused in the ingestion process until the file is manually mapped to the audience.

For more information on the automation configuration, see "The Ingestion Automation Process for File Uploads".

Note

If you need to upload a file that does not match the initial automation configuration, create a support case so we can make sure the file is ingested correctly.

To avoid ingestion issues, be sure to keep the same file formatting as the original seed file, especially including the following:

The same identifier fields (including having the same audience key field(s) as the original file
The same field names (headers in a column-based file and keys in a key-value file) for identifier fields and for any previously-included segment data fields as the original file

Note

Files uploaded for a particular are put into a queue for that , so a delay in the processing of a file containing issues will delay all subsequent files that have been uploaded to that .
If you upload a file with new segment data fields that weren't in the original file, those fields should upload without an issue. However, you might want to monitor the file upload on the Files page to make sure there are no problems. If the processing pauses or fails, use the appropriate Troubleshooting File Upload Issue quick case to have the Support team investigate the issue.

For examples of properly formatted files (including downloadable files that can be used as templates), as well as files containing common errors, see "File Formatting Examples."

Header Issues

Header issues are the most common cause of file upload failure. The most common header issues are:

A header for an identifier column (including an identifier column that is being used as the audience key) that was in the original file uploaded for that audience is missing, or is named differently (including being capitalized differently).
Caution
- When including segment data columns that were in previously-uploaded files, make sure that the headers for those columns also stay consistent. If a header for a segment data column changes, a new field will be created instead of having the data from that column update the existing field.
- If you are using the "full refresh" option to update the audience, keep in mind that any previously-uploaded segment data columns need to be included in the file if you want to keep that data in the associated fields. Otherwise, if a previously-uploaded segment data column is not included, the associated field will have all the members removed. See "Ways to Update an Existing Audience" for more information.
The file contains two or more headers that are exactly the same (sometimes extra headers and columns are added when saving from Excel).

Make sure that:

Every column-based file you upload has a header row.
Make sure that the headers are unique and that headers for identifier columns (including identifier columns that are being used as the audience key) and headers for any previously-included segment data columns match the headers in the original file uploaded for that audience.

See "Formatting Column-Based Files" for more information.

Audience Key Issues

Keep in mind that every file uploaded to a particular audience in an Activation workflow must use the same audience key as the original file uploaded for that audience so that we can effectively deduplicate and consolidate all rows within that audience.

The most common audience key issues are:

The audience key is different from the one used for the first file uploaded for that audience.
An identifier that was used originally as the audience key is not included in the file, such as “city” when the audience key is name and postal address.
The header for an audience key column is different from the header used in the original file.
The audience key has a low fill rate. For best results, and to avoid having ingestion processing pause or fail, make sure the audience key has a fill rate of as close to 100% as possible.

Identifier Issues

An identifier is formatted incorrectly, such as including both the first and last name in the same field instead of having separate “first name” and “last name” fields.

See "Formatting Identifiers" for more information.

Delimiter Issues

There is an extra delimiter (such as a comma) in a row, making it appear as though the row has more fields in it than it should. When this happens, our system gets confused while trying to parse your file. Watch out for this, especially when creating CSV files from Excel files.
The data contains the delimiter but is not enclosed in quotation marks. Data containing punctuation characters are at risk of delimiter collision and thus data bleed, where the specified delimiter (such as a comma) also appears as part of the data values. This can cause LiveRamp to interpret data in a particular row as belonging to the wrong field. See "Formatting Guidelines for All Files" for more information.

Raw Field Issues

A "raw field" is a field in a given file that has over 250 distinct values. Most files uploaded to LiveRamp should not contain raw fields unless you have previously coordinated this with your LiveRamp account manager.

The values in raw fields cannot be separated out and managed individually in Connect. Many platforms also cannot accept raw fields.

For more information, see "Raw Field".

New Identifier Field Issues

After the upload of the first file in an audience, subsequent files need to use the same file formatting. All originally-included fields should be included with the exact same headers. If you include an additional identifier field (such as a new email address column in a column-based file) without having LiveRamp first adjust the mapping to include that new field, the file will not process until that mapping has been adjusted.

To avoid delays or processing failures, if you need to add a new identifier field to a subsequent file, first create a support case so that we can adjust the mapping before you upload that file.

Multi-Value Field Issues

The default format for fields in a file for ingestion (columns in a column-based file) is for each field to be a "single-value" field. A single-value field is a field where each consumer record can only be a member of one segment in that field. For example, a file might have a column for the field "Favorite_Pet", where each consumer record can only have one value associated with them (such as "Favorite_Pet=Dog" or "Favorite_Pet=Cat", but not both).

But for some fields you might want to allow each consumer record to have multiple values for that field. For example, you might want to create a field in Connect called "Pets_Owned" where each consumer record might be a member of multiple segments, such as "Pets_Owned=Dog", "Pets_Owned=Cat", and "Pets_Owned=Hamster". This type of field is called a "multi-value field".

For consumers that have multiple values for a particular multi-value field, make sure to create separate rows for each value. For more information on formatting multi-value fields, see "Formatting Guidelines for All Files".

Before you upload the first file to a new audience where the file will contain one or more multi-value fields, use the Set Up Audience for First File Upload (Activation) quick case to create a support case to make sure the audience is configured correctly to process those multi-value fields.

Once the ingestion automation process has been set up for the initial file, make sure to create another support case if you change any of the single-value fields to multi-value fields so that the file ingests properly.

For more information on multi-value fields, see "Multi-Value Fields".

In this section:

multi-value fieldsrectangularpseudonymous identifiersAAIDsIDFAsAAIDsIDFAsAAIDsIDFAsrectangular

Formatting File Data

Note

Note

File Formatting Guidelines Highlights (Activation Workflow)

Caution

Tip

Tip

Note

File Formatting Guidelines Highlights (Measurement Enablement Workflow)

Caution

Note

Formatting Guidelines for All Files

Use Quotation Marks When the Data Contains a Delimiter

Note

Put All Segment Data Related to a Given Identifier on a Single Row

Caution

Do Not Use Placeholders for Empty Values

Format Multi-Value Fields Correctly

Note

Use UTF-8 Encoding

Formatting Dates

Note

Formatting Segment List Data

Recommended File Limits

Caution

Caution

Note

Note

Note

Allowed File Types

Caution

Allowed File Delimiters

Note

Avoiding File Ingestion Failure

Note

Note

Header Issues

Caution

Audience Key Issues

Identifier Issues

Delimiter Issues

Raw Field Issues

New Identifier Field Issues

Multi-Value Field Issues

Search results