Identity Engine

LiveRamp Enterprise Identity Engine is LiveRamp's first-party identity resolution tool, which uses your first-party data to develop and maintain customer identifiers that represent individual people. To continuously build your first-party graph, you can configure a customer data and identity resolution workflow, which is then run using automated Spark and BigQuery jobs on a specified schedule.

To access Identity Engine, you must have one of the following roles:

Super User - Admin: Access to all functionalities in Identity Engine, including workflow creation and management.
Generalist Viewer: View workflows and their details.

To learn more about roles, see Roles and Permissions.

Each identity resolution workflow that you configure has the following stages:

Data preparation: A data source is validated, transformed, and enriched to maximize identity resolution outcomes. This can include removing null or invalid values, normalizing, applying data hygiene, mapping to a common taxonomy, and enriching with deterministic signal and metadata from the LiveRamp Known Identity Graph (AKA "AbiliTec Graph"). For information, see "Configure a Data Preparation Process".
Resolution: Records from source files are grouped into clusters representing a person, and then an Enterprise ID is associated with this cluster. The resolution process involves pairwise matching in which pairs of records are generated and compared using specified rules. These rules are reapplied each time the graph is run to evaluate whether pairs should remain tied to a persistent Enterprise ID or split or merged into a new ID cluster. The outcome of the resolution process will be a customer-specific first-party graph. For information, see "Configure a Resolution Process".
Export: Elements of the first-party graph are extracted into a file for consumption by downstream clients of the identity graph. You can configure different exports in the export process using SQL or a column selector to create different views of the output for systems that have distinct requirements or use cases. For information, see "Configure an Export Process".

Each workflow is grouped into processes that appear in the UI as boxes that can be visually tied together. You can configure each stage in the workflow and specify arguments to be run by the corresponding Spark jobs, which are run on your organization's dedicated BigQuery and Dataproc resources within LiveRamp. Alternatively, you can implement a remote execution strategy to run these processes within your organization's Google Cloud Platform.

Identity Engine FAQs

Q. Is "Identity Engine" the same product as "Portrait Engine"?

A. Yes, Identity Engine is LiveRamp's new product name for Portrait Engine.

Q. Is "Enterprise ID" the same identifier as "Portrait ID"?

A. Yes, "Enterprise ID" is our new name for the identifier formerly known as "Portrait ID." However, the data structure still refers to "PortraitID," so you will have to reference this legacy name when writing SQL queries for Export processes.

Q. What is an Enterprise ID?

A. An enterprise identifier is a customer-specific identifier. The Enterprise ID is the view that LiveRamp clients apply to the data that they receive and organize through LiveRamp, around which that data is organized and maintained.

LiveRamp Enterprise Identity Engine generates and maintains Enterprise IDs, which represent a person, household, business, or some other entity that is relevant to your organization. Enterprise IDs are matched and deconflicted against RampIDs. Enterprise IDs enable you to organize and maintain your data within LiveRamp and provide a primary key when acquiring third-party attributes for your first-party data and matching impressions for measurement.

Q. What data should be sent to Identity Engine?

A. You should send any data that you would like to use for resolving records into an Enterprise ID. This typically means available PII such as Name, Address, Email, and Phone. It can also include other identifying information such as year of birth, name suffixes, gender, a middle initial, and so on.

Q. What do the match confidence levels mean?

A. Match confidence is a match metadata parameter that provides information about which data types yield the best results. The lower the match level, the higher the confidence. For example, if your input data includes Name + Address + ZIP, it has a match confidence of 1. If it only includes Email, it has a confidence level of 6.

For information, see "Match Metadata" and "Match Cascade"

Q. What match resolution logic is available?

A. Identity Engine resolution processes support the following match methods:

Logic	Description
First N Char	Match on the first N characters of the field. For example, "Robert" and "Rupert" match on the first 1 characters.
Fuzzy Score	Use fuzzy matching on a similarity score between strings and consider various transformations and substitutions. For more information, see "approximate string matching".
Jaccard	Calculate the similarity of two strings based on the number of common characters divided by the total number of unique characters. The closer to 1, the more similar the two strings. For example, "Robert" and "Rupert" yield a Jaccard similarity of 0.5. For more information, see "Jaccard index".
Jaro	Measure the similarity between two strings on a scale from 0 (no match) to 1 (identical), with an emphasis on matching prefixes and adjusting for common typos or variations. For example, "Robert" and "Rupert" yield a Jaro similarity of 0.778. For more information, see "Jaro–Winkler distance".
Levenshtein	Calculate the minimum number of single-character edits (insertions, deletions, or substitutions) needed to change one word to another. For example, "Robert" and "Rupert" yield a Levenshtein distance of 2 (substitute "u" for "o" and "p" for "b"). For more information, see "Levenshtein distance".
Soundex	Index names by sound, retain the first letter of the name and calculate a 3-digit numerical code for the remaining letters. For example, "Robert" and "Rupert" yield a Soundex code of R163. For more information, see "Soundex".
Strict	Requires an exact match.

Note

Each rule must have at least one strict match condition.

Q. What are some example rules?

A. You can specify a wide range of resolution processing rules such as the following examples:

Sample Rule	Description
ConsumerLink (AbiliTec)	ConsumerLink matches with match confidence from AbiliTec. For information, see "Using ConsumerLinks" and "Match Metadata".
Email + Name	Uses the email address and the full name to create matches.
Fuzzy First Name + Strict Last Name + Email	Use fuzzy matching of the First Name, an exact match of the Last Name, and a normalized email match.
Fuzzy First Name + Strict Last Name + Phone	Use fuzzy matching of the First Name, an exact match of the Last Name, and match one of the provided phone numbers (home, mobile, or work).
Strict Name + AddressLink	Require an exact match of the First Name and Last Name, and the AddressLink for the physical address. For more information about AddressLink, see "AbiliTec Link Types".

Q. How does the besting logic work?

A. When specifying resolution rules, you can select the Determine best available attributes based on LiveRamp Graph option to have Identity Engine rank available PII or AbiliTec link information per Enterprise ID and find 1 best output for each touchpoint. For more information, see "Configure a Resolution Process" and "Determining Best Available Attributes".

Q. What normalization does Identity Engine apply during data preparation?

A. Identity Engine performs the following operations to validate and normalize your data:

Category	Operation	Description
Address	Normalization	Change all lowercase letters to uppercase. Remove diacritic characters. Replace nulls. Trim leading and trailing spaces.
Email	Extract isotopes	For email addresses with a Google email domain, extract a list of Google isotopes: Remove characters after a + character and before the @ character. For example, john+doe@gmail.com → john@gmail.com Remove periods in the email prefix. For example, john.doe@gmail.com → johndoe@gmail.com Gmail accounts can use `@gmail` and `@googlemail`, so Identity Engine will check both email domains when provided with one. For example, if john.doe@gmail.com is provided, john.doe@googlemail.com is also checked.
	Normalization	Change all uppercase letters to lowercase. Remove diacritic characters. Replace nulls. Trim leading and trailing spaces.
	Suspected	Check for email addresses containing certain keywords, such as "no-reply" or "info".
	Valid email	Validate email using the Apache EmailValidator library. For more information, see "EmailValidator".
People	Extract gender	Extract gender from names if not supplied.
	Extract nicknames	Extract nicknames from the first name.
	Extract suffix	Extract suffixes from names if not supplied.
	Extract YOB	Extract year of birth from date of birth.
	Normalization	Change all lowercase letters to uppercase. Remove diacritic characters. Replace nulls. Trim leading and trailing spaces.
	Valid DOB	Invalidate DOB if greater than 100 years old.
Phone	Extract internal number	Extract phone number in internal number format.
	Extract international number	Extract phone number in international number format.
	Extract phone type	Extract the type of phone (mobile, landline, other).
	Normalization	Regex to remove non-numeric characters and "+". Phone numbers should be normalized to a 10-digit string of numbers only, so dashes or a leading plus character are removed. If the number starts with a 0 or a 1, it can exceed 10 digits. Trim leading and trailing spaces.
	Valid phone	Validate phone numbers using Google's libphonenumber library. For more information, see "libphonenumber".

Q. How do I get technical support?

A. Create a "LiveRamp Data Hub" case in the LiveRamp Community portal or contact your LiveRamp representative.

In this section:

Identity Engine

Identity Engine FAQs

Note

Search results