Skip to content
 Welcome back Badgers!
New Student Orientation Info →

Data De-Identification

Data Analytics and AI

The ability to analyze data effectively has become more powerful than ever before. AI-driven data analytics can unlock insights that drive innovation and improve outcomes. However, this data still needs to be protected, especially when dealing with sensitive information that could compromise individual privacy. Balancing the promise of AI with the need to protect privacy is a critical challenge. This is where data de-identification becomes essential—allowing organizations to leverage AI's potential while ensuring personal data remains secure and anonymous.

De-Identifying Data

AI services have great capabilities that can increase the quality and efficiency of data analytics. However, data analytics often deals with sensitive data that should not be shared without an appropriate privacy agreement in place. Because of this concern, data can be de-identified so that private data is not exposed with an AI service while still leveraging the strengths that the AI service has.

General Information De-Identification Checklist Methods For De-Identification Contact Us

Personal Data

Personal data is any information related to an identifiable person. Personal data includes a wide variety of direct identifiers, as well as indirect identifiers.

Direct Identifiers vs Indirect Identifiers

Direct identifiers are anything that can directly identify an individual (full name, social security number, etc). Indirect identifiers are identifiers that do not identify an individual on their own, but can be used alongside other identifiers to identify an individual (race, ethnicity, age, zip code, birthday, etc). Both direct and indirect identifiers should not be present in data that is being shared with an AI service.

De-Identified Data vs Anonymized Data

Within the privacy world, it is important to note the difference between de-identified and anonymized data. De-identified data is NOT the same as anonymized data, and should never be displayed or marketed as such. Anonymized data can only be claimed as such when the entirety of the data across an organization does not have any direct or indirect identifiers that could lead to the identity of any person.

Types of Analyses

Once data has been de-identified, there are many types of analyses that can be performed in order to draw conclusions from the data. Common types of analyses used across different career fields are listed below:

  • Market Basket Analysis

    Determining which products users frequently purchase together.
  • Market Trend Analysis

    Identifying trends in a given market to anticipate demand and adapt strategies.
  • Campaign Performance Analysis

    Evaluating success of marketing campaigns in terms of conversions, engagement, or recruitments.
  • Sentiment Analysis

    Analyzing customer or employee feedback, reviews, or engagement to understand how they feel about the brand, product, or service.
  • Cost-Benefit Analysis

    Comparing costs to the benefit of an action or decision.
  • Sales or Usage Forecasting

    Predicting future sales or usage based on historical data and market conditions.
  • Process Efficiency Analysis

    Examining workflows to identify bottlenecks or areas of improvement.
  • Survey Analysis

    Drawing insights from customer preferences in order to direct areas of improvement or focus.