Jump to content

De-anonymization: Difference between revisions

From Consumer Rights Wiki
How data is anonymized: No need for that!
How data is anonymized: +part suggested by Jerry468
Line 7: Line 7:


==How data is anonymized==
==How data is anonymized==
Anonymization, in practice, also involves around collecting user data that is said to be "aggregated/de-identified basis" which involves the usage of [[wikipedia:K-anonymity|k-anonymity]]. There are also forms of data collection that also used in different methods such as [[wikipedia:T-closeness|''t''-closeness]], [[wikipedia:L-diversity|''l''-diversity]], and [[wikipedia:Differential_privacy|differential privacy]], however there are other forms of data collection that is also used, which have yet to be disclosed to the customers.
Before de-anonymization happens, it needs to be anonymized. Anonymization, in practice, also involves around collecting user data that is said to be "aggregated/de-identified basis" which involves the usage of [[wikipedia:K-anonymity|k-anonymity]]. There are also forms of data collection that also used in different methods such as [[wikipedia:T-closeness|''t''-closeness]], [[wikipedia:L-diversity|''l''-diversity]], and [[wikipedia:Differential_privacy|differential privacy]], however there are other forms of data collection that is also used, which have yet to be disclosed to the customers.


==Why it is a problem==
==Why it is a problem==

Revision as of 15:43, 22 October 2025

Article Status Notice: This Article is a stub


This article is underdeveloped, and needs additional work to meet the wiki's Content Guidelines and be in line with our Mission Statement for comprehensive coverage of consumer protection issues. Learn more ▼

De-anonymization is the process or final state of revealing the true identity of an anonymous or pseudonymous person. All data linked to the anonymous or pseudonymous entity can then be connected to the true identity.

How it works

The core of de-anonymization involves making inferences to connect different types of obfuscated data, sometimes even across platforms.

How data is anonymized

Before de-anonymization happens, it needs to be anonymized. Anonymization, in practice, also involves around collecting user data that is said to be "aggregated/de-identified basis" which involves the usage of k-anonymity. There are also forms of data collection that also used in different methods such as t-closeness, l-diversity, and differential privacy, however there are other forms of data collection that is also used, which have yet to be disclosed to the customers.

Why it is a problem

Many privacy policies describe the disclosure of anonymized data to third parties in an effort to "limit unwarranted data collection". However, de-anonymization circumvents these privacy measures, allowing these third parties to engage in practices such as data sales or targeted advertising as normal. This is however, an issue when it comes to privacy, as an adversary (e.g telemarketer) will be able to conduct an research on those records in order to attempt to reveal the data that is aggregated.[1]

Examples

[1]

  1. Narayanan & Shmatikov, Arvind & Vitaly (November 11, 2006). How To Break Anonymity of the Netflix Prize Dataset. United States, Taxes, Austin.: The University of Texas at Austin.