k-anonymous microaggregation with preservation of statistical dependence

Bibliographic Details
Authors: Rebollo Monedero, David (ORCID 0000-0002-0783-2382); Forné Muñoz, Jorge (ORCID 0000-0002-8401-3292); Soriano Ibáñez, Miguel (ORCID 0000-0003-0457-8531)
Resource type: article
Publication date: 2016
Country: Spain
Institution: Universitat Politècnica de Catalunya (UPC)
Repository: UPCommons. Portal del coneixement obert de la UPC
Language: English
OAI Identifier: oai:upcommons.upc.edu:2117/92693
Online access: https://hdl.handle.net/2117/92693
https://dx.doi.org/10.1016/j.ins.2016.01.012
Access level: open access
Keywords: Computer security
k-Anonymity
microaggregation
statistical disclosure control
statistical dependence
predictability
Computer security (Seguretat informàtica)
Electronic data interchange
UPC subject areas::Computer science
UPC subject areas::Mathematics and statistics
Description
Summary: k-Anonymous microaggregation emerges as an essential building block in statistical disclosure control, a field concerning the postprocessing of the demographic portion of surveys containing sensitive information, in order to safeguard the anonymity of the respondents. Traditionally, this form of microaggregation has been formulated to characterize both the privacy attained and the inherent information loss due to the aggregation of quasi-identifiers, which may otherwise be exploited to reidentify the individuals to whom the records in a published database refer. Because the ultimate purpose of such databases involves the analysis of the statistical dependence between demographic attributes and sensitive data, we must articulate mechanisms to enable the preservation of the statistical dependence between quasi-identifiers and confidential attributes, beyond the mere degradation of the quasi-identifiers alone. This work addresses the problem of k-anonymous microaggregation with preservation of statistical dependence in a formal, systematic manner, modeling statistical dependence as predictability of the confidential attributes from the perturbed quasi-identifiers. We proceed by introducing a second mean squared error term in a combined Lagrangian cost that enables us to regulate the trade-off between quasi-identifier distortion and confidential-attribute predictability. A Lagrangian multiplier enables us to gracefully weigh the importance of each of the two competing objectives.
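
Illustrative note: the summary does not give the exact form of the combined cost, so the following Python sketch is only one plausible reading, not the authors' formulation. It assumes that quasi-identifier distortion is the mean squared error between each record and its cell centroid, that confidential-attribute predictability is measured by the mean squared error of predicting each confidential value with its cell average, and that the two terms are combined as `d_x + lam * d_y`. The function name `combined_cost` and all parameter names are hypothetical.

```python
from statistics import fmean


def combined_cost(x, y, labels, lam, k):
    """Evaluate an assumed Lagrangian cost d_x + lam * d_y for a fixed partition.

    x      : list of quasi-identifier tuples (one per record)
    y      : list of numeric confidential values (one per record)
    labels : cell index assigned to each record
    lam    : nonnegative multiplier weighing the second MSE term (assumed form)
    k      : minimum cell size required for k-anonymity
    Returns (d_x, d_y, cost).
    """
    # Group record indices by cell.
    cells = {}
    for i, c in enumerate(labels):
        cells.setdefault(c, []).append(i)

    # Every cell must aggregate at least k records.
    if any(len(idx) < k for idx in cells.values()):
        raise ValueError("partition violates k-anonymity")

    n = len(x)
    sq_x = 0.0  # squared error of quasi-identifiers vs. cell centroid
    sq_y = 0.0  # squared error of confidential values vs. cell average
    for idx in cells.values():
        dim = len(x[idx[0]])
        centroid = [fmean([x[i][j] for i in idx]) for j in range(dim)]
        y_hat = fmean([y[i] for i in idx])
        for i in idx:
            sq_x += sum((x[i][j] - centroid[j]) ** 2 for j in range(dim))
            sq_y += (y[i] - y_hat) ** 2

    d_x = sq_x / n
    d_y = sq_y / n
    return d_x, d_y, d_x + lam * d_y


# Toy data: one quasi-identifier, one confidential attribute, k = 2.
x = [(0.0,), (1.0,), (10.0,), (11.0,)]
y = [0.0, 0.0, 1.0, 1.0]
near = combined_cost(x, y, [0, 0, 1, 1], lam=2.0, k=2)   # groups similar records
mixed = combined_cost(x, y, [0, 1, 0, 1], lam=2.0, k=2)  # groups dissimilar records
```

Under these assumptions, raising `lam` penalizes partitions whose cells mix very different confidential values, while `lam = 0` reduces the cost to quasi-identifier distortion alone. The sketch only scores a given partition; it does not search for one.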