A Simple Method for Limiting Disclosure in Continuous Microdata Based on Principal Component Analysis

Aida Calviño

Open Access

Published Online: Feb 21, 2017

Journal of Official Statistics

Volume 33 (2017): Issue 1 (March 2017)

Page range: 15 - 41

Received: Sep 01, 2015

Accepted: Aug 01, 2016

DOI: https://doi.org/10.1515/jos-2017-0002

Keywords
Statistical disclosure control, microdata protection, hybrid microdata, masking method, propensity score

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

In this article we propose a simple and versatile method for limiting disclosure in continuous microdata based on Principal Component Analysis (PCA). Instead of perturbing the original variables, we propose altering the principal components: they contain the same information but are uncorrelated, so each component can be processed separately, which reduces processing time. The number and weight of the perturbed components determine the level of protection and distortion of the masked data. The method preserves the mean vector and the variance-covariance matrix. Furthermore, depending on the technique chosen to perturb the principal components, the proposed method can produce masked, hybrid, or fully synthetic data sets. Examples of application and comparisons with other methods previously proposed in the literature (in terms of disclosure risk and data utility) are also included.
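The idea of perturbing principal components while keeping the mean vector and covariance matrix intact can be illustrated with a minimal sketch. The abstract does not specify the perturbation technique, so everything below is an assumption made for illustration and is not the article's algorithm: the function name `pca_mask`, the choice to replace the selected component scores with random draws, and the step that orthogonalizes and rescales those draws so the sample moments are reproduced exactly.

```python
import numpy as np

def pca_mask(X, perturb_idx, rng):
    """Hypothetical sketch: replace the scores of selected principal
    components with synthetic ones, keeping the sample mean vector and
    covariance matrix of X unchanged. Assumes more rows than columns + 1."""
    mean = X.mean(axis=0)
    Xc = X - mean

    # Principal directions from the sample covariance matrix,
    # sorted by decreasing eigenvalue (component "weight").
    eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    eigvecs = eigvecs[:, np.argsort(eigvals)[::-1]]

    # Component scores: zero-mean and mutually uncorrelated columns.
    scores = Xc @ eigvecs
    new_scores = scores.copy()
    n = X.shape[0]

    for j in perturb_idx:
        # Assumed perturbation: a fresh random draw for component j.
        z = rng.standard_normal(n)
        # Remove any part of z explained by a constant or by the other
        # components, so z is zero-mean and uncorrelated with them.
        others = np.column_stack([np.ones(n), np.delete(new_scores, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, z, rcond=None)
        z = z - others @ coef
        # Rescale so component j keeps its original variance.
        z *= np.linalg.norm(scores[:, j]) / np.linalg.norm(z)
        new_scores[:, j] = z

    # Map the perturbed scores back to the original variable space.
    return new_scores @ eigvecs.T + mean

# Toy data: 200 records, 4 correlated continuous variables.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 4)) @ rng.standard_normal((4, 4))

# Perturb the two components with the largest eigenvalues.
masked = pca_mask(X, perturb_idx=[0, 1], rng=rng)
```

In this sketch, perturbing more components, or components with larger eigenvalues, moves the masked records further from the originals, while the mean vector and covariance matrix of `masked` match those of `X` up to floating-point error.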

eISSN: 2001-7367
Language: English

Publication timeframe: 4 times per year
Journal Subjects: Mathematics, Probability and Statistics

© by Aida Calviño
