A Contamination Model for Selective Editing

Marco Di Zio; Ugo Guarnera

Open Access

A Contamination Model for Selective Editing

Marco Di Zio

and

Ugo Guarnera

| Nov 09, 2013

Journal of Official Statistics

Volume 29 (2013): Issue 4 (December 2013)

About this article

Cite

Page range: 539 - 555

DOI: https://doi.org/10.2478/jos-2013-0039

Keywords
Statistical data editing, influential errors, finite mixture models, score function

This content is open access.

The aim of selective editing is to identify observations affected by influential errors. A score function based on the impact of the potential error on target estimates is useful to prioritize observations for accurate reviewing. We assume a Gaussian model for true data and an “intermittent” error mechanism such that a proportion of data is contaminated by an additive Gaussian error. In this setting, scores can be related to the expected value of errors affecting data. Consequently, a set of units can be selected such that the expected residual error in data is below a prefixed threshold. In the context of economic surveys when positive variables are analyzed, the method is more realistically applied to logarithms of data instead of data in their original scale. The method is illustrated through an experimental study on real business survey data where contamination is simulated according to error mechanisms frequently encountered in the practical context of economic surveys.

eISSN:: 2001-7367
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Mathematics, Probability and Statistics

Journal RSS Feed

A Contamination Model for Selective Editing

Published Online: Nov 09, 2013

Page range: 539 - 555

DOI: https://doi.org/10.2478/jos-2013-0039

KeywordsStatistical data editing, influential errors, finite mixture models, score function

This content is open access.

Keywords
Statistical data editing, influential errors, finite mixture models, score function