Accuracy of Mixed-Source Statistics as Affected by Classification Errors

Publications in official statistics are increasingly based on a combination of sources. Although combining data sources may result in nearly complete coverage of the target population, the outcomes are not error-free. Estimating the effect of nonsampling errors on the accuracy of mixed-source statistics is crucial for decision making, but it is not straightforward. Here we simulate the effect of classification errors on the accuracy of turnover-level estimates in car-trade industries. We combine an audit sample, the dynamics in the business register, and expert knowledge to estimate a transition matrix of classification-error probabilities. Bias and variance of the turnover estimates caused by classification errors are estimated by a bootstrap resampling approach. In addition, we study the extent to which manual selective editing at the micro level can improve the accuracy. Our analyses reveal which industries do not meet preset quality criteria. Surprisingly, more selective editing can result in less accurate estimates for specific industries, and a fixed allocation of editing effort over industries is more effective than an allocation in proportion to the accuracy and population size of each industry. We discuss how to develop a practical method that can be implemented in production to estimate the accuracy of register-based estimates.
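
The bootstrap idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy data, and the choice to treat the observed industry codes as if they were the true codes are all assumptions made for illustration. Each row `P[i]` of the hypothetical transition matrix holds the probabilities that a unit whose true industry is `i` is recorded under each industry code.

```python
import random
from statistics import mean, variance


def industry_totals(codes, turnover, n_industries):
    """Sum turnover per industry code."""
    totals = [0.0] * n_industries
    for code, value in zip(codes, turnover):
        totals[code] += value
    return totals


def bootstrap_classification_error(codes, turnover, P, n_rep=1000, seed=0):
    """Estimate bias and variance of industry turnover totals caused by
    classification errors (illustrative sketch, hypothetical names).

    Assumption: the observed codes are treated as the true codes, and
    new 'observed' codes are drawn from the rows of P in each replicate.
    """
    rng = random.Random(seed)
    k = len(P)
    industries = list(range(k))
    base = industry_totals(codes, turnover, k)

    replicates = []
    for _ in range(n_rep):
        # Redraw an industry code for every unit according to its row of P.
        drawn = [rng.choices(industries, weights=P[c])[0] for c in codes]
        replicates.append(industry_totals(drawn, turnover, k))

    # Bias: mean of the replicate totals minus the original total.
    bias = [mean(rep[j] for rep in replicates) - base[j] for j in range(k)]
    # Variance: sample variance of the replicate totals per industry.
    var = [variance([rep[j] for rep in replicates]) for j in range(k)]
    return base, bias, var


if __name__ == "__main__":
    codes = [0, 0, 1, 1, 2]                      # toy industry codes
    turnover = [100.0, 250.0, 80.0, 40.0, 500.0]  # toy turnover values
    P = [[0.8, 0.1, 0.1],
         [0.1, 0.8, 0.1],
         [0.1, 0.1, 0.8]]
    base, bias, var = bootstrap_classification_error(codes, turnover, P, n_rep=500)
    for j in range(len(P)):
        print(j, base[j], round(bias[j], 1), round(var[j], 1))
```

With an identity transition matrix (no classification errors) the sketch returns zero bias and zero variance for every industry, which is a convenient sanity check before plugging in estimated error probabilities.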

eISSN: 2001-7367
Language: English

Publication frequency: 4 times per year
Journal subjects: Mathematics, Probability and Statistics


Published: 23 Sep 2016

Pages: 619-642

Received: 01 Oct 2015

Accepted: 01 May 2016

DOI: https://doi.org/10.1515/jos-2016-0032

Keywords: Accuracy, editing, administrative data, short-term business statistics, bootstrap resampling

© 2016 Arnout van Delden et al., published by De Gruyter Open

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
