Adding measurement error to location data to protect subject confidentiality while allowing for consistent estimation of exposure effects
Mahesh Karra, David Canning and Ryoko Sato
Journal of the Royal Statistical Society Series C, 2020, vol. 69, issue 5, 1251-1268
Abstract:
In public use data sets, it is desirable not to report a respondent's location precisely to protect subject confidentiality. However, the direct use of perturbed location data to construct explanatory exposure variables for regression models will generally make naive estimates of all parameters biased and inconsistent. We propose an approach where a perturbation vector, consisting of a random distance at a random angle, is added to a respondent's reported geographic co‐ordinates. We show that, as long as the distribution of the perturbation is public and there is an underlying prior population density map, external researchers can construct unbiased and consistent estimates of location‐dependent exposure effects by using numerical integration techniques over all possible actual locations, although coefficient confidence intervals are wider than if the true location data were known. We examine our method by using a Monte Carlo simulation exercise and apply it to a real world example using data on perceived and actual distance to a health facility in Tanzania.
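The two steps described in the abstract, perturbing a location and then integrating over all possible actual locations, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the perturbation distribution or the estimator, so the sketch assumes a displacement that is uniform over a disk of radius `max_dist`, a prior density given on a regular grid, and an exposure defined as straight-line distance to a facility. The names `perturb`, `expected_exposure`, `max_dist`, `grid_xy` and `prior` are hypothetical.

```python
import numpy as np

def perturb(xy, max_dist, rng):
    """Displace a point by a random distance at a random angle.

    Assumption: the displacement is uniform over a disk of radius max_dist.
    """
    angle = rng.uniform(0.0, 2.0 * np.pi)
    # The square root makes the displaced point uniform over the disk's area.
    dist = max_dist * np.sqrt(rng.uniform())
    offset = dist * np.array([np.cos(angle), np.sin(angle)])
    return np.asarray(xy, dtype=float) + offset

def expected_exposure(reported_xy, grid_xy, prior, max_dist, exposure_fn):
    """Posterior mean of the exposure given the reported (perturbed) location.

    Numerical integration over candidate true locations on a grid: each grid
    point is weighted by prior density times perturbation likelihood. Under
    the uniform-disk assumption the likelihood is constant within max_dist
    of the reported point and zero elsewhere.
    """
    grid_xy = np.asarray(grid_xy, dtype=float)
    d = np.linalg.norm(grid_xy - np.asarray(reported_xy, dtype=float), axis=1)
    weights = np.asarray(prior, dtype=float) * (d <= max_dist)
    total = weights.sum()
    if total <= 0:
        raise ValueError("no candidate location with positive prior within max_dist")
    weights = weights / total
    return float(np.sum(weights * exposure_fn(grid_xy)))

# Illustrative use with made-up inputs: a flat prior on a 20 x 20 grid
# and a single facility at the origin.
rng = np.random.default_rng(0)
xs, ys = np.meshgrid(np.linspace(-10, 10, 81), np.linspace(-10, 10, 81))
grid_xy = np.column_stack([xs.ravel(), ys.ravel()])
prior = np.ones(len(grid_xy))
facility = np.array([0.0, 0.0])

def distance_to_facility(points):
    return np.linalg.norm(points - facility, axis=1)

true_xy = np.array([2.0, 1.0])
reported_xy = perturb(true_xy, max_dist=3.0, rng=rng)
print(expected_exposure(reported_xy, grid_xy, prior, 3.0, distance_to_facility))
```

In a linear model, regressing the outcome on this conditional expectation rather than on the exposure computed naively from the reported point is one way to avoid the bias the abstract describes, at the cost of wider confidence intervals. A non-flat population density map would enter only through `prior`.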
Date: 2020
Downloads:
https://doi.org/10.1111/rssc.12439
Persistent link: https://EconPapers.repec.org/RePEc:bla:jorssc:v:69:y:2020:i:5:p:1251-1268
Journal of the Royal Statistical Society Series C is currently edited by R. Chandler and P. W. F. Smith
Bibliographic data for series maintained by Wiley Content Delivery.