Research Article
Open Access Peer-reviewed

Anonymization of Georeferenced Public Health Microdata

Simon Cremer1, Lydia Jehmlich1, Rainer Lenz1,

1Cologne university of technology, arts and sciences, Cologne, Germany

American Journal of Public Health Research. 2025, 13(6), 257-262. DOI: 10.12691/ajphr-13-6-1
Received October 07, 2025; Revised November 09, 2025; Accepted November 17, 2025

Abstract

Whether for the use of targeted advertising measures or tracing the spatial spread of viruses such as the recent corona virus: georeferenced microdata can - depending on the attributes it is provided with - hold enormous added value for society, science and research. However, the desired information can often not be extracted despite the inherent analytical content. The reason for this is that access to personal georeferenced datasets is severely restricted, as these are subject to statutory data protection. One way out of this dilemma is to apply a suitable anonymization method that guarantees data protection without significantly reducing the analytical validity of data. Based on the EU INSPIRE directive, the statistical offices of the EU are successively implementing the georeferencing of their surveys. This paper discusses selected anonymization methods being most promising for anonymizing georeferenced health data for research purposes, as they offer scope for combinations or more specific adaptations in order to balance out the trade-off between privacy and analytical validity of georeferenced health microdata.

Keywords:

data anonymization, disclosure risk, health microdata, location privacy, spatial analysis
[1]  Ronning, G., Sturm, R., Höhne, J., Lenz, R., Rosemann, M., Scheffler, M. and Vorgrimler, D., Handbuch zur Anonymisierung wirtschaftsstatistischer Mikrodaten, Statistisches Bundesamt, Wiesbaden, Series ‘Statistik und Wissenschaft’, vol. 4, 2005.
 
[2]  JdS 2025, 56 ème Journées de Statistique de la Societé Francaise de Statistique, Université Aix-Marseille, june 2025, see: https://jds2025.sciencesconf.org/?lang=fr.
 
[3]  Lenz, R., Measuring the disclosure protection of micro aggregated business microdata - an analysis taking as an example the German Structure of Costs Survey, Journal of Official Statistics 22 (4), 681-710, 2006.
 
[4]  Eurostat, Statistical confidentiality and personal data protection, 2023. Available at: https:// ec.europa.eu/ eurostat/ de/web/ microdata/statistical-confidentiality-and-personal-data-protection.
 
[5]  Broen, K., Rob, T. and Jon, Z., Measuring the impact of spatial perturbations on the relationship between data privacy and validity of descriptive Statistics, 2021, International Journal of Health Geographics, 20 (3).View Article  PubMed
 
[6]  Dwork, C., Differential privacy, International colloquium on automata, languages, and programming, 1-12, 2006.View Article
 
[7]  Gao, S., Rao, J., Liu, X., Kang, Y., Huang, Q., App, J., Exploring the effectiveness of geomasking techniques for protecting the geoprivacy of Twitter users, Journal of spatial inform. science, 19, 105–129, 2019.View Article
 
[8]  Armstrong, M.P., Rushton, G. and Zimmermann, D.L., Geographically Masking Health Data to preserve Confidentiality, Statistics in Medicine, 18, 497-525, 1999.View Article
 
[9]  Kwan, M., Casas, I. and Schmitz, B. (2004), Protection of Geoprivacy and Accuracy of Spa-tial Information: How Effective Are Geographical Masks? Cartographica, 39(2), 15-28, 2004.View Article
 
[10]  Houfaf-Khoufaf, W. and Touya, G., Geographically Masking Addresses to Study COVID-19 Clusters, Univ. Gustave Eiffel, 2021.View Article  PubMed
 
[11]  Delmelle, E. M., Desjardins, M. R.; Jung, P., Owusu, C., Hohl, A., Dony, C., Uncertainty in geospatial health: challenges and oppurtunities ahead, Annals of Epidemology, 65, 15-30, 2022.View Article  PubMed
 
[12]  Hampton, K.H., Fitch, M.K., Allshouse, W.B., Doherty, I.A., Gesink, D.C., Leone, P.A. and Miller, W.C., Mapping Health Data: Improved Privacy Protection with Donut Method Geomasking, American Journal of Epidemiology, 172 (9), 1062–1069, 2010.View Article  PubMed
 
[13]  Ocaña-Riola, R., Common errors in disease mapping, Geospatial Health, 4 (2), 139-154, 2010.View Article  PubMed
 
[14]  Seidl, D.E., Paulus, G., Jankowski, P., and Regenfelder, M., Spatial obfuscation methods for privacy protection of household-level data, Applied Geography, 63, 253-263, 2015.View Article
 
[15]  Kounadi, O. and Leitner, M., Adaptive areal elimination: A transparent way of disclo-sing protected spatial datasets, Computers, Environment and Urban Systems, 57, 59–67.View Article
 
[16]  Koller, D., Wohlrab, D., Sedlmeir, G. and Augustin, J., Geografische Ansätze in der Gesundheitsberichterstattung, Bundesgesundheitsblatt, 63, 1108–1117, 2020.View Article  PubMed
 
[17]  Erfuhrt, K., Groß, M., Rendtel, U., Schmid, T., Kernel density smoothing of composite spatial data on administrative area level - A case study of voting data in Berlin, AStA Wirtsch Sozialstat Arch, 16, 25–49, 2022.View Article
 
[18]  MacEachren, AM., How Maps Work: Representation, Visualization, and Design, The Guilford Press, 1995.
 
[19]  Zandbergen, P. A., Ensuring Confidentiality of Geocoded Health Data: Assessing Geographic Masking Strategies for Individual-Level Data, Advances in Medicine, 2014.View Article  PubMed
 
[20]  Swanlund, D., Schuurman, N., Zandbergen, P., Brussoni, M., Street masking: a network-based geographic mask for easily protecting geoprivacy, International Journal of Health Geographics, 19 (26), 1-11, 2020.View Article  PubMed
 
[21]  Wang, J., Kim, J., Kwan, M.-P., An exploratory assessment of the effectiveness of geomasking methods on privacy protection and analytical accuracy for individual-level geospatial data, Cartography and Geographic Information Society, 49 (5), 385-406, 2022.View Article
 
[22]  Kounadi, O., Towards geoprivacy guidelines for spatial data, ETH Zürich Research Collection, ETH Library, 2015.
 
[23]  Leitner, M., Curtis, A., A first step towards a framework for presenting the location of confidential point data on maps—results of an empirical perceptual study, International Journal of Geographical Information Science, 20 (7), 813-822. 2006.View Article
 
[24]  Polzin, F., Kounadi, O., Adaptive Voronoi Masking: A Method to Protect Confidential Discrete Spatial Data, 11th International Conference on Geographic Information Science, 2021.
 
[25]  Cremer, S., Jehmlich, L. and Lenz, R., Masking georeferenced health data - an analysis taking the example of partially synthetic data on sleep disorder, Privacy in Statistical Databases, Domingo-Ferrer and M. Önen (Eds.), LNCS 14915, 297–309, 2024.View Article
 
[26]  Dalenius, T., Reiss, S. P., Data Swapping: A Technique for Disclosure Control, Journal of Statistical Planning and Inference, 6, 73-85, 1982.View Article
 
[27]  Gutmann, M. P., Witkowski, K., Colyer, C., McFarland O’Rourke, J., McNally, J., Providing Spatial Data for Secondary Analysis: Issues and Current Practices Relating to Confidentiality, Population Research and Policy Review 27 (6), 639-665.View Article  PubMed
 
[28]  Zhang, S., Freundschuh, S.M., Lenzer, K., Zandbergen, P.A., The Location Swapping Method for Geomasking, Cartography and Geographic Information Science,44 (1), 22-34, 2017.View Article
 
[29]  Richter, W., The verified neighbor approach to geoprivacy: An improved method for geographic masking, Journal of Exposure Science and Environmental Epidemiology 28, 109–118, 2018.View Article  PubMed
 
[30]  Swanlund, D., Schuurman, N., Zandbergen, P., Brussoni, M., Street masking: a network-based geographic mask for easily protecting geoprivacy, International Journal of Health Geographics, 19 (26), 1-11, 2020.View Article  PubMed
 
[31]  Li, N., Li, T. and Venkatasubramanian, S. (Eds.), t-closeness: Privacy beyond k-anonymity and l-diversity, IEEE 23rd international conference on data engineering.
 
[32]  Lenz, R., On the way to remote access to German official microdata, Statistique et nouvelles technologies de l’information, Revue des Nouvelles Technologies de l’information, 125-138, 2011 (paper presented at the 41ème Journées de Statistique de la Societé Francaise de Statistique, Bordeaux 2009).
 
[33]  Research data centres of the Federal Statistical Office and the Statistical Offices of the Federal States, https://www.forschungs-datenzentrum.de/de, last accessed on 3 september 2025.
 
[34]  Wenzel, M., Corr, D., Riedel, N., Hapfelmeier, J., Zimmermann, L., State of efficient research on secondary health data in the German health data lab, Georg Thieme Verlag, Stuttgart, 2025.