Differential privacy for metric spaces: information-theoretic models for privacy and utility with new applications to metric domains

Fernandes, Natasha

doi:10.25949/21365637.v1

01whole.pdf (9.66 MB)

Differential privacy for metric spaces: information-theoretic models for privacy and utility with new applications to metric domains

thesis

posted on 2022-10-19, 21:52 authored by Natasha FernandesNatasha Fernandes

The problem of data privacy – protecting sensitive or personal data from discovery – has been a long-standing research issue. In this regard, differential privacy, introduced in 2006, is considered to be the gold standard. Differential privacy was designed to protect the privacy of individuals in statistical datasets such as census datasets. Its widespread popularity has led to interest in applying differential privacy to new domains for which it was not originally designed, such as text documents. This raises questions regarding the interpretability of differential privacy’s guarantees, which are usually expressed in the language of statistical disclosure control. In addition, it escalates the need for answers to core issues currently debated in the differential privacy community: how does the application of differential privacy protect against inference attacks? How can the use of noise-adding mechanisms guarantee the release of useful information? And how can this privacy-utility balance be achieved? The goal of this thesis is to address these foundational questions. Firstly, we approach the problem of interpretability by exploring a generalisation of differential privacy for metric domains known as metric differential privacy or d-privacy. Metric differential privacy abstracts away from the particulars of statistical databases and permits reasoning about privacy on more general domains endowed with a metric. This allows differential privacy’s guarantees to be understood in more general terms which can be applied to arbitrary domains of interest, including text documents. Secondly, we propose to study the key questions surrounding privacy and utility in differential privacy using the Quantitative Information Flow (QIF) framework, an information-theoretic framework previously used to analyse threats to secure systems. In this thesis, we repurpose QIF to analyse the privacy and utility guarantees provided by differentially private systems modelled as probabilistic channels. Using information flow analysis we examine the privacy characteristics of d-private mechanisms, finding new ways to compare them with respect to the protection they afford against arbitrary adversarial threats; we examine the utility characteristics of d-private mechanisms, discovering a new characterisation for optimal mechanisms and a proof of the universal optimality of the Laplace mechanism; and we re-examine the well-known privacy-utility trade-off for d-private mechanisms, finding new models for describing the relationship between privacy and utility via correlations. The second part of this thesis is dedicated to the demonstration of the practical applicability of d-privacy to novel and complex domains. We present three new sample applications of d-privacy: to text document privacy, statistical utility and private nearest neighbour search. In each of these applications, we show how the use of d-privacy, and an understanding of the metrics on the domain, permit reasoning about privacy and utility. This opens up new methods of exploring privacy in these domains, as well as providing guidelines for further applications of differential privacy to new domains.

History

1 Overview -- I Foundations -- 2 Quantitative Information Flow -- 3 Metric Differential Privacy -- II Analysis -- 4 Comparing Privacy Mechanisms -- 5 Inference Attacks -- 6 Optimality I: Discrete Mechanisms -- 7 Optimality II: Continuous Mechanisms -- III Applications -- 8 Statistical Utility for Local Differential Privacy -- 9 Text Document Privacy -- 10 Locality Sensitive Hashing with Differential Privacy -- 11 Conclusions and Future Work -- References -- Appendices

Notes

A thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy Cotutelle thesis in conjunction with the Institut Polytechnique de Paris (Ecole Polytechnique)

Awarding Institution

Macquarie University

Degree Type

Thesis PhD

Degree

Thesis (PhD), Department of Computing, Faculty of Science and Engineering, Macquarie University

Department, Centre or School

Department of Computing

Year of Award

2021

Principal Supervisor

Annabelle McIver

Additional Supervisor 1

Catuscia Palamidessi

Rights

Copyright: The Author Copyright disclaimer: https://www.mq.edu.au/copyright-disclaimer

Language

English

Extent

312 pages

Usage metrics

Keywords

Differential Privacy quantitative information flow privacy-utility trade-off

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Differential privacy for metric spaces: information-theoretic models for privacy and utility with new applications to metric domains

History

Table of Contents

Notes

Awarding Institution

Degree Type

Degree

Department, Centre or School

Year of Award

Principal Supervisor

Additional Supervisor 1

Rights

Language

Extent

Usage metrics

Categories

Keywords

Licence

Exports