Foundations and Trends® in Theoretical Computer Science > Vol 9 > Issue 3–4

The Algorithmic Foundations of Differential Privacy

By Cynthia Dwork, Microsoft Research, USA, dwork@microsoft.com | Aaron Roth, University of Pennsylvania, USA, aaroth@gmail.com

 
Suggested Citation
Cynthia Dwork and Aaron Roth (2014), "The Algorithmic Foundations of Differential Privacy", Foundations and Trends® in Theoretical Computer Science: Vol. 9: No. 3–4, pp 211-407. http://dx.doi.org/10.1561/0400000042

Publication Date: 11 Aug 2014
© 2014 C. Dwork and A. Roth
 
Subjects
Algorithmic game theory,  Cryptography and information security,  Database theory,  Design and analysis of algorithms
 

Free Preview:

Download extract

Share

Download article
In this article:
Preface 
1. The Promise of Differential Privacy 
2. Basic Terms 
3. Basic Techniques and Composition Theorems 
4. Releasing Linear Queries with Correlated Error 
5. Generalizations 
6. Boosting for Queries 
7. When Worst-Case Sensitivity is Atypical 
8. Lower Bounds and Separation Results 
9. Differential Privacy and Computational Complexity 
10. Differential Privacy and Mechanism Design 
11. Differential Privacy and Machine Learning 
12. Additional Models 
13. Reflections 
Appendices 
Acknowledgments 
References 

Abstract

The problem of privacy-preserving data analysis has a long history spanning multiple disciplines. As electronic data about individuals becomes increasingly detailed, and as technology enables ever more powerful collection and curation of these data, the need increases for a robust, meaningful, and mathematically rigorous definition of privacy, together with a computationally rich class of algorithms that satisfy this definition. Differential Privacy is such a definition.

After motivating and discussing the meaning of differential privacy, the preponderance of this monograph is devoted to fundamental techniques for achieving differential privacy, and application of these techniques in creative combinations, using the query-release problem as an ongoing example. A key point is that, by rethinking the computational goal, one can often obtain far better results than would be achieved by methodically replacing each step of a non-private computation with a differentially private implementation. Despite some astonishingly powerful computational results, there are still fundamental limitations — not just on what can be achieved with differential privacy but on what can be achieved with any method that protects against a complete breakdown in privacy. Virtually all the algorithms discussed herein maintain differential privacy against adversaries of arbitrary computational power. Certain algorithms are computationally intensive, others are efficient. Computational complexity for the adversary and the algorithm are both discussed.

We then turn from fundamentals to applications other than queryrelease, discussing differentially private methods for mechanism design and machine learning. The vast majority of the literature on differentially private algorithms considers a single, static, database that is subject to many analyses. Differential privacy in other models, including distributed databases and computations on data streams is discussed.

Finally, we note that this work is meant as a thorough introduction to the problems and techniques of differential privacy, but is not intended to be an exhaustive survey — there is by now a vast amount of work in differential privacy, and we can cover only a small portion of it.

DOI:10.1561/0400000042
ISBN: 978-1-60198-818-8
286 pp. $99.00
Buy book (pb)
 
ISBN: 978-1-60198-819-5
286 pp. $240.00
Buy E-book (.pdf)
Table of contents:
Preface
1. The Promise of Differential Privacy
2. Basic Terms
3. Basic Techniques and Composition Theorems
4. Releasing Linear Queries with Correlated Error
5. Generalizations
6. Boosting for Queries
7. When Worst-Case Sensitivity is Atypical
8. Lower Bounds and Separation Results
9. Differential Privacy and Computational Complexity
10. Differential Privacy and Mechanism Design
11. Differential Privacy and Machine Learning
12. Additional Models
13. Reflections
Appendices
Acknowledgments
References

The Algorithmic Foundations of Differential Privacy

The problem of privacy-preserving data analysis has a long history spanning multiple disciplines. As electronic data about individuals becomes increasingly detailed, and as technology enables ever more powerful collection and curation of these data, the need increases for a robust, meaningful, and mathematically rigorous definition of privacy, together with a computationally rich class of algorithms that satisfy this definition. Differential Privacy is such a definition.

The Algorithmic Foundations of Differential Privacy starts out by motivating and discussing the meaning of differential privacy, and proceeds to explore the fundamental techniques for achieving differential privacy, and the application of these techniques in creative combinations, using the query-release problem as an ongoing example. A key point is that, by rethinking the computational goal, one can often obtain far better results than would be achieved by methodically replacing each step of a non-private computation with a differentially private implementation. Despite some powerful computational results, there are still fundamental limitations.

Virtually all the algorithms discussed herein maintain differential privacy against adversaries of arbitrary computational power — certain algorithms are computationally intensive, others are efficient. Computational complexity for the adversary and the algorithm are both discussed. The monograph then turns from fundamentals to applications other than query-release, discussing differentially private methods for mechanism design and machine learning. The vast majority of the literature on differentially private algorithms considers a single, static, database that is subject to many analyses. Differential privacy in other models, including distributed databases and computations on data streams, is discussed.

The Algorithmic Foundations of Differential Privacy is meant as a thorough introduction to the problems and techniques of differential privacy, and is an invaluable reference for anyone with an interest in the topic.

 
TCS-042