
Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

By Fredrik Hellström, University College London, UK, f.hellstrom@ucl.ac.uk | Giuseppe Durisi, Chalmers University of Technology, Sweden, durisi@chalmers.se | Benjamin Guedj, Inria, France and University College London, UK, benjamin.guedj@inria.fr | Maxim Raginsky, University of Illinois, USA, maxim@illinois.edu

 
Suggested Citation
Fredrik Hellström, Giuseppe Durisi, Benjamin Guedj and Maxim Raginsky (2025), "Generalization Bounds: Perspectives from Information Theory and PAC-Bayes", Foundations and Trends® in Machine Learning: Vol. 18: No. 1, pp 1-223. http://dx.doi.org/10.1561/2200000112

Publication Date: 23 Jan 2025
© 2025 F. Hellström et al.
 
Subjects
Information theory and computer science,  Information theory and statistics,  Pattern recognition and learning,  Learning and statistical methods,  Statistical/Machine learning,  Statistical learning theory,  Deep learning,  Classification and prediction,  Reinforcement learning,  Design and analysis of algorithms
 


Abstract

A fundamental question in theoretical machine learning is generalization. Over the past decades, the PAC-Bayesian approach has been established as a flexible framework to address the generalization capabilities of machine learning algorithms and design new ones. Recently, it has garnered increased interest due to its potential applicability for a variety of learning algorithms, including deep neural networks. In parallel, an information-theoretic view of generalization has developed, wherein the relation between generalization and various information measures has been established. This framework is intimately connected to the PAC-Bayesian approach, and a number of results have been independently discovered in both strands.
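As a representative illustration of such a result (stated here for context under its usual assumptions, not quoted from the monograph), consider the bound of Xu and Raginsky (2017): if a learning algorithm maps a training set S of n i.i.d. samples to a hypothesis W, and the loss is σ-sub-Gaussian under the data distribution, then the expected generalization error satisfies

\[ \bigl|\,\mathbb{E}\!\left[L_{\mathcal{D}}(W) - L_{S}(W)\right]\bigr| \;\le\; \sqrt{\frac{2\sigma^{2}\, I(W;S)}{n}}, \]

where \(L_{\mathcal{D}}\) and \(L_{S}\) denote the population and empirical risks and \(I(W;S)\) is the mutual information between the hypothesis and the training data. Bounds of this form make precise the intuition that an algorithm which extracts little information from its training data cannot overfit badly.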

In this monograph, we highlight this strong connection and present a unified treatment of PAC-Bayesian and information-theoretic generalization bounds. We present techniques and results that the two perspectives have in common, and discuss the approaches and interpretations that differ. In particular, we demonstrate how many proofs in the area share a modular structure, through which the underlying ideas can be intuited. We pay special attention to the conditional mutual information (CMI) framework, analytical studies of the information complexity of learning algorithms, and the application of the proposed methods to deep learning. This monograph is intended to provide a comprehensive introduction to information-theoretic generalization bounds and their connection to PAC-Bayes, serving as a foundation from which the most recent developments are accessible. It is aimed broadly towards researchers with an interest in generalization and theoretical machine learning.
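To illustrate the CMI idea mentioned above (again a sketch under standard assumptions rather than a statement from the text), the conditional-mutual-information bound of Steinke and Zakynthinou (2020) replaces \(I(W;S)\) with a quantity that is always finite: draw a supersample \(\tilde{Z}\) of 2n i.i.d. examples arranged in n pairs, and let \(U \in \{0,1\}^{n}\) select one example from each pair to form the training set. For a loss bounded in [0,1], the expected generalization error satisfies

\[ \mathbb{E}\!\left[L_{\mathcal{D}}(W) - L_{S}(W)\right] \;\le\; \sqrt{\frac{2\, I(W; U \mid \tilde{Z})}{n}}, \]

where the conditional mutual information \(I(W; U \mid \tilde{Z})\) is at most \(n\log 2\), so the bound never degenerates even when \(I(W;S)\) is infinite.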

DOI: 10.1561/2200000112
ISBN: 978-1-63828-420-8
242 pp. $99.00
 
ISBN: 978-1-63828-421-5
242 pp. $160.00
Table of contents:
1. Introduction: On Generalization and Learning
2. Information-Theoretic Approach to Generalization
3. Tools
4. Generalization Bounds in Expectation
5. Generalization Bounds in Probability
6. The CMI Framework
7. The Information Complexity of Learning Algorithms
8. Neural Networks and Iterative Algorithms
9. Alternative Learning Models
10. Concluding Remarks
Acknowledgements
References

Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

Artificial intelligence and machine learning have emerged as driving forces behind transformative advancements in various fields, and have become increasingly pervasive in many industries and daily life. As these technologies continue to gain momentum, so does the need to develop a deeper understanding of their underlying principles, capabilities, and limitations. In this monograph, the authors focus on the theory of machine learning, in particular statistical learning theory, with emphasis on the generalization capabilities of learning algorithms.

Part I covers the foundations of information-theoretic and PAC-Bayesian generalization bounds for standard supervised learning. Part II explores the applications of generalization bounds, as well as extensions to settings beyond standard supervised learning. Several important areas of application include neural networks, federated learning and reinforcement learning. The monograph concludes with a broader discussion of information-theoretic and PAC-Bayesian generalization bounds as a whole.

This monograph will be of interest to students and researchers working in generalization and theoretical machine learning. It provides a comprehensive introduction to information-theoretic generalization bounds and their connection to PAC-Bayes, serving as a foundation from which the most recent developments are accessible.

 
MAL-112