By Elaine Angelino, University of California, Berkeley, USA, elaine@eecs.berkeley.edu | Matthew James Johnson, Harvard University, USA, mattjj@csail.mit.edu | Ryan P. Adams, Harvard University and Twitter, USA, rpa@seas.harvard.edu
Datasets are growing not just in size but in complexity, creating a demand for rich models and quantification of uncertainty. Bayesian methods are an excellent fit for this demand, but scaling Bayesian inference is a challenge. In response to this challenge, there has been considerable recent work based on varying assumptions about model structure, underlying computational resources, and the importance of asymptotic correctness. As a result, there is a zoo of ideas with a wide range of assumptions and applicability. In this paper, we seek to identify unifying principles, patterns, and intuitions for scaling Bayesian inference. We review existing work on utilizing modern computing resources with both MCMC and variational approximation techniques. From this taxonomy of ideas, we characterize the general principles that have proven successful for designing scalable inference procedures and comment on the path forward.
Datasets are growing not just in size but in complexity, creating a demand for rich models and quantification of uncertainty. Bayesian methods are an excellent fit for this demand, but scaling Bayesian inference is a challenge. In response to this challenge, there has been considerable recent work based on varying assumptions about model structure, underlying computational resources, and the importance of asymptotic correctness. As a result, there is a zoo of ideas with a wide range of assumptions and applicability.
Patterns of Scalable Bayesian Inference seeks to identify unifying principles, patterns, and intuitions for scaling Bayesian inference. It examines how these techniques can be scaled up to larger problems and scaled out across parallel computational resources. It reviews existing work on utilizing modern computing resources with both MCMC and variational approximation techniques. From this taxonomy of ideas, it characterizes the general principles that have proven successful for designing scalable inference procedures and addresses some of the significant open questions and challenges.