Abstract
A common divide-and-conquer approach for Bayesian computation with big data is to partition the data, perform local inference for each piece separately, and combine the results to obtain a global posterior approximation. While being conceptually and computationally appealing, this method involves the problematic need to also split the prior for the local inferences; these weakened priors may not provide enough regularization for each separate computation, thus eliminating one of the key advantages of Bayesian methods. To resolve this dilemma while still retaining the generalizability of the underlying local inference method, we apply the idea of expectation propagation (EP) as a framework for distributed Bayesian inference. The central idea is to iteratively update approximations to the local likelihoods given the state of the other approximations and the prior. The present paper has two roles: we review the steps that are needed to keep EP algorithms numerically stable, and we suggest a general approach, inspired by EP, for approaching data partitioning problems in a way that achieves the computational benefits of parallelism while allowing each local update to make use of relevant information from the other sites. In addition, we demonstrate how the method can be applied in a hierarchical context to make use of partitioning of both data and parameters. The paper describes a general algorithmic framework, rather than a specific algorithm, and presents an example implementation for it.
Original language | English |
---|---|
Journal | Journal of Machine Learning Research |
Volume | 21 |
Publication status | Published - Jan 1 2020 |
Bibliographical note
Publisher Copyright:© 2020 Aki Vehtari, Andrew Gelman, Tuomas Sivula, Pasi Jylänki, Dustin Tran, Swupnil Sahai, Paul Blomstedt, John P. Cunningham, David Schiminovich, and Christian P. Robert.
Funding
We thank David Blei, Ole Winther, Bob Carpenter, and anonymous reviewers for helpful comments, and the U.S. National Science Foundation, Institute for Education Sciences, Office of Naval Research, Moore and Sloan Foundations, and Academy of Finland (grant 298742 and the Finnish Centre of Excellence in Computational Inference Research COIN) for partial support of this research. We also acknowledge the computational resources provided by the Aalto Science-IT project.
Funders | Funder number |
---|---|
Institute for Education Sciences | |
National Science Foundation | |
Office of Naval Research | |
Institute of Education Sciences | |
Suomen Akatemia | 298742 |
ASJC Scopus Subject Areas
- Software
- Artificial Intelligence
- Control and Systems Engineering
- Statistics and Probability