Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3519935.3519999acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
research-article

Simple parallel algorithms for single-site dynamics

Published: 10 June 2022 Publication History

Abstract

The single-site dynamics are a canonical class of Markov chains for sampling from high-dimensional probability distributions, e.g. the ones represented by graphical models.
We give a simple and generic parallel algorithm that can faithfully simulate single-site dynamics. When the chain asymptotically satisfies the ℓp-Dobrushin’s condition, specifically, when the Dobrushin’s influence matrix has constantly bounded ℓp-induced operator norm for an arbitrary p∈[1, ∞], the parallel simulation of N steps of single-site updates succeeds within O(N/n+logn) depth of parallel computing using Õ(m) processors, where n is the number of sites and m is the size of graphical model. Since the Dobrushin’s condition is almost always satisfied asymptotically by mixing chains, this parallel simulation algorithm essentially transforms single-site dynamics with optimal O(nlogn) mixing time to algorithms for sampling. In particular we obtain samplers, for the Ising models on general graphs in the uniqueness regime, and for satisfying solutions of CNF formulas in a local lemma regime. With non-adaptive simulated annealing, these samplers can be transformed routinely to algorithms for approximate counting.
A key step in our parallel simulation algorithm, is a so-called “universal coupling” procedure, which tries to simultaneously couple all distributions over the same sample space. We construct such a universal coupling, that for every pair of distributions the coupled probability is at least their Jaccard similarity. We also prove that this is optimal in the worst case. The universal coupling and its applications are of independent interests.

References

[1]
Amr Ahmed, Moahmed Aly, Joseph Gonzalez, Shravan Narayanamurthy, and Alexander J Smola. 2012. Scalable inference in latent variable models. In Proceedings of the 5th ACM international conference on Web search and data mining (WSDM). 123–132.
[2]
Nima Anari, Nathan Hu, Amin Saberi, and Aaron Schild. 2021. Sampling Arborescences in Parallel. In Proceedings of the 12th Innovations in Theoretical Computer Science Conference (ITCS). 185, 18.
[3]
Nima Anari, Vishesh Jain, Frederic Koehler, Huy Tuan Pham, and Thuy-Duong Vuong. 2021. Entropic Independence II: Optimal Sampling and Concentration via Restricted Modified Log-Sobolev Inequalities. ArXiv preprint, arXiv:2111.03247.
[4]
Amartya Shankha Biswas, Ronitt Rubinfeld, and Anak Yodpinyanee. 2020. Local Access to Huge Random Objects Through Partial Sampling. In Proceedings of the 11th Innovations in Theoretical Computer Science Conference (ITCS). 151, 27.
[5]
Antonio Blanca, Pietro Caputo, Zongchen Chen, Daniel Parisi, Daniel Štefankovič, and Eric Vigoda. 2022. On Mixing of Markov Chains: Coupling, Spectral Independence, and Entropy Factorization. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). 3670–3692.
[6]
Russ Bubley and Martin Dyer. 1997. Path Coupling: A Technique for Proving Rapid Mixing in Markov Chains. In Proceedings of the 38th Annual IEEE Symposium on Foundations of Computer Science (FOCS). 223–231.
[7]
Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. 2021. Optimal Mixing Time for the Ising Model in the Uniqueness Regime. ArXiv preprint, arXiv:2111.03034.
[8]
Xiaoyu Chen, Weiming Feng, Yitong Yin, and Xinyuan Zhang. 2022. Optimal mixing for two-state anti-ferromagnetic spin systems. ArXiv preprint, arXiv:2203.07771.
[9]
Zongchen Chen, Kuikui Liu, and Eric Vigoda. 2021. Optimal Mixing of Glauber Dynamics: Entropy Factorization via High-Dimensional Expansion. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing (STOC). 1537–1550.
[10]
Constantinos Daskalakis, Nishanth Dikkala, and Siddhartha Jayanti. 2018. HOGWILD!-Gibbs can be PanAccurate. In Proceedings of the 31st Advances in Neural Information Processing Systems (NIPS). 32–41.
[11]
Christopher De Sa, Kunle Olukotun, and Christopher Ré. 2016. Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. In Proceedings of the 33rd International Conference on Machine Learning (ICML). 1567–1576.
[12]
Christopher De Sa, Ce Zhang, Kunle Olukotun, and Christopher Ré. 2015. Rapidly mixing Gibbs sampling for a class of factor graphs using hierarchy width. In Proceedings of the 28th Advances in Neural Information Processing Systems (NIPS). 3097–3105.
[13]
Roland L Dobrushin. 1970. Prescribing a system of random variables by conditional distributions. Theory of Probability & Its Applications, 15, 3 (1970), 458–486.
[14]
Roland L Dobrushin and Senya B Shlosman. 1985. Completely analytical Gibbs fields. In Statistical physics and dynamical systems. Springer, 371–403.
[15]
Roland Lvovich Dobrushin and Senya B Shlosman. 1985. Constructive criterion for the uniqueness of Gibbs field. In Statistical physics and dynamical systems. Springer, 347–370.
[16]
Martin Dyer, Alan Frieze, and Ravi Kannan. 1991. A random polynomial-time algorithm for approximating the volume of convex bodies. Journal of the ACM (JACM), 38, 1 (1991), 1–17.
[17]
Martin Dyer, Leslie Ann Goldberg, and Mark Jerrum. 2009. Matrix norms and rapid mixing for spin systems. The Annals of Applied Probability, 19, 1 (2009), 71–107.
[18]
Ronen Eldan, Frederic Koehler, and Ofer Zeitouni. 2021. A spectral condition for spectral gap: fast mixing in high-temperature Ising models. Probability Theory and Related Fields, 1–17.
[19]
Weiming Feng, Heng Guo, Yitong Yin, and Chihao Zhang. 2021. Fast sampling and counting k-SAT solutions in the local lemma regime. Journal of the ACM (JACM), 68, 6 (2021), 1–42.
[20]
Weiming Feng, Thomas P Hayes, and Yitong Yin. 2021. Distributed metropolis sampler with optimal parallelism. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA). 2121–2140.
[21]
Weiming Feng, Yuxin Sun, and Yitong Yin. 2017. What Can be Sampled Locally? In Proceedings of the 36th ACM Symposium on Principles of Distributed Computing (PODC). 121–130.
[22]
Weiming Feng, Nisheeth K Vishnoi, and Yitong Yin. 2021. Dynamic sampling from graphical models. SIAM J. Comput., 50, 2 (2021), 350–381.
[23]
Weiming Feng and Yitong Yin. 2018. On Local Distributed Sampling and Counting. In Proceedings of the 37th ACM Symposium on Principles of Distributed Computing (PODC). 189–198.
[24]
Manuela Fischer and Mohsen Ghaffari. 2018. A simple parallel and distributed sampling technique: Local glauber dynamics. In 32nd International Symposium on Distributed Computing (DISC). 121, 26–1.
[25]
A Gerschcnfeld and A Monianari. 2007. Reconstruction for Models on Random Graphs. In Proceedings of the 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS). 194–204.
[26]
Roy J. Glauber. 1963. Time-dependent statistics of the Ising model. Journal of mathematical physics, 4, 2 (1963), 294–307.
[27]
Joseph E Gonzalez, Yucheng Low, Arthur Gretton, and Carlos Guestrin. 2011. Parallel Gibbs Sampling: From Colored Fields to Thin Junction Trees. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS). 324–332.
[28]
Heng Guo, Mark Jerrum, and Jingcheng Liu. 2019. Uniform sampling through the Lovász local lemma. Journal of the ACM (JACM), 66, 3 (2019), 1–31.
[29]
Thomas P. Hayes. 2006. A simple condition implying rapid mixing of single-site dynamics on spin systems. In Proceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS). 39–46.
[30]
Thomas P. Hayes and Alistair Sinclair. 2007. A General Lower Bound for Mixing of Single-Site Dynamics on Graphs. The Annals of Applied Probability, 931–952.
[31]
Mark Jerrum and Alistair Sinclair. 1993. Polynomial-time approximation algorithms for the Ising model. SIAM J. Comput., 22, 5 (1993), 1087–1116.
[32]
Michael I Jordan, Jason D Lee, and Yun Yang. 2019. Communication-efficient distributed statistical inference. J. Amer. Statist. Assoc., 114, 526 (2019), 668–681.
[33]
Richard M Karp and Vijaya Ramachandran. 1991. Parallel algorithms for shared-memory machines. In Handbook of theoretical computer science (vol. A) algorithms and complexity. 869–941.
[34]
David A. Levin, Yuval Peres, and Elizabeth L. Wilmer. 2017. Markov chains and mixing times. American Mathematical Society, Providence, RI.
[35]
Katalin Marton. 2019. Logarithmic Sobolev inequalities in discrete product spaces. Combinatorics, Probability and Computing, 28, 6 (2019), 919–935.
[36]
Nicholas Metropolis. 1987. The beginning of the Monte Carlo method. Los Alamos Science, 15, 584 (1987), 125–130.
[37]
Marc Mezard and Andrea Montanari. 2009. Information, physics, and computation. Oxford University Press.
[38]
M. Mitzenmacher and E. Upfal. 2017. Probability and Computing: Randomization and Probabilistic Techniques in Algorithms and Data Analysis. Cambridge University Press. isbn:9781107154889 lccn:2016041654
[39]
Ankur Moitra. 2019. Approximate counting, the Lovász local lemma, and inference in graphical models. Journal of the ACM (JACM), 66, 2 (2019), 1–25.
[40]
Elchanan Mossel and Allan Sly. 2013. Exact thresholds for Ising–Gibbs samplers on general graphs. The Annals of Probability, 41, 1 (2013), 294–328.
[41]
Ketan Mulmuley, Umesh V Vazirani, and Vijay V Vazirani. 1987. Matching is as easy as matrix inversion. In Proceedings of the 19th annual ACM Symposium on Theory of Computing (STOC). 345–354.
[42]
Feng Niu, Benjamin Recht, Christopher Re, and Stephen J Wright. 2011. HOGWILD! a lock-free approach to parallelizing stochastic gradient descent. In Proceedings of the 24th Advances in Neural Information Processing Systems (NIPS). 693–701.
[43]
Judea Pearl. 1982. Reverend bayes on inference engines: a distributed hierarchical approach. In Proceedings of the Second AAAI Conference on Artificial Intelligence (AAAI). 133–136.
[44]
Allan Sly and Nike Sun. 2012. The Computational Hardness of Counting in Two-Spin Models on d-Regular Graphs. In Proceedings of the 53rd Annual IEEE Symposium on Foundations of Computer Science (FOCS). 361–369.
[45]
Daniel Štefankovič, Santosh Vempala, and Eric Vigoda. 2009. Adaptive simulated annealing: A near-optimal connection between sampling and counting. Journal of the ACM (JACM), 56, 3 (2009), 1–36.
[46]
D. Stoyan, W.S. Kendall, and J. Mecke. 1995. Stochastic Geometry and Its Applications. Wiley. isbn:9780471950998 lccn:lc95004097
[47]
Shang-Hua Teng. 1995. Independent sets versus perfect matchings. Theoretical Computer Science, 145, 1-2 (1995), 381–390.
[48]
Alexander Terenin, Daniel Simpson, and David Draper. 2020. Asynchronous gibbs sampling. In Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS). 144–154.

Cited By

View all
  • (2023)Parallel Discrete Sampling via Continuous WalksProceedings of the 55th Annual ACM Symposium on Theory of Computing10.1145/3564246.3585207(103-116)Online publication date: 2-Jun-2023

Index Terms

  1. Simple parallel algorithms for single-site dynamics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    STOC 2022: Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing
    June 2022
    1698 pages
    ISBN:9781450392648
    DOI:10.1145/3519935
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 June 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Markov chain Monte Carlo
    2. Single-site dynamics
    3. sampling

    Qualifiers

    • Research-article

    Conference

    STOC '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)28
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 01 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Parallel Discrete Sampling via Continuous WalksProceedings of the 55th Annual ACM Symposium on Theory of Computing10.1145/3564246.3585207(103-116)Online publication date: 2-Jun-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media