research-article

Revisiting Local Computation of PageRank: Simple and Optimal

Authors:

Mingji YangAuthors Info & Claims

STOC 2024: Proceedings of the 56th Annual ACM Symposium on Theory of Computing

Pages 911 - 922

https://doi.org/10.1145/3618260.3649661

Published: 11 June 2024 Publication History

Abstract

We revisit ApproxContributions, the classic local graph exploration algorithm proposed by Andersen, Borgs, Chayes, Hopcroft, Mirrokni, and Teng (WAW ’07, Internet Math. ’08) for computing an є-approximation of the PageRank contribution vector for a target node t on a graph with n nodes and m edges. We give a worst-case complexity bound of it as O(nπ(t)/є·min(Δ_in,Δ_out,√m)), where π(t) is the PageRank score of t, and Δ_in and Δ_out are the maximum in-degree and out-degree of the graph, resp. We also give a lower bound of Ω(min(Δ_in/δ,Δ_out/δ,√m/δ,m)) for detecting t’s δ-contributing set, showing that the simple ApproxContributions algorithm is already optimal.

As ApproxContributions has become a cornerstone for computing random-walk probabilities, our results and techniques can be applied to derive better bounds for various relevant problems. In particular, we investigate the computational complexity of locally estimating a node’s PageRank centrality. We improve the best-known upper bound of O(n^2/3·min(Δ_out^1/3,m^1/6)) given by Bressan, Peserico, and Pretto (SICOMP ’23) to O(n^1/2·min(Δ_in^1/2,Δ_out^1/2,m^1/4)) by combining ApproxContributions with Monte Carlo sampling. We also improve their lower bound of Ω(min(n^1/2Δ_out^1/2,n^1/3m^1/3)) to Ω(n^1/2·min(Δ_in^1/2,Δ_out^1/2,m^1/4)) if min(Δ_in,Δ_out)=Ω(n^1/3), and to Ω(n^1/2−γ(min(Δ_in,Δ_out))^1/2+γ) otherwise, where γ>0 is an arbitrarily small constant. Our matching upper and lower bounds resolve the open problem of whether one can tighten the bounds given by Bressan, Peserico, and Pretto (FOCS ’18, SICOMP ’23). Remarkably, the techniques and analyses for proving all our results are surprisingly simple.

References

[1]

Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Kamal Jain, Vahab S. Mirrokni, and Shang-Hua Teng. 2008. Robust pagerank and locally computable spam detection features. In Proc. 4th Int. Workshop Adversarial Inf. Retrieval Web. 69–76.

Digital Library

[2]

Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. 2007. Local computation of pagerank contributions. In Proc. 5th Int. Workshop Algorithms Models Web Graph, Vol. 4863. 150–165.

[3]

Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. 2008. Local computation of pagerank contributions. Internet Math. 5, 1 (2008), 23–45.

[4]

Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. 2006. Local graph partitioning using pagerank vectors. In Proc. 47th Annu. IEEE Symp. Found. Comput. Sci. 475–486.

Digital Library

[5]

Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. 2007. Using pagerank to locally partition a graph. Internet Math. 4, 1 (2007), 35–64.

[6]

Konstantin Avrachenkov, Nelly Litvak, Danil Nemirovsky, and Natalia Osipova. 2007. Monte carlo methods in pagerank computation: when one iteration is sufficient. SIAM J. Numer. Anal. 45, 2 (2007), 890–904.

Digital Library

[7]

Siddhartha Banerjee and Peter Lofgren. 2015. Fast bidirectional probability estimation in markov models. In Advances Neural Inf. Process. Syst. 28. 1423–1431.

[8]

Ziv Bar-Yossef and Li-Tal Mashiach. 2008. Local approximation of pagerank and reverse pagerank. In Proc. 17th ACM Int. Conf. Inf. Knowl. Manage. 279–288.

Digital Library

[9]

Aleksandar Bojchevski, Johannes Klicpera, Bryan Perozzi, Amol Kapoor, Martin Blais, Benedek Rózemberczki, Michal Lukasik, and Stephan Günnemann. 2020. Scaling graph neural networks with approximate pagerank. In Proc. 26th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 2464–2473.

Digital Library

[10]

Christian Borgs, Michael Brautbar, Jennifer T. Chayes, and Shang-Hua Teng. 2012. A sublinear time algorithm for pagerank computations. In Proc. 9th Int. Workshop Algorithms Models Web Graph, Vol. 7323. 41–53.

Digital Library

[11]

Christian Borgs, Michael Brautbar, Jennifer T. Chayes, and Shang-Hua Teng. 2014. Multiscale matrix sampling and sublinear-time pagerank computation. Internet Math. 10, 1-2 (2014), 20–48.

[12]

Marco Bressan, Enoch Peserico, and Luca Pretto. 2013. The power of local information in pagerank. In Proc. 22nd Int. World Wide Web Conf. 179–180.

Digital Library

[13]

Marco Bressan, Enoch Peserico, and Luca Pretto. 2015. Simple set cardinality estimation through random sampling. CoRR abs/1512.07901 (2015).

[14]

Marco Bressan, Enoch Peserico, and Luca Pretto. 2018. Sublinear algorithms for local graph centrality estimation. In Proc. 59th Annu. IEEE Symp. Found. Comput. Sci. 709–718.

[15]

Marco Bressan, Enoch Peserico, and Luca Pretto. 2023. Sublinear algorithms for local graph-centrality estimation. SIAM J. Comput. 52, 4 (2023), 968–1008.

[16]

Marco Bressan and Luca Pretto. 2011. Local computation of pagerank: the ranking side. In Proc. 20th ACM Int. Conf. Inf. Knowl. Manage. 631–640.

Digital Library

[17]

Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. Comput. Netw. 30, 1-7 (1998), 107–117.

Digital Library

[18]

Ming Chen, Zhewei Wei, Bolin Ding, Yaliang Li, Ye Yuan, Xiaoyong Du, and Ji-Rong Wen. 2020. Scalable graph neural networks via bidirectional propagation. In Advances Neural Inf. Process. Syst. 33. 14556–14566.

[19]

Yen-Yu Chen, Qingqing Gan, and Torsten Suel. 2004. Local methods for estimating pagerank values. In Proc. 13th ACM Int. Conf. Inf. Knowl. Manage. 381–389.

Digital Library

[20]

Ashish Chiplunkar, Michael Kapralov, Sanjeev Khanna, Aida Mousavifar, and Yuval Peres. 2018. Testing graph clusterability: algorithms and lower bounds. In Proc. 59th Annu. IEEE Symp. Found. Comput. Sci. 497–508.

[21]

Fan Chung. 2007. The heat kernel as the pagerank of a graph. Proc. Nat. Acad. Sci. 104, 50 (2007), 19735–19740.

[22]

Artur Czumaj, Pan Peng, and Christian Sohler. 2015. Testing cluster structure of graphs. In Proc. 47th Annu. ACM Symp. Theory Comput. 723–732.

Digital Library

[23]

Paul Dagum, Richard M. Karp, Michael Luby, and Sheldon M. Ross. 2000. An optimal algorithm for monte carlo estimation. SIAM J. Comput. 29, 5 (2000), 1484–1496.

Digital Library

[24]

Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. 2005. Towards scaling fully personalized pagerank: algorithms, lower bounds, and experiments. Internet Math. 2, 3 (2005), 333–358.

[25]

Kimon Fountoulakis, Farbod Roosta-Khorasani, Julian Shun, Xiang Cheng, and Michael W. Mahoney. 2019. Variational perspective on local graph clustering. Math. Program. 174, 1-2 (2019), 553–573.

Digital Library

[26]

David F. Gleich and Marzia Polito. 2007. Approximating personalized pagerank with minimal use of web graph data. Internet Math. 3, 3 (2007), 257–294.

[27]

Oded Goldreich, Shafi Goldwasser, and Dana Ron. 1998. Property testing and its connection to learning and approximation. J. ACM 45, 4 (1998), 653–750.

Digital Library

[28]

Oded Goldreich and Dana Ron. 2002. Property testing in bounded degree graphs. Algorithmica 32, 2 (2002), 302–343.

Digital Library

[29]

Wentian Guo, Yuchen Li, Mo Sha, and Kian-Lee Tan. 2017. Parallel personalized pagerank on dynamic graphs. Proc. VLDB Endowment 11, 1 (2017), 93–106.

Digital Library

[30]

Taher H. Haveliwala. 2003. Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search. IEEE Trans. Knowl. Data Eng. 15, 4 (2003), 784–796.

Digital Library

[31]

Mark Jerrum, Leslie G. Valiant, and Vijay V. Vazirani. 1986. Random generation of combinatorial structures from a uniform distribution. Theor. Comput. Sci. 43 (1986), 169–188.

Digital Library

[32]

Satyen Kale and C. Seshadhri. 2011. An expansion tester for bounded degree graphs. SIAM J. Comput. 40, 3 (2011), 709–720.

Digital Library

[33]

Johannes Klicpera, Aleksandar Bojchevski, and Stephan Günnemann. 2019. Predict then Propagate: graph neural networks meet personalized pagerank. In Proc. 7th Int. Conf. Learn. Representations. https://openreview.net/forum?id=H1gL-2A9Ym

[34]

Meihao Liao, Rong-Hua Li, Qiangqiang Dai, and Guoren Wang. 2022. Efficient personalized pagerank computation: A spanning forests sampling based approach. In Proc. 2022 ACM SIGMOD Int. Conf. Manage. Data. 2048–2061.

Digital Library

[35]

Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. 2015. Bidirectional pagerank estimation: from average-case to worst-case. In Proc. 12th Int. Workshop Algorithms Models Web Graph, Vol. 9479. 164–176.

Digital Library

[36]

Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. 2016. Personalized pagerank estimation and search: A bidirectional approach. In Proc. 9th ACM Int. Conf. Web Search Data Mining. 163–172.

Digital Library

[37]

Peter Lofgren, Siddhartha Banerjee, Ashish Goel, and Seshadhri Comandur. 2014. FAST-PPR: scaling personalized pagerank estimation for large graphs. In Proc. 20th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 1436–1445.

Digital Library

[38]

Peter Lofgren and Ashish Goel. 2013. Personalized pagerank to a target node. CoRR abs/1304.4658 (2013).

[39]

Dingheng Mo and Siqiang Luo. 2023. Single-source personalized pageranks with workload robustness. IEEE Trans. Knowl. Data Eng. 35, 6 (2023), 6320–6334.

Digital Library

[40]

Daniel A. Spielman and Nikhil Srivastava. 2011. Graph sparsification by effective resistances. SIAM J. Comput. 40, 6 (2011), 1913–1926.

Digital Library

[41]

Daniel A. Spielman and Shang-Hua Teng. 2004. Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In Proc. 36th Annu. ACM Symp. Theory Comput. 81–90.

Digital Library

[42]

Hanzhi Wang, Mingguo He, Zhewei Wei, Sibo Wang, Ye Yuan, Xiaoyong Du, and Ji-Rong Wen. 2021. Approximate graph propagation. In Proc. 27th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 1686–1696.

Digital Library

[43]

Hanzhi Wang and Zhewei Wei. 2023. Estimating single-node pagerank in ??? time. Proc. VLDB Endowment 16, 11 (2023), 2949–2961.

Digital Library

[44]

Hanzhi Wang, Zhewei Wei, Junhao Gan, Sibo Wang, and Zengfeng Huang. 2020. Personalized pagerank to a target node, revisited. In Proc. 26th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 657–667.

Digital Library

[45]

Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, and Mingji Yang. 2024. Revisiting local computation of PageRank: simple and optimal. CoRR abs/2403.12648 (2024).

[46]

Sibo Wang, Youze Tang, Xiaokui Xiao, Yin Yang, and Zengxiang Li. 2016. HubPPR: Effective indexing for approximate personalized pagerank. Proc. VLDB Endowment 10, 3 (2016), 205–216.

Digital Library

[47]

Sibo Wang and Yufei Tao. 2018. Efficient algorithms for finding approximate heavy hitters in personalized pageranks. In Proc. 2018 ACM SIGMOD Int. Conf. Manage. Data. 1113–1127.

Digital Library

[48]

Zhewei Wei, Xiaodong He, Xiaokui Xiao, Sibo Wang, Yu Liu, Xiaoyong Du, and Ji-Rong Wen. 2019. PRSim: Sublinear time simrank computation on large power-law graphs. In Proc. 2019 ACM SIGMOD Int. Conf. Manage. Data. 1042–1059.

Digital Library

[49]

Zhewei Wei, Xiaodong He, Xiaokui Xiao, Sibo Wang, Shuo Shang, and Ji-Rong Wen. 2018. TopPPR: Top-k personalized pagerank queries with precision guarantees on large graphs. In Proc. 2018 ACM SIGMOD Int. Conf. Manage. Data. 441–456.

Digital Library

[50]

Yuan Yin and Zhewei Wei. 2019. Scalable graph embeddings via sparse transpose proximities. In Proc. 25th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 1429–1437.

Digital Library

[51]

Hongyang Zhang, Peter Lofgren, and Ashish Goel. 2016. Approximate personalized pagerank on dynamic graphs. In Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 1315–1324.

Digital Library

[52]

Yanping Zheng, Hanzhi Wang, Zhewei Wei, Jiajun Liu, and Sibo Wang. 2022. Instant graph neural networks for dynamic graphs. In Proc. 28th ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining. 2605–2615.

Digital Library

Index Terms

Revisiting Local Computation of PageRank: Simple and Optimal
1. Theory of computation
  1. Design and analysis of algorithms
    1. Graph algorithms analysis
    2. Streaming, sublinear and near linear time algorithms

Recommendations

An Expansion Tester for Bounded Degree Graphs

We consider the problem of testing graph expansion (either vertex or edge) in the bounded degree model [O. Goldreich and D. Ron, On Testing Expansion in Bounded-Degree Graphs, Technical report TR00-020, ECCC, Potsdam, Germany, 2000]. We give a property ...
Revisiting Local PageRank Estimation on Undirected Graphs: Simple and Optimal
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

We propose a simple and optimal algorithm, BackMC, for local PageRank estimation in undirected graphs: given an arbitrary target node t in an undirected graph G comprising n nodes and m edges, BackMC accurately estimates the PageRank score of node t ...
Local computation of PageRank: the ranking side
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management

Imagine you are a social network user who wants to search, in a list of potential candidates, for the best candidate for a job on the basis of their PageRank-induced importance ranking. Is it possible to compute this ranking for a low cost, by visiting ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

STOC 2024: Proceedings of the 56th Annual ACM Symposium on Theory of Computing

June 2024

2049 pages

ISBN:9798400703836

DOI:10.1145/3618260

General Chairs:
Bojan Mohar
Simon Fraser University, Canada
,
Igor Shinkar
Simon Fraser University, Canada
,
Program Chair:
Ryan O'Donnell
Carnegie Mellon University, USA

Copyright © 2024 Copyright is held by the owner/author(s). Publication rights licensed to ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGACT: ACM Special Interest Group on Algorithms and Computation Theory

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China
Beijing Natural Science Foundation
Beijing Outstanding Young Scientist Program
Alibaba Group through Alibaba Innovative Research Program
Huawei-Renmin University joint program on Information Retrieval
the fund for building world-class universities (disciplines) of Renmin University of China
Engineering Research Center of Next-Generation Intelligent Search and Recommendation, Ministry of Education
Intelligent Social Governance Interdisciplinary Platform, Major Innovation & Planning Interdisciplinary Platform for the ?Double-First Class? Initiative, Public Policy and Decision-making Research Lab
Public Computing Cloud, Renmin University of China

Conference

STOC '24

Sponsor:

SIGACT

STOC '24: 56th Annual ACM Symposium on Theory of Computing

June 24 - 28, 2024

BC, Vancouver, Canada

Acceptance Rates

Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
103
Total Downloads

Downloads (Last 12 months)103
Downloads (Last 6 weeks)21

Reflects downloads up to 04 Oct 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents