research-article

Unifying the Global and Local Approaches: An Efficient Power Iteration with Forward Push

Authors:

Hao Wu,

Junhao Gan,

Zhewei Wei,

Rui ZhangAuthors Info & Claims

SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data

Pages 1996 - 2008

https://doi.org/10.1145/3448016.3457298

Published: 18 June 2021 Publication History

Get Access

Abstract

Personalized PageRank (PPR) is a critical measure of the importance of a node t to a source node s in a graph. The Single-Source PPR (SSPPR) query computes the PPR's of all the nodes with respect to s on a directed graph G with n nodes and m edges; and it is an essential operation widely used in graph applications. In this paper, we propose novel algorithms for answering two variants of SSPPR queries: (i) high-precision queries and (ii) approximate queries.

For high-precision queries, Power Iteration (PowItr) and Forward Push (FwdPush) are two fundamental approaches. Given an absolute error threshold λ (which is typically set to as small as 10-8), the only known bound of FwdPush is O(m/λ), much worse than the O(m log 1/λ)-bound of PowItr. Whether FwdPush can achieve the same running time bound as PowItr does still remains an open question in the research community. We give a positive answer to this question. We show that the running time of a common implementation of FwdPush is actually bounded by O(m · log 1/λ). Based on this finding, we propose a new algorithm, called Power Iteration with Forward Push (PowerPush), which incorporates the strengths of both PowItr and FwdPush.

For approximate queries (with a relative error ε), we propose a new algorithm, called SpeedPPR, with overall expected time bounded by $O(n · log n · log 1/ε) on scale-free graphs. This improves the state-of-the-art O((n · log n)/ε) bound.

We conduct extensive experiments on six real datasets. The experimental results show that PowerPush outperforms the state-of-the-art high-precision algorithm BePi by up to an order of magnitude in both efficiency and accuracy. Furthermore, our SpeedPPR also outperforms the state-of-the-art approximate algorithm FORA by up to an order of magnitude in all aspects including query time, accuracy, pre-processing time as well as index size.

Supplementary Material

MP4 File (3448016.3457298.mp4)

Personalized PageRank (PPR) is a critical measure on the importance of a node $t$ to a source node $s$ in a graph. A Single-Source PPR (SSPPR) query computes the PPR's of all the nodes with respect to $s$ on a directed graph $G$ with $n$ nodes and $m$ edges, and it is an essential operation widely used in graph applications. In this paper, we propose novel algorithms for solving two variants of SSPPR: (i) high-precision queries and (ii) approximate queries. For the high-precision queries, Power Iteration (PowItr) and Forward Push (FwdPush) are two fundamental approaches. Given an absolute error threshold $\lamda$, the only known bound of FwdPush is $O(\frac{m}{\lamda})$, much worse than the $O(m \log \frac{1}{\lamda})$-bound of PowItr. Whether FwdPush can achieve the same running time bound as PowItr does still remains an open question in the research community. We give a positive answer to this question by showing that the running time of a common implementation of FwdPush is actually bounded by $O(m \cdot \log \frac{1}{\lamda})$. Based on this finding, we propose a new algorithm, called Power Iteration with Forward Push (PowForPush), which incorporates both strengths of PowItr and FwdPush. For approximate queries (with a relative error $\eps$), we propose a new algorithm, called SpeedPPR, with overall expected time bounded by $O(n \cdot \log n \cdot \log \frac{1}{\eps})$ on scale-free graphs. This bound greatly improves the $O(\frac{n \cdot \log n}{\eps})$ bound of a state-of-the-art algorithm FORA. We conduct extensive experiments on six real datasets. The experimental results show that PowForPush outperforms the state-of-the-art algorithm BePi by up to an order of magnitude in both efficiency and accuracy. Furthermore, our SpeedPPR also outperforms FORA by up to an order of magnitude in all aspects includes query time, accuracy, pre-processing time as well as index size.

Download
562.23 MB

References

[1]

Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng. 2007. Local Computation of PageRank Contributions. In WAW. 150--165.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Local Global Tradeoffs in Metric Embeddings

TopPPR: Top-k Personalized PageRank Queries with Precision Guarantees on Large Graphs

Unifying the Landscape of Cell-Probe Lower Bounds

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations