Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3097983.3098072acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

FORA: Simple and Effective Approximate Single-Source Personalized PageRank

Published: 04 August 2017 Publication History

Abstract

Given a graph G, a source node s and a target node t, the personalized PageRank (PPR) of t with respect to s is the probability that a random walk starting from s terminates at t. A single-source PPR (SSPPR) query enumerates all nodes in G, and returns the top-k nodes with the highest PPR values with respect to a given source node s. SSPPR has important applications in web search and social networks, e.g., in Twitter's Who-To-Follow recommendation service. However, SSPPR computation is immensely expensive, and at the same time resistant to indexing and materialization. So far, existing solutions either use heuristics, which do not guarantee result quality, or rely on the strong computing power of modern data centers, which is costly.
Motivated by this, we propose FORA, a simple and effective index-based solution for approximate SSPPR processing, with rigorous guarantees on result quality. The basic idea of FORA is to combine two existing methods Forward Push (which is fast but does not guarantee quality) and Monte Carlo Random Walk (accurate but slow) in a simple and yet non-trivial way, leading to an algorithm that is both fast and accurate. Further, FORA includes a simple and effective indexing scheme, as well as a module for top-k selection with high pruning power. Extensive experiments demonstrate that FORA is orders of magnitude more efficient than its main competitors. Notably, on a billion-edge Twitter dataset, FORA answers a top-500 approximate SSPPR query within 5 seconds, using a single commodity server.

Supplementary Material

MP4 File (wang_personalized_pagerank.mp4)

References

[1]
Reid Andersen, Christian Borgs, Jennifer T. Chayes, John E. Hopcroft, Vahab S. Mirrokni, and Shang-Hua Teng 2007. Local Computation of PageRank Contributions. In WAW. 150--165.
[2]
Reid Andersen, Fan R. K. Chung, and Kevin J. Lang. 2006. Local Graph Partitioning using PageRank Vectors. FOCS. 475--486.
[3]
Konstantin Avrachenkov, Nelly Litvak, Danil Nemirovsky, Elena Smirnova, and Marina Sokol. 2011. Quick Detection of Top-k Personalized PageRank Lists WAW. 50--61.
[4]
Lars Backstrom and Jure Leskovec 2011. Supervised random walks: predicting and recommending links in social networks WSDM. 635--644.
[5]
Bahman Bahmani, Kaushik Chakrabarti, and Dong Xin. 2011. Fast personalized PageRank on MapReduce. In SIGMOD. 973--984.
[6]
Pavel Berkhin. 2005. Survey: A Survey on PageRank Computing. Internet Mathematics, Vol. 2, 1 (2005), 73--120.
[7]
Fan R. K. Chung and Lincoln Lu 2006. Survey: Concentration Inequalities and Martingale Inequalities: A Survey. Internet Mathematics, Vol. 3, 1 (2006), 79--127.
[8]
Dániel Fogaras, Balázs Rácz, Károly Csalogány, and Tamás Sarlós. 2005. Towards scaling fully personalized pagerank: Algorithms, lower bounds, and experiments. Internet Mathematics, Vol. 2, 3 (2005), 333--358.
[9]
Yasuhiro Fujiwara, Makoto Nakatsuji, Hiroaki Shiokawa, Takeshi Mishima, and Makoto Onizuka. 2013. Efficient ad-hoc search for personalized PageRank. SIGMOD. 445--456.
[10]
Yasuhiro Fujiwara, Makoto Nakatsuji, Takeshi Yamamuro, Hiroaki Shiokawa, and Makoto Onizuka. 2012. Efficient personalized pagerank with accuracy assurance KDD. 15--23.
[11]
Manish S. Gupta, Amit Pathak, and Soumen Chakrabarti. 2008. Fast algorithms for topk personalized pagerank queries WWW. 1225--1226.
[12]
Pankaj Gupta, Ashish Goel, Jimmy Lin, Aneesh Sharma, Dong Wang, and Reza Zadeh. 2013. Wtf: The who to follow service at twitter. In WWW. 505--514.
[13]
Glen Jeh and Jennifer Widom 2003. Scaling personalized web search. In WWW. 271--279.
[14]
Peter Lofgren. 2015. Efficient Algorithms for Personalized PageRank. CoRR Vol. abs/1512.04633 (2015).
[15]
Peter Lofgren, Siddhartha Banerjee, and Ashish Goel. 2016. Personalized pagerank estimation and search: A bidirectional approach WSDM. 163--172.
[16]
Peter A Lofgren, Siddhartha Banerjee, Ashish Goel, and C Seshadhri 2014. Fast-ppr: Scaling personalized pagerank estimation for large graphs KDD. 1436--1445.
[17]
Takanori Maehara, Takuya Akiba, Yoichi Iwata, and Ken-ichi Kawarabayashi 2014. Computing personalized PageRank quickly by exploiting graph structures. PVLDB, Vol. 7, 12 (2014), 1023--1034.
[18]
Naoto Ohsaka, Takanori Maehara, and Ken-ichi Kawarabayashi. 2015. Efficient PageRank Tracking in Evolving Networks. SIGKDD 2015. 875--884.
[19]
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd 1999. The PageRank citation ranking: bringing order to the web. (1999).
[20]
Atish Das Sarma, Anisur Rahaman Molla, Gopal Pandurangan, and Eli Upfal 2013. Fast Distributed PageRank Computation. In ICDCN. 11--26.
[21]
Kijung Shin, Jinhong Jung, Lee Sael, and U. Kang. 2015. BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs SIGMOD. 1571--1585.
[22]
Sibo Wang, Youze Tang, Xiaokui Xiao, Yin Yang, and Zengxiang Li 2016. HubPPR: Effective Indexing for Approximate Personalized PageRank. PVLDB, Vol. 10, 3 (2016), 205--216. http://www.vldb.org/pvldb/vol10/p205-wang.pdf
[23]
Hongyang Zhang, Peter Lofgren, and Ashish Goel. 2016. Approximate Personalized PageRank on Dynamic Graphs KDD. 1315--1324.
[24]
Fanwei Zhu, Yuan Fang, Kevin Chen-Chuan Chang, and Jing Ying 2013. Incremental and Accuracy-Aware Personalized PageRank through Scheduled Approximation. PVLDB, Vol. 6, 6 (2013), 481--492.

Cited By

View all
  • (2024)BIRD: Efficient Approximation of Bidirectional Hidden Personalized PageRankProceedings of the VLDB Endowment10.14778/3665844.366585517:9(2255-2268)Online publication date: 1-May-2024
  • (2024)QTCS: Efficient Query-Centered Temporal Community SearchProceedings of the VLDB Endowment10.14778/3648160.364816317:6(1187-1199)Online publication date: 1-Feb-2024
  • (2024)Efficient and Provable Effective Resistance Computation on Large Graphs: An Index-based ApproachProceedings of the ACM on Management of Data10.1145/36549362:3(1-27)Online publication date: 30-May-2024
  • Show More Cited By

Index Terms

  1. FORA: Simple and Effective Approximate Single-Source Personalized PageRank

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
    August 2017
    2240 pages
    ISBN:9781450348874
    DOI:10.1145/3097983
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. forward push
    2. personalized pagerank
    3. random walk

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    KDD '17
    Sponsor:

    Acceptance Rates

    KDD '17 Paper Acceptance Rate 64 of 748 submissions, 9%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)65
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)BIRD: Efficient Approximation of Bidirectional Hidden Personalized PageRankProceedings of the VLDB Endowment10.14778/3665844.366585517:9(2255-2268)Online publication date: 1-May-2024
    • (2024)QTCS: Efficient Query-Centered Temporal Community SearchProceedings of the VLDB Endowment10.14778/3648160.364816317:6(1187-1199)Online publication date: 1-Feb-2024
    • (2024)Efficient and Provable Effective Resistance Computation on Large Graphs: An Index-based ApproachProceedings of the ACM on Management of Data10.1145/36549362:3(1-27)Online publication date: 30-May-2024
    • (2024)Efficient High-Quality Clustering for Large Bipartite GraphsProceedings of the ACM on Management of Data10.1145/36392782:1(1-27)Online publication date: 26-Mar-2024
    • (2024)Fast Query of Biharmonic Distance in NetworksProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671856(1887-1897)Online publication date: 25-Aug-2024
    • (2024)Fast Computation for the Forest Matrix of an Evolving GraphProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671822(2755-2764)Online publication date: 25-Aug-2024
    • (2024)Topology-monitorable Contrastive Learning on Dynamic GraphsProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671777(4700-4711)Online publication date: 25-Aug-2024
    • (2024)PSMC: Provable and Scalable Algorithms for Motif Conductance Based Graph ClusteringProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671666(1793-1803)Online publication date: 25-Aug-2024
    • (2024)LiGNN: Graph Neural Networks at LinkedInProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671566(4793-4803)Online publication date: 25-Aug-2024
    • (2024)Towards Deeper Understanding of PPR-based Embedding Approaches: A Topological PerspectiveProceedings of the ACM Web Conference 202410.1145/3589334.3645663(969-979)Online publication date: 13-May-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media