Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Incremental and accuracy-aware personalized pagerank through scheduled approximation

Published: 01 April 2013 Publication History
  • Get Citation Alerts
  • Abstract

    As Personalized PageRank has been widely leveraged for ranking on a graph, the efficient computation of Personalized PageRank Vector (PPV) becomes a prominent issue. In this paper, we propose FastPPV, an approximate PPV computation algorithm that is incremental and accuracy-aware. Our approach hinges on a novel paradigm of scheduled approximation: the computation is partitioned and scheduled for processing in an "organized" way, such that we can gradually improve our PPV estimation in an incremental manner, and quantify the accuracy of our approximation at query time. Guided by this principle, we develop an efficient hub based realization, where we adopt the metric of hub-length to partition and schedule random walk tours so that the approximation error reduces exponentially over iterations. Furthermore, as tours are segmented by hubs, the shared substructures between different tours (around the same hub) can be reused to speed up query processing both within and across iterations. Finally, we evaluate FastPPV over two real-world graphs, and show that it not only significantly outperforms two state-of-the-art baselines in both online and offline phrases, but also scale well on larger graphs. In particular, we are able to achieve near-constant time online query processing irrespective of graph size.

    References

    [1]
    R. Andersen, F. Chung, and K. Lang. Local graph partitioning using pagerank vectors. In FOCS, pages 475-486, 2006.
    [2]
    B. Bahmani, K. Chakrabarti, and D. Xin. Fast personalized PageRank on MapReduce. In SIGMOD, pages 973-984, 2011.
    [3]
    B. Bahmani, A. Chowdhury, and A. Goel. Fast incremental and personalized PageRank. VLDB, pages 173-184, 2010.
    [4]
    A. Balmin, V. Hristidis, and Y. Papakonstantinou. ObjectRank: Authority-based keyword search in databases. In VLDB, pages 564-575, 2004.
    [5]
    P. Berkhin. Bookmark-coloring algorithm for personalized pagerank computing. Internet Mathematics, 3(1):41-62, 2006.
    [6]
    S. Chakrabarti. Dynamic personalized pagerank in entity-relation graphs. In WWW, pages 571-580, 2007.
    [7]
    S. Chakrabarti, A. Pathak, and M. Gupta. Index design and query processing for graph conductance search. VLDBJ, 20:445-470, 2010.
    [8]
    D. Fogaras, B. Rácz, K. Csalogány, and T. Sarlós. Towards scaling fully personalized pagerank: Algorithms, lower bounds, and experiments. Internet Mathematics, 2(3):333-358, 2005.
    [9]
    Y. Fujiwara, M. Nakatsuji, T. Yamamuro, H. Shiokawa, and M. Onizuka. Efficient personalized pagerank with accuracy assurance. In SIGKDD, pages 15-23, 2012.
    [10]
    M. Gupta, A. Pathak, and S. Chakrabarti. Fast algorithms for top-k personalized pagerank queries. In WWW, pages 1225-1226, 2008.
    [11]
    T. H. Haveliwala. Topic-Sensitive PageRank: a Context-Sensitive ranking algorithm for web search. TKDE, 15(4):784-796, 2003.
    [12]
    G. Jeh and J. Widom. Scaling personalized web search. In WWW, pages 271-279, 2003.
    [13]
    S. Kamvar, T. Haveliwala, C. Manning, and G. Golub. Exploiting the block structure of the web for computing PageRank. Technical report, Stanford University, 2003.
    [14]
    L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford University, 1999.
    [15]
    A. Papoulis, S. Pillai, and S. Unnikrishna. Probability, random variables, and stochastic processes. McGraw-hill New York, 1965.
    [16]
    A. Pathak, S. Chakrabarti, and M. Gupta. Index design for dynamic personalized PageRank. In ICDE, pages 1489-1491, 2008.
    [17]
    M. Richardson and P. Domingos. The intelligent surfer: Probabilistic combination of link and content information in pagerank. In NIPS, pages 1441-1448, 2002.
    [18]
    P. Sarkar and A. Moore. Fast nearest-neighbor search in disk-resident graphs. In SIGKDD, pages 513-522, 2010.

    Cited By

    View all
    • (2023)Efficient Personalized PageRank Computation: The Power of Variance-Reduced Monte Carlo ApproachesProceedings of the ACM on Management of Data10.1145/35893051:2(1-26)Online publication date: 20-Jun-2023
    • (2023)Personalized PageRank on Evolving Graphs with an Incremental Index-Update SchemeProceedings of the ACM on Management of Data10.1145/35887051:1(1-26)Online publication date: 30-May-2023
    • (2022)Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based ApproachProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526140(2048-2061)Online publication date: 10-Jun-2022
    • Show More Cited By

    Index Terms

    1. Incremental and accuracy-aware personalized pagerank through scheduled approximation
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image Proceedings of the VLDB Endowment
        Proceedings of the VLDB Endowment  Volume 6, Issue 6
        April 2013
        144 pages

        Publisher

        VLDB Endowment

        Publication History

        Published: 01 April 2013
        Published in PVLDB Volume 6, Issue 6

        Qualifiers

        • Article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)2
        • Downloads (Last 6 weeks)0

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Efficient Personalized PageRank Computation: The Power of Variance-Reduced Monte Carlo ApproachesProceedings of the ACM on Management of Data10.1145/35893051:2(1-26)Online publication date: 20-Jun-2023
        • (2023)Personalized PageRank on Evolving Graphs with an Incremental Index-Update SchemeProceedings of the ACM on Management of Data10.1145/35887051:1(1-26)Online publication date: 30-May-2023
        • (2022)Efficient Personalized PageRank Computation: A Spanning Forests Sampling Based ApproachProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526140(2048-2061)Online publication date: 10-Jun-2022
        • (2021)Massively parallel algorithms for personalized pagerankProceedings of the VLDB Endowment10.14778/3461535.346155414:9(1668-1680)Online publication date: 22-Oct-2021
        • (2021)Unifying the Global and Local Approaches: An Efficient Power Iteration with Forward PushProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3457298(1996-2008)Online publication date: 9-Jun-2021
        • (2020)Personalized PageRank to a Target Node, RevisitedProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3394486.3403108(657-667)Online publication date: 23-Aug-2020
        • (2019)Realtime top-k personalized pagerank over large graphs on GPUsProceedings of the VLDB Endowment10.14778/3357377.335737913:1(15-28)Online publication date: 1-Sep-2019
        • (2019)Efficient Algorithms for Approximate Single-Source Personalized PageRank QueriesACM Transactions on Database Systems10.1145/336090244:4(1-37)Online publication date: 23-Oct-2019
        • (2018)Optimized Rank Estimator in Big Data Social NetworksProceedings of the 2018 International Conference on Big Data and Education10.1145/3206157.3206181(80-84)Online publication date: 9-Mar-2018
        • (2018)TopPPRProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3196920(441-456)Online publication date: 27-May-2018
        • Show More Cited By

        View Options

        Get Access

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media