Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1963405.1963488acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article

Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks

Published: 28 March 2011 Publication History
  • Get Citation Alerts
  • Abstract

    We continue the line of research on graph compression started with WebGraph, but we move our focus to the compression of social networks in a proper sense (e.g., LiveJournal): the approaches that have been used for a long time to compress web graphs rely on a specific ordering of the nodes (lexicographical URL ordering) whose extension to general social networks is not trivial. In this paper, we propose a solution that mixes clusterings and orders, and devise a new algorithm, called Layered Label Propagation, that builds on previous work on scalable clustering and can be used to reorder very large graphs (billions of nodes). Our implementation uses task decomposition to perform aggressively on multi-core architecture, making it possible to reorder graphs of more than 600 millions nodes in a few hours.
    Experiments performed on a wide array of web graphs and social networks show that combining the order produced by the proposed algorithm with the WebGraph compression framework provides a major increase in compression with respect to all currently known techniques, both on web graphs and on social networks. These improvements make it possible to analyse in main memory significantly larger graphs.

    References

    [1]
    Alberto Apostolico and Guido Drovandi. Graph compression by BFS. Algorithms, 2(3):1031--1044, 2009.
    [2]
    Michael J. Barber and John W. Clark. Detecting network communities by propagating labels under constraints. Phys. Rev. E, 80(2):026129, Aug 2009.
    [3]
    Krishna Bharat, Andrei Broder, Monika Henzinger, Puneet Kumar, and Suresh Venkatasubramanian. The Connectivity Server: fast access to linkage information on the Web. Computer Networks and ISDN Systems, 30(1--7):469--477, 1998.
    [4]
    Paolo Boldi, Massimo Santini, and Sebastiano Vigna. Permuting web graphs. In WAW '09: Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph, pages 116--126, Berlin, Heidelberg, 2009. Springer-Verlag.
    [5]
    Paolo Boldi, Massimo Santini, and Sebastiano Vigna. Permuting web and social graphs. Internet Math., 6(3):257--283, 2010.
    [6]
    Paolo Boldi and Sebastiano Vigna. The WebGraph framework I: Compression techniques. In Proc. of the Thirteenth International World Wide Web Conference, pages 595--601. ACM Press, 2004.
    [7]
    Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, Michael Mitzenmacher, Alessandro Panconesi, and Prabhakar Raghavan. On compressing social networks. In KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 219--228, New York, NY, USA, 2009. ACM.
    [8]
    Santo Fortunato. Community detection in graphs. Physics Report, 486:75--174, February 2010.
    [9]
    Santo Fortunato and Marc Barthelemy. Resolution limit in community detection. Proceedings of the National Academy of Science, 104:36--41, January 2007.
    [10]
    Anna C. Gilbert and Kirill Levchenko. Compressing network graphs. In Proceedings of the LinkKDD workshop at the 10th ACM Conference on KDD, August 2004.
    [11]
    Ankur Gupta, Wing-Kai Hon, Rahul Shah, and Jeffrey Scott Vitter. Compressed data structures: Dictionaries and data-aware measures. Theoret. Comput. Sci., 387(3):313--331, 2007.
    [12]
    Guy Jacobson. Space-efficient static trees and graphs. In 30th Annual Symposium on Foundations of Computer Science, pages 549--554, Research Triangle Park, North Carolina, 1989. IEEE.
    [13]
    David Knoke and Song Yang. Social Network Analysis. Sage Publications, Inc, second edition edition, 2008.
    [14]
    Donald E. Knuth. The Art of Computer Programming, Volume 4, Fascicle 2: Generating All Tuples and Permutations (Art of Computer Programming). Addison-Wesley Professional, 2005.
    [15]
    Hossein Maserrat and Jian Pei. Neighbor query friendly compression of social networks. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 533--542. ACM, 2010.
    [16]
    Marina Meil\va. Comparing clusterings: an axiomatic view. In ICML '05: Proceedings of the 22nd international conference on Machine learning, pages 577--584, New York, NY, USA, 2005. ACM.
    [17]
    Mark E. J. Newman and Michelle Girvan. Finding and evaluating community structure in networks. Phys. Rev. E, 69(2):026113, Feb 2004.
    [18]
    Usha N. Raghavan, Réka Albert, and Soundar Kumara. Near linear time algorithm to detect community structures in large-scale networks. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 76(3), 2007.
    [19]
    Keith H. Randall, Raymie Stata, Janet L. Wiener, and Rajiv G. Wickremesinghe. The Link Database: Fast access to graphs of the web. In Proceedings of the Data Compression Conference, pages 122--131, Washington, DC, USA, 2002. IEEE Computer Society.
    [20]
    Peter Ronhovde and Zohar Nussinov. Local resolution-limit-free Potts model for community detection. Phys. Rev. E, 81(4):046114, Apr 2010.
    [21]
    Ilya Safro and Boris Temkin. Multiscale approach for the network compression-friendly ordering. Journal of Discrete Algorithms, 2010.
    [22]
    Gergely Tibély and János Kertész. On the equivalence of the label propagation method of community detection and a Potts model approach. Physica A Statistical Mechanics and its Applications, 387:4982--4984, August 2008.
    [23]
    Sebastiano Vigna. Stanford matrix considered harmful. In Andreas Frommer, Michael W. Mahoney, and Daniel B. Szyld, editors, Web Information Retrieval and Linear Algebra Algorithms, number 07071 in Dagstuhl Seminar Proceedings. Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany, 2007.
    [24]
    Stanley Wasserman, Katherine Faust, and Dawn Iacobucci. Social Network Analysis : Methods and Applications (Structural Analysis in the Social Sciences). Cambridge University Press, 1994.

    Cited By

    View all
    • (2024)SeraphProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650719(373-387)Online publication date: 27-Feb-2024
    • (2024)Floating-point histograms for exploratory analysis of large scale real-world data setsIntelligent Data Analysis10.3233/IDA-230638(1-48)Online publication date: 25-Jan-2024
    • (2024)FSM: A Fine-Grained Splitting and Merging Framework for Dual-Balanced Graph PartitionProceedings of the VLDB Endowment10.14778/3665844.366586417:9(2378-2391)Online publication date: 1-May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '11: Proceedings of the 20th international conference on World wide web
    March 2011
    840 pages
    ISBN:9781450306324
    DOI:10.1145/1963405
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 March 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. graph clustering
    2. graph compression
    3. social networks
    4. web graphs

    Qualifiers

    • Research-article

    Conference

    WWW '11
    WWW '11: 20th International World Wide Web Conference
    March 28 - April 1, 2011
    Hyderabad, India

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)50
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)SeraphProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650719(373-387)Online publication date: 27-Feb-2024
    • (2024)Floating-point histograms for exploratory analysis of large scale real-world data setsIntelligent Data Analysis10.3233/IDA-230638(1-48)Online publication date: 25-Jan-2024
    • (2024)FSM: A Fine-Grained Splitting and Merging Framework for Dual-Balanced Graph PartitionProceedings of the VLDB Endowment10.14778/3665844.366586417:9(2378-2391)Online publication date: 1-May-2024
    • (2024)Improving Graph Compression for Efficient Resource-Constrained Graph AnalyticsProceedings of the VLDB Endowment10.14778/3665844.366585217:9(2212-2226)Online publication date: 1-May-2024
    • (2024)FlowWalker: A Memory-Efficient and High-Performance GPU-Based Dynamic Graph Random Walk FrameworkProceedings of the VLDB Endowment10.14778/3659437.365943817:8(1788-1801)Online publication date: 1-Apr-2024
    • (2024)RAGraph: A Region-Aware Framework for Geo-Distributed Graph ProcessingProceedings of the VLDB Endowment10.14778/3632093.363209417:3(264-277)Online publication date: 20-Jan-2024
    • (2024)Algorithm 1043: Faster Randomized SVD with Dynamic ShiftsACM Transactions on Mathematical Software10.1145/366062950:2(1-27)Online publication date: 23-Apr-2024
    • (2024)Play like a Vertex: A Stackelberg Game Approach for Streaming Graph PartitioningProceedings of the ACM on Management of Data10.1145/36549652:3(1-27)Online publication date: 30-May-2024
    • (2024)Graph Summarization: Compactness Meets EfficiencyProceedings of the ACM on Management of Data10.1145/36549432:3(1-26)Online publication date: 30-May-2024
    • (2024)Grafu: Unleashing the Full Potential of Future Value Computation for Out-of-core Synchronous Graph ProcessingProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640409(467-481)Online publication date: 27-Apr-2024
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media