DOI: 10.1145/2808797.2809388
short-paper

Exploiting Phase Transitions for the Efficient Sampling of the Fixed Degree Sequence Model

Published: 25 August 2015

    Abstract

    Real-world network data is often very noisy and contains erroneous or missing edges. These superfluous and missing edges can be identified statistically by assessing the number of common neighbors of the two incident nodes. To evaluate whether this number of common neighbors, the so-called co-occurrence, is statistically significant, it must be compared with the expected co-occurrence in a suitable random graph model. For networks with a skewed degree distribution, which includes most real-world networks, the fixed degree sequence model, which maintains the degrees of all nodes, is known to be preferable to simplified graph models based on an independence assumption. However, using the fixed degree sequence model requires sampling from the space of all graphs with the given degree sequence and measuring the co-occurrence of each pair of nodes in each sample, since no closed formula for this statistic is known. While log-linear approaches such as Markov chain Monte Carlo sampling exist, their computational cost still depends on the length of the Markov chain and the number of samples, and it is significant for large-scale networks. In this article, we show on ground truth data that there are various phase-transition-like tipping points that allow us to choose a comparatively low number of samples and to shorten the Markov chains without reducing the quality of the significance test. As a result, the computational effort can be reduced by an order of magnitude.
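
The sampling procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple undirected graph with integer node labels, uses a generic degree-preserving edge-swap Markov chain, and continues one chain between samples instead of restarting it. The function names (`cooccurrence`, `swap_step`, `fdsm_pvalues`) and the defaults for `num_samples` and `chain_length` are hypothetical; these two parameters are the quantities the abstract says can be kept comparatively small.

```python
import random
from itertools import combinations


def cooccurrence(adj):
    """Count common neighbors for every node pair that shares at least one."""
    counts = {}
    for neighbors in adj.values():
        for u, v in combinations(sorted(neighbors), 2):
            counts[(u, v)] = counts.get((u, v), 0) + 1
    return counts


def swap_step(edges, adj, rng):
    """Attempt one degree-preserving swap; leave the graph unchanged if invalid."""
    i, j = rng.randrange(len(edges)), rng.randrange(len(edges))
    a, b = edges[i]
    c, d = edges[j]
    if rng.random() < 0.5:  # consider both orientations of the second edge
        c, d = d, c
    # Reject swaps that would create a self-loop or a duplicate edge.
    if len({a, b, c, d}) < 4 or d in adj[a] or b in adj[c]:
        return
    # Replace (a, b), (c, d) with (a, d), (c, b); every degree stays the same.
    adj[a].remove(b)
    adj[b].remove(a)
    adj[c].remove(d)
    adj[d].remove(c)
    adj[a].add(d)
    adj[d].add(a)
    adj[c].add(b)
    adj[b].add(c)
    edges[i], edges[j] = (a, d), (c, b)


def fdsm_pvalues(edge_list, num_samples=200, chain_length=None, seed=0):
    """Empirical p-value per observed pair: share of samples whose
    co-occurrence is at least as large as the observed one."""
    rng = random.Random(seed)
    edges = [tuple(e) for e in edge_list]
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    observed = cooccurrence(adj)
    if chain_length is None:
        chain_length = 10 * len(edges)  # assumed heuristic, not from the paper
    exceed = dict.fromkeys(observed, 0)
    for _ in range(num_samples):
        for _ in range(chain_length):  # swap attempts between two samples
            swap_step(edges, adj, rng)
        sample = cooccurrence(adj)
        for pair, obs in observed.items():
            if sample.get(pair, 0) >= obs:
                exceed[pair] += 1
    return {pair: exceed[pair] / num_samples for pair in observed}
```

The total cost is roughly `num_samples` times `chain_length` swap attempts plus one co-occurrence pass per sample, which is why lowering either parameter without degrading the test translates directly into saved computation.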

    Cited By

    • (2023) Increasing the Sampling Efficiency for the Link Assessment Problem. Algorithms for Big Data, pp. 39-56. DOI: 10.1007/978-3-031-21534-6_3. Online publication date: 18-Jan-2023
    • (2016) Increasing sampling efficiency for the fixed degree sequence model with phase transitions. Social Network Analysis and Mining 6:1. DOI: 10.1007/s13278-016-0407-0. Online publication date: 22-Oct-2016
    • (2015) Omitting Refresh. Proceedings of the 2015 International Symposium on Memory Systems, pp. 85-91. DOI: 10.1145/2818950.2818964. Online publication date: 5-Oct-2015

        Published In

        ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015
        August 2015
        835 pages
        ISBN:9781450338547
        DOI:10.1145/2808797

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Qualifiers

        • Short-paper
        • Research
        • Refereed limited

        Conference

        ASONAM '15

        Acceptance Rates

        Overall Acceptance Rate 116 of 549 submissions, 21%
