Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3295500.3356182acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

Published: 17 November 2019 Publication History

Abstract

We propose Slim Graph: the first programming model and framework for practical lossy graph compression that facilitates high-performance approximate graph processing, storage, and analytics. Slim Graph enables the developer to express numerous compression schemes using small and programmable compression kernels that can access and modify local parts of input graphs. Such kernels are executed in parallel by the underlying engine, isolating developers from complexities of parallel programming. Our kernels implement novel graph compression schemes that preserve numerous graph properties, for example connected components, minimum spanning trees, or graph spectra. Finally, Slim Graph uses statistical divergences and other metrics to analyze the accuracy of lossy graph compression. We illustrate both theoretically and empirically that Slim Graph accelerates numerous graph algorithms, reduces storage used by graph datasets, and ensures high accuracy of results. Slim Graph may become the common ground for developing, executing, and analyzing emerging lossy graph compression schemes.

References

[1]
[n.d.]. Hyperlink Graph 2012. http://webdatacommons.org/hyperlinkgraph/2012-08/download.html.
[2]
Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Analyzing graph structure via linear measurements. In Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms. SIAM, 459--467.
[3]
Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Graph sketches: sparsification, spanners, and subgraphs. In Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems. ACM, 5--14.
[4]
Stephen Alstrup, Søren Dahlgaard, Arnold Filtser, Morten Stöckel, and Christian Wulff-Nilsen. 2017. Constructing light spanners deterministically in near-linear time. arXiv preprint arXiv.1709.01960 (2017).
[5]
Ingo Althöfer, Gautam Das, David Dobkin, Deborah Joseph, and José Soares. 1993. On sparse spanners of weighted graphs. Discrete & Computational Geometry 9, 1 (1993), 81--100.
[6]
David G Anderson, Ming Gu, and Christopher Melgaard. 2014. An efficient algorithm for unweighted spectral graph sparsification. arXiv preprint arXiv:1410A273 (2014).
[7]
David A Bader, Shiva Kintali, Kamesh Madduri, and Milena Mihail. 2007. Approximating betweenness centrality. In International Workshop on Algorithms and Models for the Web-Graph. Springer, 124--137.
[8]
Michèle Basseville. 2010. Divergence measures for statistical data processing. (2010).
[9]
Surender Baswana, Telikepalli Kavitha, Kurt Mehlhorn, and Seth Pettie. 2010. Additive spanners and (α, β)-spanners. ACM Transactions on Algorithms (TALG) 7, 1 (2010), 5.
[10]
Surender Baswana and Sandeep Sen. 2007. A simple and linear time randomized algorithm for computing sparse spanners in weighted graphs. Random Structures & Algorithms 30, 4 (2007), 532--563.
[11]
Omar Batarfi, Radwa El Shawi, Ayman G Fayoumi, Reza Nouri, Ahmed Barnawi, Sherif Sakr, et al. 2015. Large scale graph processing systems: survey and an experimental evaluation. Cluster Computing 18, 3 (2015), 1189--1213.
[12]
Joshua Batson, Daniel A Spielman, Nikhil Srivastava, and Shang-Hua Teng. 2013. Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 8 (2013), 87--94.
[13]
Scott Beamer, Krste Asanović, and David Patterson. 2015. The GAP benchmark suite. arXiv preprint arXiv.1508.03619 (2015).
[14]
Maham Anwar Beg, Muhammad Ahmad, Arif Zaman, and Imdadullah Khan. 2018. Scalable Approximation Algorithm for Graph Summarization. pacific-asia conference on knowledge discovery and data mining (2018), 502--514.
[15]
András A Benczúr and David R Karger. 1996. Approximating st minimum cuts in O (n 2) time. In Proceedings of the twenty-eighth annual ACM symposium on Theory of computing. ACM, 47--55.
[16]
Suman K Bera and Amit Chakrabarti. 2017. Towards tighter space bounds for counting triangles and other substructures in graph streams. In 34th Symposium on Theoretical Aspects of Computer Science (STACS 2017). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.
[17]
Maciej Besta and Torsten Hoefler. 2018. Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations. arXiv preprint arXiv:1806.01799 (2018).
[18]
Maciej Besta, Michał Podstawski, Linus Groner, Edgar Solomonik, and Torsten Hoefler. 2017. To push or to pull: On reducing communication and synchronization in graph computations. In Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 93--104.
[19]
Maciej Besta, Dimitri Stanojevic, Tijana Zivic, Jagpreet Singh, Maurice Hoerold, and Torsten Hoefler. 2018. Log (graph): a near-optimal high-performance graph representation. In Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques. ACM, 7.
[20]
Paolo Boldi, Marco Rosa, and Sebastiano Vigna. 2011. HyperANF: Approximating the neighbourhood function of very large graphs on a budget. In Proceedings of the 20th international conference on World wide web. ACM, 625--634.
[21]
Paolo Boldi and Sebastiano Vigna. 2004. The webgraph framework I: compression techniques. In Proceedings of the 13th international conference on World Wide Web. ACM, 595--602.
[22]
Michele Borassi and Emanuele Natale. 2016. KADABRA is an adaptive algorithm for betweenness via random approximation. arXiv preprint arXiv:1604.08553 (2016).
[23]
Stéphane Boucheron, Gábor Lugosi, and Pascal Massart. 2013. Concentration inequalities: A nonasymptotic theory of independence. Oxford university press.
[24]
Ulrik Brandes. 2001. A faster algorithm for betweenness centrality. Journal of mathematical sociology 25, 2 (2001), 163--177.
[25]
Nieves R Brisaboa, Susana Ladra, and Gonzalo Navarro. 2009. k2-Trees for Compact Web Graph Representation. In SPIRE, Vol. 9. Springer, 18--30.
[26]
Luciana S Buriol, Gereon Frahling, Stefano Leonardi, Alberto Marchetti-Spaccamela, and Christian Sohler. 2006. Counting triangles in data streams. In Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 253--262.
[27]
Daniele Calandriello, Ioannis Koutis, Alessandro Lazaric, and Michal Valko. 2018. Improved large-scale graph learning through ridge spectral sparsification. In International Conference on Machine Learning.
[28]
Stéphane Campinas, Renaud Delbru, and Giovanni Tummarello. 2013. Efficiency and precision trade-offs in graph summary algorithms. In Proceedings of the 17th International Database Engineering and Applications Symposium on. 38--47.
[29]
Keren Censor-Hillel and Michal Dory. 2018. Distributed spanner approximation. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 139--148.
[30]
Keren Censor-Hillel, Ami Paz, and Noam Ravid. 2018. The Sparsest Additive Spanner via Multiple Weighted BFS Trees. arXiv preprint arXiv:1811.01997 (2018).
[31]
Sung-Hyuk Cha. 2007. Comprehensive survey on distance/similarity measures between probability density functions. City 1, 2 (2007), 1.
[32]
Bernard Chazelle, Ronitt Rubinfeld, and Luca Trevisan. 2005. Approximating the minimum spanning tree weight in sublinear time. SIAM Journal on computing 34, 6 (2005), 1370--1379.
[33]
Shiri Chechik, Daniel H Larkin, Liam Roditty, Grant Schoenebeck, Robert E Tarjan, and Virginia Vassilevska Williams. 2014. Better approximation algorithms for the graph diameter. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, 1041--1052.
[34]
Mostafa Haghir Chehreghani, Albert Bifet, and Talel Abdessalem. 2018. Efficient Exact and Approximate Algorithms for Computing Betweenness Centrality in Directed Graphs. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 752--764.
[35]
Chen Chen, Cindy Xide Lin, Matt Fredrikson, Mihai Christodorescu, Xifeng Yan, and Jiawei Han. 2009. Mining graph patterns efficiently via randomized summaries. very large data bases 2, 1 (2009), 742--753.
[36]
Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, and Sambavi Muthukrishnan. 2015. One trillion edges: Graph processing at facebook-scale. Proceedings of the VLDB Endowment 8, 12 (2015), 1804--1815.
[37]
Nicos Christofides. 1976. Worst-case analysis of a new heuristic for the travelling salesman problem. Technical Report. Carnegie-Mellon Univ Pittsburgh Pa Management Sciences Research Group.
[38]
Timothy Chu, Yu Gao, Richard Peng, Sushant Sachdeva, Saurabh Sawlani, and Junxing Wang. 2018. Graph sparsification, spectral sketches, and faster resistance computation, via short cycle decompositions. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 361--372.
[39]
Thomas H Cormen, Charles E Leiserson, Ronald L Rivest, and Clifford Stein. 2009. Introduction to algorithms. MIT press.
[40]
Graham Cormode and Hossein Jowhari. 2017. A second look at counting triangles in graph streams (corrected). Theoretical Computer Science 683 (2017), 22--30.
[41]
Thomas M Cover and Joy A Thomas. 2012. Elements of information theory. John Wiley & Sons.
[42]
Ketan Date, Keven Feng, Rakesh Nagi, Jinjun Xiong, Nam Sung Kim, and Wen-Mei Hwu. 2017. Collaborative (cpu+ gpu) algorithms for triangle counting and truss decomposition on the minsky architecture: Static graph challenge: Subgraph isomorphism. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.
[43]
Etienne de Klerk, Dmitrii V Pasechnik, and Joost P Warners. 2004. On approximate graph colouring and max-k-cut algorithms based on the θ-function. Journal of Combinatorial Optimization 8, 3 (2004), 267--294.
[44]
Camil Demetrescu, Andrew V Goldberg, and David S Johnson. 2009. The Shortest Path Problem: Ninth DIMLACS Implementation Challenge. Vol. 74. American Math. Soc.
[45]
Niels Doekemeijer and Ana Lucia Varbanescu. 2014. A survey of parallel graph processing frameworks. Delft University of Technology (2014), 21.
[46]
Michal Dory. 2018. Distributed Approximation of Minimum k-edge-connected Spanning Subgraphs. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 149--158.
[47]
Stefania Dumbrava, Angela Bonifati, Amaia Nazabal Ruiz Diaz, and Romain Vuillemot. 2018. Approximate Evaluation of Label-Constrained Reachability Queries. arXiv preprint arXiv.1811.11561 (2018).
[48]
Cody Dunne and Ben Shneiderman. 2013. Motif simplification: improving network visualization readability with fan, connector, and clique glyphs. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 3247--3256.
[49]
Ghizlane ECHBARTHI and Hamamache KHEDDOUCI. 2017. Lasas: an aggregated search based graph matching approach. In The 29th International Conference on Software Engineering and Knowledge Engineering.
[50]
Talya Eden, Amit Levi, Dana Ron, and C Seshadhri. 2017. Approximately counting triangles in sublinear time. SIAM J. Comput. 46, 5 (2017), 1603--1646.
[51]
Michael Elkin and Ofer Neiman. 2018. Efficient algorithms for constructing very sparse spanners and emulators. ACM Transactions on Algorithms (TALG) 15, 1 (2018), 4.
[52]
Guy Even, Joseph Naor, Satish Rao, and Baruch Schieber. 1999. Fast approximate graph partitioning algorithms. SIAM J. Comput. 28, 6 (1999), 2187--2214.
[53]
Wenfei Fan, Jianzhong Li, Xin Wang, and Yinghui Wu. 2012. Query preserving graph compression. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 157--168.
[54]
Arash Farzan and J Ian Munro. 2008. Succinct representations of arbitrary graphs. In European Symposium on Algorithms. Springer, 393--404.
[55]
Zhuo Feng. 2016. Spectral graph sparsification in nearly-linear time leveraging efficient spectral perturbation analysis. In Proceedings of the 53rd Annual Design Automation Conference. ACM, 57.
[56]
Jacob Fox, Tim Roughgarden, C Seshadhri, Fan Wei, and Nicole Wein. 2018. Finding cliques in social networks: A new distribution-free model. arXiv preprint arXiv:1804.07431 (2018).
[57]
Haohuan Fu, Junfeng Liao, Jinzhe Yang, Lanning Wang, Zhenya Song, Xiaomeng Huang, Chao Yang, Wei Xue, Fangfang Liu, Fangli Qiao, et al. 2016. The Sunway TaihuLight supercomputer: system and applications. Science China Information Sciences 59, 7 (2016), 072001.
[58]
Robert Geisberger, Peter Sanders, and Dominik Schultes. 2008. Better approximation of betweenness centrality. In Proceedings of the Meeting on Algorithm Engineering & Expermiments. Society for Industrial and Applied Mathematics, 90--100.
[59]
Robert Gerstenberger, Maciej Besta, and Torsten Hoefler. 2014. Enabling highly-scalable remote memory access programming with MPI-3 one sided. Scientific Programming 22, 2 (2014), 75--91.
[60]
Oded Green, Pavan Yalamanchili, and Lluís-Miquel Munguía. 2014. Fast triangle counting on the GPU. In Proceedings of the 4th Workshop on Irregular Applications: Architectures and Algorithms. IEEE Press, 1--8.
[61]
Sudipto Guha and Andrew McGregor. 2012. Graph synopses, sketches, and streams: A survey. Proceedings of the VLDB Endowment 5, 12 (2012), 2030--2031.
[62]
Magnús M Halldórsson. 1993. A still better performance guarantee for approximate graph coloring. Inform. Process. Lett. 45, 1 (1993), 19--23.
[63]
Safiollah Heidari, Yogesh Simmhan, Rodrigo N Calheiros, and Rajkumar Buyya. 2018. Scalable graph processing frameworks: A taxonomy and open challenges. ACM Computing Surveys (CSUR) 51, 3 (2018), 60.
[64]
Wilko Henecka and Matthew Roughan. 2015. Lossy compression of dynamic, weighted graphs. In 2015 3rd International Conference on Future Internet of Things and Cloud. IEEE, 427--434.
[65]
Pili Hu and Wing Cheong Lau. 2013. A survey and taxonomy of graph sampling. arXiv preprint arXiv:1308.5865 (2013).
[66]
Yang Hu, Hang Liu, and H Howie Huang. 2018. High-Performance Triangle Counting on GPUs. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--5.
[67]
Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, and Ion Stoica. 2018. {ASAP}: Fast, Approximate Graph Pattern Mining at Scale. In 13th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 18). 745--761.
[68]
Anand Padmanabha Iyer, Aurojit Panda, Shivaram Venkataraman, Mosharaf Chowdhury, Aditya Akella, Scott Shenker, and Ion Stoica. 2018. Bridging the GAP: towards approximate graph analytics. In Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). ACM, 10.
[69]
Arun Jambulapati and Aaron Sidford. 2018. Efficient O (n/epsilon) Spectral Sketches for the Laplacian and its Pseudoinverse. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2487--2503.
[70]
Madhav Jha, Ali Pinar, and C Seshadhri. 2015. Counting triangles in real-world graph streams: Dealing with repeated edges and time windows. In 2015 49th Asilomar Conference on Signals, Systems and Computers. IEEE, 1507--1514.
[71]
Madhav Jha, C Seshadhri, and Ali Pinar. 2015. A space-efficient streaming algorithm for estimating transitivity and triangle counts using the birthday paradox. ACM Transactions on Knowledge Discovery from Data (TKDD) 9, 3 (2015), 15.
[72]
Hossein Jowhari and Mohammad Ghodsi. 2005. New streaming algorithms for counting triangles in graphs. In International Computing and Combinatorics Conference. Springer, 710--716.
[73]
Vasiliki Kalavri, Tiago Simas, and Dionysios Logothetis. 2016. The shortest path is not always a straight line: leveraging semi-metricity in graph analysis. Proceedings of the VLDB Endowment 9, 9 (2016), 672--683.
[74]
David R. Karger. 2000. Minimum cuts in near-linear time. J. ACM 47, 1 (2000), 46--76.
[75]
Jonathan A Kelner and Alex Levin. 2013. Spectral sparsification in the semi-streaming setting. Theory of Computing Systems 53, 2 (2013), 243--262.
[76]
Subhash Khot and Oded Regev. 2008. Vertex cover might be hard to approximate to within 2- ε. J. Comput. System Sci 74, 3 (2008), 335--349.
[77]
Ioannis Koutis and Shen Chen Xu. 2016. Simple parallel and distributed algorithms for spectral graph sparsification. ACM Transactions on Parallel Computing (TOPC) 3, 2 (2016), 14.
[78]
Solomon Kullback. 1997. Information theory and statistics. Courier Corporation.
[79]
K. Ashwin Kumar and Petros Efstathopoulos. 2018. Utility-driven graph summarization. very large data bases 12, 4 (2018), 335--347.
[80]
Jérôme Kunegis. 2013. Konect: the koblenz network collection. In Proc. of Intl. Conf. on World Wide Web (WWW). ACM, 1343--1350.
[81]
Rasmus Kyng and Zhao Song. 2018. A Matrix Chernoff Bound for Strongly Rayleigh Distributions and Spectral Sparsifiers from a few Random Spanning Trees. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 373--384.
[82]
Kyong-Ha Lee, Yoon-Joon Lee, Hyunsik Choi, Yon Dohn Chung, and Bongki Moon. 2012. Parallel data processing with MapReduce: a survey. AcM sIGMoD Record 40, 4 (2012), 11--20.
[83]
Yin Tat Lee and He Sun. 2018. Constructing linear-sized spectral sparsification in almost-linear time. SIAM J. Comput. 47, 6 (2018), 2315--2336.
[84]
Christoph Lenzen and Reut Levi. 2018. A Centralized Local Algorithm for the Sparse Spanning Graph Problem. In 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.
[85]
Jure Leskovec and Christos Faloutsos. 2006. Sampling from large graphs. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 631--636.
[86]
Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data.
[87]
Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, et al. 2018. ShenTu: processing multi-trillion edge graphs on millions of cores in seconds. In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis. IEEE Press, 56.
[88]
Wei Liu, Andrey Kan, Jeffrey Chan, James Bailey, Christopher Leckie, Jian Pei, and Ramamohanarao Kotagiri. 2012. On compressing weighted time-evolving graphs. In Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2319--2322.
[89]
Xingjie Liu, Yuanyuan Tian, Qi He, Wang-Chien Lee, and John McPherson. 2014. Distributed Graph Summarization. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 799--808.
[90]
Yike Liu, Tara Safavi, Abhilash Dighe, and Danai Koutra. 2018. Graph Summarization Methods and Applications: A Survey. Comput. Surveys 51, 3 (2018), 62.
[91]
Yi Lu, James Cheng, Da Yan, and Huanhuan Wu. 2014. Large-scale distributed graph computing systems: An experimental evaluation. Proceedings of the VLDB Endowment 8, 3 (2014), 281--292.
[92]
Andrew Lumsdaine, Douglas Gregor, Bruce Hendrickson, and Jonathan W. Berry. 2007. Challenges in Parallel Graph Processing. Par. Proc. Let. 17, 1 (2007), 5--20.
[93]
Grzegorz Malewicz, Matthew H. Austern, Aart J.C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. 2010. Pregel: a system for large-scale graph processing. In Proc. of the ACM SIGMOD Intl. Conf. on Manag. of Data (SIGMOD '10). ACM, New York, NY, USA, 135--146.
[94]
Hossein Maserrat and Jian Pei. 2012. Community Preserving Lossy Compression of Social Networks. In 2012 IEEE 12th International Conference on Data Mining. 509--518.
[95]
Robert Ryan McCune, Tim Weninger, and Greg Madey. 2015. Thinking like a vertex: a survey of vertex-centric frameworks for large-scale distributed graph processing. ACM Computing Surveys (CSUR) 48, 2 (2015), 25.
[96]
Andrew McGregor, Sofya Vorotnikova, and Hoa T Vu. 2016. Better algorithms for counting triangles in data streams. In Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. ACM, 401--411.
[97]
Gary L Miller, Richard Peng, Adrian Vladu, and Shen Chen Xu. 2015. Improved parallel algorithms for spanners and hopsets. In Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures. ACM, 192--201.
[98]
Richard C Murphy, Kyle B Wheeler, Brian W Barrett, and James A Ang. 2010. Introducing the graph 500. Cray User's Group (CUG) (2010).
[99]
Moni Naor. 1990. Succinct representation of general unlabeled graphs. Discrete Applied Mathematics 28, 3 (1990), 303--307.
[100]
Saket Navlakha, Rajeev Rastogi, and Nisheeth Shrivastava. 2008. Graph summarization with bounded error. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data. ACM, 419--432.
[101]
Donald Nguyen, Andrew Lenharth, and Keshav Pingali. 2013. A lightweight infrastructure for graph analytics. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles. ACM, 456--471.
[102]
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.
[103]
Merav Parter, Ronitt Rubinfeld, Ali Vakilian, and Anak Yodpinyanee. 2018. Local Computation Algorithms for Spanners. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.
[104]
Aduri Pavan, Srikanta Tirthapura, et al. 2013. Counting and sampling triangles from a graph stream. (2013).
[105]
David Peleg and Alejandro A Schäffer. 1989. Graph spanners. Journal of graph theory 13, 1 (1989), 99--116.
[106]
David Peleg and Jeffrey D Ullman. 1989. An optimal synchronizer for the hypercube. SIAM Journal on computing 18, 4 (1989), 740--747.
[107]
Seth Pettie. 2010. Distributed algorithms for ultrasparse spanners and linear size skeletons. Distributed Computing 22, 3 (2010), 147--166.
[108]
Adam Polak. 2016. Counting triangles in large graphs on GPU. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). IEEE, 740--746.
[109]
Rajeev Raman, Venkatesh Raman, and Srinivasa Rao Satti. 2007. Succinct indexable dictionaries with applications to encoding k-ary trees, prefix sums and multisets. ACM Transactions on Algorithms (TALG) 3, 4 (2007), 43.
[110]
Raimundo Real and Juan M Vargas. 1996. The probabilistic basis of Jaccard's index of similarity. Systematic biology 45, 3 (1996), 380--385.
[111]
Matteo Riondato, David García-Soriano, and Francesco Bonchi. 2017. Graph summarization with quality guarantees. Data Mining and Knowledge Discovery 31, 2 (2017), 314--349.
[112]
Matteo Riondato and Evgenios M Kornaropoulos. 2016. Fast approximation of betweenness centrality through sampling. Data Mining and Knowledge Discovery 30, 2 (2016), 438--475.
[113]
Matteo Riondato and Eli Upfal. 2018. ABRA: Approximating betweenness centrality in static and dynamic graphs with rademacher averages. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 5 (2018), 61.
[114]
Liam Roditty and Virginia Vassilevska Williams. 2013. Fast approximation algorithms for the diameter and radius of sparse graphs. In Proceedings of the forty-fifth annual ACM symposium on Theory of computing. ACM, 515--524.
[115]
Amin Sadri, Flora D. Salim, Yongli Ren, Masoomeh Zameni, Jeffrey Chan, and Timos Sellis. 2017. Shrink: Distance preserving graph compression. Information Systems 69 (2017), 180--193.
[116]
Semih Salihoglu and Jennifer Widom. 2014. Optimizing graph algorithms on Pregel-like systems. Proceedings of the VLDB Endowment 7, 7 (2014), 577--588.
[117]
Ahmet Erdem Sariyüce, Kamer Kaya, Erik Saule, and Ümit V Çatalyürek. 2013. Betweenness centrality on GPUs and heterogeneous architectures. In Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units. ACM, 76--85.
[118]
Berkant Savas and Inderjit S Dhillon. 2011. Clustered low rank approximation of graphs in information science applications. In Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM, 164--175.
[119]
C Seshadhri. 2015. A simpler sublinear algorithm for approximating the triangle count. arXiv preprint arXiv:1505.01927 (2015).
[120]
Comandur Seshadhri, Ali Pinar, and Tamara G Kolda. 2013. Fast triangle counting through wedge sampling. In Proceedings of the SIAM Conference on Data Mining, Vol. 4. 5.
[121]
C Seshadhri, Ali Pinar, and Tamara G Kolda. 2014. Wedge sampling for computing clustering coefficients and triangle counts on large graphs. Statistical Analysis and Data Mining: The ASA Data Science Journal 7, 4 (2014), 294--307.
[122]
Zechao Shang and Jeffrey Xu Yu. 2014. Auto-approximation of graph computing. Proceedings of the VLDB Endowment 7, 14 (2014), 1833--1844.
[123]
Xuanhua Shi, Zhi gao Zheng, Yongluan Zhou, Hai Jin, Ligang He, Bo Liu, and Qiang-Sheng Hua. 2018. Graph processing on GPUs: A survey. ACM Computing Surveys (CSUR) 50, 6 (2018), 81.
[124]
Kijung Shin, Amol Ghoting, Myunghwan Kim, and Hema Raghavan. 2019. Sweg: Lossless and lossy summarization of web-scale graphs. In Proceedings of the 28th International Conference on World Wide Web. ACM, Vol. 1. 1--2.
[125]
Julian Shun and Kanat Tangwongsan. 2015. Multicore triangle computations without tuning. In 2015 IEEE 31st International Conference on Data Engineering. IEEE, 149--160.
[126]
Somesh Singh and Rupesh Nasre. 2018. Scalable and Performant Graph Processing on GPUs Using Approximate Computing. IEEE Transactions on Multi-Scale Computing Systems 4, 3 (2018), 190--203.
[127]
George M Slota and Kamesh Madduri. 2014. Complex network analysis using parallel approximate motif counting. In Parallel and Distributed Processing Symposium, 2014 IEEE 28th International. IEEE, 405--414.
[128]
Tasuku Soma and Yuichi Yoshida. 2019. Spectral Sparsification of Hypergraphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2570--2581.
[129]
Daniel A Spielman and Nikhil Srivastava. 2011. Graph sparsification by effective resistances. SIAM J. Comput. 40, 6 (2011), 1913--1926.
[130]
Daniel A Spielman and Shang-Hua Teng. 2011. Spectral sparsification of graphs. SIAM J. Comput. 40, 4 (2011), 981--1025.
[131]
Xin Sui, Tsung-Hsien Lee, Joyce Jiyoung Whang, Berkant Savas, Saral Jain, Keshav Pingali, and Inderjit Dhillon. 2012. Parallel clustered low-rank approximation of graphs and its application to link prediction. In International Workshop on Languages and Compilers for Parallel Computing. Springer, 76--95.
[132]
Daniel Yasumasa Takahashi, Joao Ricardo Sato, Carlos Eduardo Ferreira, and André Fujita. 2012. Discriminating different classes of biological networks by analyzing the graphs spectra distribution. PLoS One 7, 12 (2012), e49949.
[133]
Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2011. Compression of weighted graphs. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. 965--973.
[134]
Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2012. Network compression by node and edge mergers. Bisociative Knowledge Discovery (2012), 199--217.
[135]
Ioanna Tsalouchidou, Francesco Bonchi, Gianmarco De Francisci Morales, and Ricardo Baeza-Yates. 2018. Scalable Dynamic Graph Summarization. IEEE Transactions on Knowledge and Data Engineering (2018), 1--1.
[136]
Charalampos E Tsourakakis. [n.d.]. Fast Counting of Triangles in Large Real Networks: Algorithms and Laws. cis. temple. edu ([n.d.]), 608--617.
[137]
Charalampos E Tsourakakis, U Kang, Gary L Miller, and Christos Faloutsos. 2009. Doulion: counting triangles in massive graphs with a coin. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 837--846.
[138]
György Turán. 1984. On the succinct representation of graphs. Discrete Applied Mathematics 8, 3 (1984), 289--294.
[139]
Jason TL Wang, Kaizhong Zhang, and Gung-Wei Chirn. 1995. Algorithms for approximate graph matching. Information Sciences 82, 1-2 (1995), 45--74.
[140]
Leyuan Wang, Yangzihao Wang, Carl Yang, and John D Owens. 2016. A comparative study on exact triangle counting algorithms on the gpu. In Proceedings of the ACM Workshop on High Performance Graph Processing. ACM, 1--8.
[141]
Tianyi Wang, Yang Chen, Zengbin Zhang, Tianyin Xu, Long Jin, Pan Hui, Beixing Deng, and Xing Li. 2011. Understanding graph sampling algorithms for social network analysis. In 2011 31st international conference on distributed computing systems workshops. IEEE, 123--128.
[142]
Yongyu Wang and Zhuo Feng. 2017. Towards scalable spectral clustering via spectrum-preserving sparsification. arXiv preprint arXiv.1710.04584 (2017).
[143]
Michael M Wolf, Mehmet Deveci, Jonathan W Berry, Simon D Hammond, and Sivasankaran Rajamanickam. 2017. Fast linear algebra-based triangle counting with kokkoskernels. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.
[144]
Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, and Yingyi Bu. 2014. Pregel algorithms for graph connectivity problems with performance guarantees. Proceedings of the VLDB Endowment 7, 14 (2014), 1821--1832.
[145]
Abdurrahman Yaşar, Sivasankaran Rajamanickam, Michael Wolf, Jonathan Berry, and Ümit V Çatalyürek. 2018. Fast Triangle Counting Using Cilk. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--7.
[146]
Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, and Meihui Zhang. 2015. In-memory big data management and processing: A survey. IEEE Transactions on Knowledge and Data Engineering 27, 7 (2015), 1920--1948.
[147]
Yunquan Zhang, Ting Cao, Shigang Li, Xinhui Tian, Liang Yuan, Haipeng Jia, and Athanasios V Vasilakos. 2016. Parallel processing systems for big data: a survey. Proc. IEEE 104, 11 (2016), 2114--2136.
[148]
Ying Zhang, Zhiqiang Zhao, and Zhuo Feng. 2018. Towards Scalable Spectral Sparsification of Directed Graphs. arXiv preprint arXiv:1812.04165 (2018).
[149]
Fang Zhou, Qiang Qu, and Hannu Toivonen. 2017. Summarisation of weighted networks. Journal of Experimental and Theoretical Artificial Intelligence 29, 5 (2017), 1023--1052.

Cited By

View all
  • (2024)A Sampling-Based Framework for Hypothesis Testing on Large Attributed GraphsProceedings of the VLDB Endowment10.14778/3681954.368199317:11(3192-3200)Online publication date: 1-Jul-2024
  • (2024)SamBaS: Sampling-Based Stochastic Block PartitioningIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.335830111:3(3053-3065)Online publication date: May-2024
  • (2024)General-purpose query processing on summary graphsSocial Network Analysis and Mining10.1007/s13278-024-01314-w14:1Online publication date: 9-Aug-2024
  • Show More Cited By
  1. Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
    November 2019
    1921 pages
    ISBN:9781450362290
    DOI:10.1145/3295500
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    In-Cooperation

    • IEEE CS

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 November 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Funding Sources

    • European Research Council (ERC)

    Conference

    SC '19
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)65
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Sampling-Based Framework for Hypothesis Testing on Large Attributed GraphsProceedings of the VLDB Endowment10.14778/3681954.368199317:11(3192-3200)Online publication date: 1-Jul-2024
    • (2024)SamBaS: Sampling-Based Stochastic Block PartitioningIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.335830111:3(3053-3065)Online publication date: May-2024
    • (2024)General-purpose query processing on summary graphsSocial Network Analysis and Mining10.1007/s13278-024-01314-w14:1Online publication date: 9-Aug-2024
    • (2023)Rate-Distortion Function of the Stochastic Block Model2023 57th Asilomar Conference on Signals, Systems, and Computers10.1109/IEEECONF59524.2023.10476987(699-703)Online publication date: 29-Oct-2023
    • (2023)An Integrated Approach for Accelerating Stochastic Block Partitioning2023 IEEE High Performance Extreme Computing Conference (HPEC)10.1109/HPEC58863.2023.10363599(1-7)Online publication date: 25-Sep-2023
    • (2022)ProbGraphProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/3571885.3571942(1-17)Online publication date: 13-Nov-2022
    • (2022)HBMaxProceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1145/3559009.3569647(412-425)Online publication date: 8-Oct-2022
    • (2022)Software-defined floating-point number formats and their application to graph processingProceedings of the 36th ACM International Conference on Supercomputing10.1145/3524059.3532360(1-17)Online publication date: 28-Jun-2022
    • (2022)Graph Stream Sketch: Summarizing Graph Streams with High Speed and AccuracyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3174570(1-1)Online publication date: 2022
    • (2022)Topological network features determine convergence rate of distributed average algorithmsScientific Reports10.1038/s41598-022-25974-w12:1Online publication date: 17-Dec-2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media