Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Noise in the Clouds: Influence of Network Performance Variability on Application Scalability

Published: 08 December 2022 Publication History

Abstract

Cloud computing represents an appealing opportunity for cost-effective deployment of HPC workloads on the best-fitting hardware. However, although cloud and on-premise HPC systems offer similar computational resources, their network architecture and performance may differ significantly. For example, these systems use fundamentally different network transport and routing protocols, which may introduce network noise that can eventually limit the application scaling. This work analyzes network performance, scalability, and cost of running HPC workloads on cloud systems. First, we consider latency, bandwidth, and collective communication patterns in detailed small-scale measurements, and then we simulate network performance at a larger scale. We validate our approach on four popular cloud providers and three on-premise HPC systems, showing that network (and also OS) noise can significantly impact performance and cost both at small and large scale.

References

[1]
Top 500. 2022. Top 500 List. https://www.top500.org/. Accessed: 31-Mar-2022.
[2]
Albert Alexandrov, Mihai F. Ionescu, Klaus E. Schauser, and Chris Scheiman. 1997. LogGP: Incorporating Long Messages into the LogP Model for Parallel Computation. J. Parallel and Distrib. Comput. 44, 1 (1997), 71--79. https: //doi.org/10.1006/jpdc.1997.1346
[3]
Mohammad Alizadeh, Tom Edsall, Sarang Dharmapurikar, Ramanan Vaidyanathan, Kevin Chu, Andy Fingerhut, Vinh The Lam, Francis Matus, Rong Pan, Navindra Yadav, and George Varghese. 2014. CONGA: Distributed CongestionAware Load Balancing for Datacenters. In Proceedings of the 2014 ACM Conference on SIGCOMM (Chicago, Illinois, USA) (SIGCOMM '14). Association for Computing Machinery, New York, NY, USA, 503--514. https://doi.org/10.1145/ 2619239.2626316
[4]
Rawan Aljamal, Ali El-Mousa, and Fahed Jubair. 2020. Benchmarking Microsoft Azure Virtual Machines for the use of HPC applications. In 2020 11th International Conference on Information and Communication Systems (ICICS). 382--387. https://doi.org/10.1109/ICICS49469.2020.239525
[5]
Bob Alverson, Edwin Froese, Larry Kaplan, and Duncan Roweth. 2012. Cray XC series network. Cray Inc., White Paper WP-Aries01--1112 (2012).
[6]
Amazon. 2022. Amazon AWS. https://aws.amazon.com/. Accessed: 11-Mar-2022.
[7]
Amazon. 2022. Amazon EC2 instance network bandwidth. https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ ec2-instance-network-bandwidth.html. Accessed: 11-Mar-2022.
[8]
Amazon. 2022. Amazon EC2 P4d Instances -- Highest performance for ML training and HPC applications in the cloud. https://aws.amazon.com/ec2/instance-types/p4/. Accessed: 31-Mar-2022.
[9]
Amazon. 2022. AWS Nitro Card. https://aws.amazon.com/ec2/nitro/. Accessed: 31-Mar-2022.
[10]
Amazon. 2022. AWS ParallelCluster User Guide. https://docs.aws.amazon.com/parallelcluster/latest/ug/clusterdefinition.html#base-os. Accessed: 31-Mar-2022.
[11]
AWS. 2022. AWS Graviton Processor. https://aws.amazon.com/it/ec2/graviton/. Accessed: 31-Mar-2022.
[12]
Azure. 2022. FPGA optimized virtual machine sizes. https://docs.microsoft.com/en-us/azure/virtual-machines/sizesfield-programmable-gate-arrays. Accessed: 31-Mar-2022.
[13]
Tal Ben-Nun and Torsten Hoefler. 2019. Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis. ACM Comput. Surv. 52, 4, Article 65 (aug 2019), 43 pages. https://doi.org/10.1145/3320060
[14]
Keren Bergman. 2018. Empowering Flexible and Scalable High Performance Architectures with Embedded Photonics. In 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 378--378. https://doi.org/10.1109/ IPDPS.2018.00047
[15]
Abhinav Bhatele, Kathryn Mohror, Steven H. Langer, and Katherine E. Isaacs. 2013. There goes the neighborhood: Performance degradation due to nearby jobs. In SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. 1--12. https://doi.org/10.1145/2503210.2503247
[16]
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo, Guohan Lu, Lihua Yuan, Yixin Zheng, Haitao Wu, Yongqiang Xiong, and Dave Maltz. 2013. Per-Packet Load-Balanced, Low-Latency Routing for Clos-Based Data Center Networks. In Proceedings of the Ninth ACM Conference on Emerging Networking Experiments and Technologies (Santa Barbara, California, USA) (CoNEXT '13). Association for Computing Machinery, New York, NY, USA, 49--60. https://doi.org/10. 1145/2535372.2535375
[17]
Swiss National Supercomputing Centre. 2022. Pricing: Pay-As-You-Go Service. https://2go.cscs.ch/offering/pricing/. Accessed: 02-May-2022.
[18]
S Chang, R Hood, H Jin, S Heistand, J Chang, S Cheung, J Djomehri, G Jost, and D Kokron. 2018. Evaluating the Suitability of Commercial Clouds for NASA's High Performance Computing Applications: A Trade Study. NASA NAS Technical Report 1, May (2018), 1--46.
[19]
Sudheer Chunduri, Kevin Harms, Scott Parker, Vitali Morozov, Samuel Oshin, Naveen Cherukuri, and Kalyan Kumaran. 2017. Run-to-Run Variability on Xeon Phi Based Cray XC Systems. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Denver, Colorado) (SC '17). Association for Computing Machinery, New York, NY, USA, Article 52, 13 pages. https://doi.org/10.1145/3126908.3126926
[20]
Google Cloud. 2022. N2, N2D, C2, and C2D higher bandwidth configuration. https://cloud.google.com/compute/allpricing#high_bandwidth_configuration. Accessed: 02-May-2022.
[21]
Susan Coghlan and Katherine Yelick. 2011. The Magellan Final Report on Cloud Computing. (12 2011). https: //doi.org/10.2172/1076794
[22]
Jeffrey L. Cox. 2022. Evolution of Optical Technologies in the Cloud Infrastructure. https://www.youtube.com/watch? v=r3GJt7AiGuc. Accessed: 31-Mar-2022.
[23]
Michael Dalton, David Schultz, Jacob Adriaens, Ahsan Arefin, Anshuman Gupta, Brian Fahs, Dima Rubinstein, Enrique Cauich Zermeno, Erik Rubow, James Alexander Docauer, Jesse Alpert, Jing Ai, Jon Olson, Kevin DeCabooter, Marc De Kruijf, Nan Hua, Nathan Lewis, Nikhil Kasinadhuni, Riccardo Crepaldi, Srinivas Krishnan, Subbaiah Venkata,Yossi Richter, Uday Naik, and Amin Vahdat. 2018. Andromeda: Performance, Isolation, and Velocity at Scale in Cloud Network Virtualization. In Proceedings of the 15th USENIX Conference on Networked Systems Design and Implementation (Renton, WA, USA) (NSDI'18). USENIX Association, USA, 373--387.
[24]
Daniele De Sensi, Salvatore Di Girolamo, and Torsten Hoefler. 2019. Mitigating Network Noise on Dragonfly Networks through Application-Aware Routing. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Denver, Colorado) (SC '19). Association for Computing Machinery, New York, NY, USA, Article 16, 32 pages. https://doi.org/10.1145/3295500.3356196
[25]
Daniele De Sensi, Salvatore Di Girolamo, Kim H. McMahon, Duncan Roweth, and Torsten Hoefler. 2020. An In-Depth Analysis of the Slingshot Interconnect. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Atlanta, Georgia) (SC '20). IEEE Press, Article 35, 14 pages.
[26]
Arturo Fernandez. 2021. Evaluation of the performance of tightly coupled parallel solvers and MPI communications in IaaS from the public cloud. IEEE Transactions on Cloud Computing (2021), 1--1. https://doi.org/10.1109/TCC.2021.3052844
[27]
Kurt B. Ferreira, Patrick Bridges, and Ron Brightwell. 2008. Characterizing application sensitivity to OS interference using kernel-level noise injection. In SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing. 1--12. https://doi.org/10.1109/SC.2008.5219920
[28]
Forbes. 2022. Oracle Cloud Had A Banner 2021 And Is Very Credible. https://www.forbes.com/sites/patrickmoorhead/ 2022/02/08/oracle-cloud-had-a-banner-2021-and-now-very-credible/. Accessed: 30-Jun-2022.
[29]
Arnaud Froidmont. 2022. Running Applications on Oracle Cloud Using Cluster Networking. https://blogs.oracle.com/ cloud-infrastructure/post/running-applications-on-oracle-cloud-using-cluster-networking. Accessed: 30-Jun-2022.
[30]
Soudeh Ghorbani, Zibin Yang, P. Brighten Godfrey, Yashar Ganjali, and Amin Firoozshahian. 2017. DRILL: Micro Load Balancing for Low-Latency Data Center Networks. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (Los Angeles, CA, USA) (SIGCOMM '17). Association for Computing Machinery, New York, NY, USA, 225--238. https://doi.org/10.1145/3098822.3098839
[31]
Google. 2022. Cloud TPU. https://cloud.google.com/tpu. Accessed: 31-Mar-2022.
[32]
Google. 2022. Creating an HPC-ready VM instance. https://cloud.google.com/compute/docs/instances/create-hpc-vm. Accessed: 31-Mar-2022.
[33]
Google. 2022. Google GCP. https://cloud.google.com/. Accessed: 11-Mar-2022.
[34]
Taylor Groves, Yizi Gu, and Nicholas J. Wright. 2017. Understanding Performance Variability on the Aries Dragonfly Network. In 2017 IEEE International Conference on Cluster Computing (CLUSTER). 809--813. https://doi.org/10.1109/ CLUSTER.2017.76
[35]
Giulia Guidi, Marquita Ellis, Aydin Buluç, Katherine Yelick, and David Culler. 2021. 10 Years Later: Cloud Computing is Closing the Performance Gap. Association for Computing Machinery, New York, NY, USA, 41--48. https://doi.org/10. 1145/3447545.3451183
[36]
Torsten Hoefler, Torsten Mehlan, Andrew Lumsdaine, and Wolfgang Rehm. 2007. Netgauge: A Network Performance Measurement Framework. In Proceedings of High Performance Computing and Communications, HPCC'07 (Houston, USA), Vol. 4782. Springer, 659--671.
[37]
Torsten Hoefler, Timo Schneider, and Andrew Lumsdaine. 2009. The Effect of Network Noise on Large-Scale Collective Communications. Parallel Processing Letters (PPL) 19, 4 (Aug. 2009), 573--593.
[38]
Torsten Hoefler, Timo Schneider, and Andrew Lumsdaine. 2010. Characterizing the Influence of System Noise on Large-Scale Applications by Simulation. In SC '10: Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis. 1--11. https://doi.org/10.1109/SC.2010.12
[39]
Torsten Hoefler, Timo Schneider, and Andrew Lumsdaine. 2010. LogGOPSim - Simulating Large-Scale Applications in the LogGOPS Model. In Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (Chicago, Illinois). ACM, 597--604.
[40]
Torsten Hoefler, Christian Siebert, and Andrew Lumsdaine. 2009. Group Operation Assembly Language - A Flexible Way to Express Collective Communication. In ICPP-2009 - The 38th International Conference on Parallel Processing (Vienna, Austria). IEEE.
[41]
Intel. 2022. Accelerating High-Speed Networking with Intel I/OAT. https://www.intel.com/content/dam/doc/whitepaper/i-o-acceleration-technology-paper.pdf. Accessed: 11-Mar-2022.
[42]
Intel. 2022. Benchmarking GEMM on Intel Architecture Processors. https://www.intel.com/content/www/us/en/ developer/articles/technical/benchmarking-gemm-with-intel-mkl-and-blis-on-intel-processors.html. Accessed: 31- Mar-2022.
[43]
Intel. 2022. Intel MPI Benchmarks. https://www.intel.com/content/www/us/en\/developer/articles/technical/intelmpi-benchmarks.html. Accessed: 31-Mar-2022.
[44]
Keith R. Jackson, Lavanya Ramakrishnan, Krishna Muriki, Shane Canon, Shreyas Cholia, John Shalf, Harvey J. Wasserman, and Nicholas J. Wright. 2010. Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud. In 2010 IEEE Second International Conference on Cloud Computing Technology and Science 159--168. https://doi.org/10.1109/CloudCom.2010.69
[45]
Jülich Supercomputing Centre. 2022. DEEP-EST System Overview. https://deeptrac.zam.kfa-juelich.de:8443/trac/wiki/ Public/User_Guide/System_overview. Accessed: 06-May-2022.
[46]
Yong (Alexander) Liu, Xin (Lucy) Liu, Fang (Nancy) Li, Haohuan Fu, Yuling Yang, Jiawei Song, Pengpeng Zhao, Zhen Wang, Dajia Peng, Huarong Chen, Chu Guo, Heliang Huang, Wenzhao Wu, and Dexun Chen. 2021. Closing the "Quantum Supremacy" Gap: Achieving Real-Time Simulation of a Random Quantum Circuit Using a New Sunway Supercomputer. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (St. Louis, Missouri) (SC '21). Association for Computing Machinery, New York, NY, USA, Article 3, 12 pages. https://doi.org/10.1145/3458817.3487399
[47]
Patrick MacArthur, Qian Liu, Robert D. Russell, Fabrice Mizero, Malathi Veeraraghavan, and John M. Dennis. 2017. An Integrated Tutorial on InfiniBand, Verbs, and MPI. IEEE Communications Surveys & Tutorials 19, 4 (2017), 2894--2926. https://doi.org/10.1109/COMST.2017.2746083
[48]
Aniruddha Marathe, Rachel Harris, David K. Lowenthal, Bronis R. de Supinski, Barry Rountree, Martin Schulz, and Xin Yuan. 2013. A Comparative Study of High-Performance Computing on the Cloud. In Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing (New York, New York, USA) (HPDC '13). Association for Computing Machinery, New York, NY, USA, 239--250. https://doi.org/10.1145/2493123.2462919
[49]
George Michelogiannakis, Yiwen Shen, Min Yee Teh, Xiang Meng, Benjamin Aivazi, Taylor Groves, John Shalf, Madeleine Glick, Manya Ghobadi, Larry Dennison, and Keren Bergman. 2019. Bandwidth Steering in HPC Using Silicon Nanophotonics. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (Denver, Colorado) (SC '19). Association for Computing Machinery, New York, NY, USA, Article 41, 25 pages. https://doi.org/10.1145/3295500.3356145
[50]
Microsoft. 2022. Adaptive Routing on Azure HPC . https://techcommunity.microsoft.com/t5/azure-compute-blog/ adaptive-routing-on-azure-hpc/ba-p/1205217. Accessed: 11-Mar-2022.
[51]
Microsoft. 2022. CentOS-HPC VM images. https://docs.microsoft.com/en-us/azure/virtual-machines/workloads/hpc/ configure#centos-hpc-vm-images. Accessed: 31-Mar-2022.
[52]
Microsoft. 2022. FSv2 Series. https://docs.microsoft.com/en-us/azure/virtual-machines/fsv2-series. Accessed: 31-Mar2022.
[53]
Microsoft. 2022. High performance computing VM sizes. https://docs.microsoft.com/en-us/azure/virtual-machines/ sizes-hpc. Accessed: 31-Mar-2022.
[54]
Microsoft. 2022. Microsoft Azure. https://azure.microsoft.com/. Accessed: 11-Mar-2022.
[55]
Mohammad Mohammadi and Timur Bazhirov. 2018. Comparative Benchmarking of Cloud Computing Vendors with High Performance Linpack. In Proceedings of the 2nd International Conference on High Performance Compilation, Computing and Communications (Hong Kong, Hong Kong) (HP3C). Association for Computing Machinery, New York, NY, USA, 1--5. https://doi.org/10.1145/3195612.3195613
[56]
Vivek Navale and Philip E. Bourne. 2018. Cloud computing applications for biomedical science: A perspective. PLOS Computational Biology 14, 6 (06 2018), 1--14. https://doi.org/10.1371/journal.pcbi.1006144
[57]
Marco A. S. Netto, Rodrigo N. Calheiros, Eduardo R. Rodrigues, Renato L. F. Cunha, and Rajkumar Buyya. 2018. HPC Cloud for Scientific and Business Applications: Taxonomy, Vision, and Research Challenges. ACM Comput. Surv. 51, 1, Article 8 (jan 2018), 29 pages. https://doi.org/10.1145/3150224
[58]
Network-Based Computing Laboratory - Ohio State University. 2022. OSU Microbenchmarks. http://mvapich.cse.ohiostate.edu/benchmarks/. Accessed: 31-Mar-2022.
[59]
Masahito Ohue, Kento Aoyama, and Yutaka Akiyama. 2021. High-Performance Cloud Computing for Exhaustive Protein--Protein Docking. In Advances in Parallel & Distributed Processing, and Applications, Hamid R. Arabnia, Leonidas Deligiannidis, Michael R. Grimaila, Douglas D. Hodson, Kazuki Joe, Masakazu Sekijima, and Fernando G. Tinetti (Eds.). Springer International Publishing, Cham, 737--746.
[60]
OpenMPI. 2022. FAQ: Tuning the run-time characteristics of MPI TCP communications. https://www.open-mpi.org/ faq/?category=tcp#tcp-multi-links. Accessed: 11-Mar-2022.
[61]
Oracle. 2022. Oracle Cloud Infrastructure. https://www.oracle.com/cloud/. Accessed: 11-Mar-2022.
[62]
Manish Parashar, Moustafa AbdelBaky, Ivan Rodero, and Aditya Devarakonda. 2013. Cloud Paradigms and Practices for Computational and Data-Enabled Science and Engineering. Comput. Sci. Eng. 15, 4 (2013), 10--18. https://doi.org/ 10.1109/MCSE.2013.49
[63]
Carlos Peña-Monferrer, Robert Manson-Sawko, and Vadim Elisseev. 2021. HPC-cloud native framework for concurrent simulation, analysis and visualization of CFD workflows. Future Generation Computer Systems 123 (2021), 14--23. https://doi.org/10.1016/j.future.2021.04.008
[64]
Bogdan Prisacari, German Rodriguez, Philip Heidelberger, Dong Chen, Cyriel Minkenberg, and Torsten Hoefler. 2014. Efficient Task Placement and Routing of Nearest Neighbor Exchanges in Dragonfly Networks. In Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing (Vancouver, BC, Canada) (HPDC'14). Association for Computing Machinery, New York, NY, USA, 129--140. https://doi.org/10.1145/2600212.2600225
[65]
Daniel Reed, Dennis Gannon, and Jack Dongarra. 2022. Reinventing High Performance Computing: Challenges and Opportunities. https://arxiv.org/abs/2203.02544. arXiv:arXiv:2203.02544
[66]
Iman Sadooghi, Jesús Hernández Martin, Tonglin Li, Kevin Brandstatter, Ketan Maheshwari, Tiago Pais Pitta de Lacerda Ruivo, Gabriele Garzoglio, Steven Timm, Yong Zhao, and Ioan Raicu. 2017. Understanding the Performance and Potential of Cloud Computing for Scientific Applications. IEEE Transactions on Cloud Computing 5, 2 (2017), 358--371. https://doi.org/10.1109/TCC.2015.2404821
[67]
K. Saurabh, S. Adavani, K. Tan, M. Ishii, B. Gao, A. Krishnamurthy, H. Sundar, and B. Ganapathysubramanian. 2021. Case study of SARS-CoV-2 transmission risk assessment in indoor environments using cloud computing resources. In 2021 SC Workshops Supplementary Proceedings (SCWS). IEEE Computer Society, Los Alamitos, CA, USA, 79--86. https://doi.org/10.1109/SCWS55283.2021.00020
[68]
Leah Shalev, Hani Ayoub, Nafea Bshara, and Erez Sabbag. 2020. A Cloud-Optimized Transport Protocol for Elastic and Scalable HPC. IEEE Micro 40, 6 (2020), 67--73. https://doi.org/10.1109/MM.2020.3016891
[69]
Arjun Singh, Joon Ong, Amit Agarwal, Glen Anderson, Ashby Armistead, Roy Bannon, Seb Boving, Gaurav Desai, Bob Felderman, Paulie Germano, Anand Kanagala, Jeff Provost, Jason Simmons, Eiichi Tanda, Jim Wanderer, Urs Hölzle, Stephen Stuart, and Amin Vahdat. 2015. Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication (London, United Kingdom) (SIGCOMM '15). Association for Computing Machinery, New York, NY, USA, 183--197. https://doi.org/10.1145/2785956.2787508
[70]
Staci A. Smith, Clara E. Cromey, David K. Lowenthal, Jens Domke, Nikhil Jain, Jayaraman J. Thiagarajan, and Abhinav Bhatele. 2018. Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing. In SC18: International Conference for High Performance Computing, Networking, Storage and Analysis. 346--360. https://doi.org/10.1109/SC.2018.00030
[71]
SPCL. 2022. LogGOPSim simulator. https://github.com/spcl/LogGOPSim. Accessed: 31-Mar-2022.
[72]
C. B. Stunkel, R. L. Graham, G. Shainer, M. Kagan, S. S. Sharkawi, B. Rosenburg, and G. A. Chochia. 2020. The high-speed networks of the Summit and Sierra supercomputers. IBM Journal of Research and Development 64, 3/4 (2020), 3:1--3:10. https://doi.org/10.1147/JRD.2020.2967330
[73]
Estela Suarez, Anke Kreuzer, Norbert Eicker, and Thomas Lippert. 2021. The DEEP-EST project. Schriften des Forschungszentrums Jülich IAS Series, Vol. 48. Forschungszentrum Jülich GmbH Zentralbibliothek, Verlag, Jülich, 9--25. https://juser.fz-juelich.de/record/905812
[74]
Swiss National Supercomputing Centre. 2022. Alps Supercomputer. https://www.cscs.ch/computers/alps/. Accessed: 06-May-2022.
[75]
Swiss National Supercomputing Centre. 2022. Piz Daint Supercomputer. https://www.cscs.ch/computers/piz-daint/. Accessed: 11-Mar-2022.
[76]
D. Thaler and C. Hopps. 2000. RFC2991: Multipath Issues in Unicast and Multicast Next-Hop Selection.
[77]
Xu Yang, John Jenkins, Misbah Mubarak, Robert B. Ross, and Zhiling Lan. 2016. Watch Out for the Bully! Job Interference Study on Dragonfly Network. In SC '16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 750--760. https://doi.org/10.1109/SC.2016.63
[78]
Yan Zhai, Mingliang Liu, Jidong Zhai, Xiaosong Ma, and Wenguang Chen. 2011. Cloud versus In-House Cluster: Evaluating Amazon Cluster Compute Instances for Running MPI Applications. In State of the Practice Reports (Seattle, Washington) (SC '11). Association for Computing Machinery, New York, NY, USA, Article 11, 10 pages. https: //doi.org/10.1145/2063348.2063363

Cited By

View all

Index Terms

  1. Noise in the Clouds: Influence of Network Performance Variability on Application Scalability

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Proceedings of the ACM on Measurement and Analysis of Computing Systems
      Proceedings of the ACM on Measurement and Analysis of Computing Systems  Volume 6, Issue 3
      POMACS
      December 2022
      534 pages
      EISSN:2476-1249
      DOI:10.1145/3576048
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 December 2022
      Published in POMACS Volume 6, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. cloud
      2. hpc
      3. network noise
      4. scalability

      Qualifiers

      • Research-article

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)169
      • Downloads (Last 6 weeks)27
      Reflects downloads up to 09 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Software Resource Disaggregation for HPC with Serverless Computing2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00021(139-156)Online publication date: 27-May-2024
      • (2024) ExDeFuture Generation Computer Systems10.1016/j.future.2023.11.013153:C(84-96)Online publication date: 16-May-2024
      • (2024)CanaryFuture Generation Computer Systems10.1016/j.future.2023.10.010152:C(70-82)Online publication date: 4-Mar-2024
      • (2023)Analytical Approaches to QoS Analysis and Performance Modelling in Fog ComputingMulti-Disciplinary Applications of Fog Computing10.4018/978-1-6684-4466-5.ch007(121-151)Online publication date: 3-Aug-2023
      • (2023)Myths and legends in high-performance computingInternational Journal of High Performance Computing Applications10.1177/1094342023116660837:3-4(245-259)Online publication date: 1-Jul-2023
      • (2023)HEAR: Homomorphically Encrypted AllreduceProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607099(1-17)Online publication date: 12-Nov-2023
      • (2023)Data Center Ethernet and Remote Direct Memory Access: Issues at HyperscaleComputer10.1109/MC.2023.326118456:7(67-77)Online publication date: 1-Jul-2023

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media