Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1375527.1375554acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article

Evaluating the effect of replacing CNK with linux on the compute-nodes of blue gene/l

Published: 07 June 2008 Publication History

Abstract

The Blue Gene machines in production today run a small single-user, single-process kernel (CNK) having a limited functionality. Motivated by the desire to provide applications with a much richer operating environment, we evaluate the effect of replacing CNK with a standard Linux kernel on the compute nodes of Blue Gene/L. We show that with a relatively small amount of effort we were able to improve benchmark performance under Linux up to a level that is comparable to CNK.

References

[1]
The MPICH and MPICH2 homepage. http://www-unix.mcs.anl.gov/mpi/mpich.
[2]
N. R. Adiga et al. An overview of the BlueGene/L supercomputer. In SC2002 -- High Performance Networking and Computing, Baltimore, MD, November 2002.
[3]
G. Alm--asi, R. Bellofatto, J. Brunheroto, C. Cascaval, J. G. Castanos, L. Ceze, P. Crumley, C. Erway, J. Gagliano, D. Lieber, X. Martorell, J. E. Moreira, A. Sanomiya, and K. Strauss. An overview of the BlueGene/L system software organization. In Proceedings of Euro-Par 2003 Conference, Lecture Notes in Computer Science, Klagenfurt, Austria, August 2003. Springer-Verlag.
[4]
G. Alm--asi et al. The Design and Implementation of Message Passing Services for the BlueGene/L Supercomputer. IBM System Journal, 2004.
[5]
Astron project homepage. http://astron.nl/.
[6]
P. Beckman, K. Iskra, K. Yoshii, and S. Coghlan. The influence of operating systems on the performance of collective operations at extreme scale. In Proceedings of the 8th IEEE International Conference on Cluster Computing, Barcelona, Spain, Sept. 2006.
[7]
P. Beckman, K. Iskra, K. Yoshii, and S. Coghlan. Operating system issues for petascale systems. ACM SIGOPS Operating Systems Review, 40(2):29--33, Apr. 2006.
[8]
Blue brain project homepage. http://bluebrain.epfl.ch/.
[9]
R. Brightwell, L. A. Fisk, D. S. Greenberg, T. Hudson, M. Levenhagen, A. B. MacCabe, and R. Riesen. Massively parallel computing using commodity components. Parallel Comput., 26(2-3):243--266, 2000.
[10]
R. Brightwell, A. B. Maccabe, and R. Riesen. On the appropriateness of commodity operating systems for large-scale, balanced computing systems. ipdps, 00:68a, 2003.
[11]
R. Brightwell, R. Riesen, K. D. Underwood, T. Hudson, P. G. Bridges, and A. B. Maccabe. A performance comparison of linux and a lightweight kernel. In CLUSTER, pages 251--258, 2003.
[12]
S. Chakravorty, C. L. Mendes, L. V. Kal--e, T. Jones, A. Tauferner, T. Inglett, and J. Moreira. Hpc-colony: services and interfaces for very large systems. SIGOPS Oper. Syst. Rev., 40(2):43--49, 2006.
[13]
J. Dongarra, H.-W. Meuer, and E. Strohmaier. TOP500 Supercomputer Sites. Available in Web page at: http://www.netlib.org/benchmark/top500.html.
[14]
Francois Gygi et al. Large-scale electronic structure calculations of high-z metals on the bluegene/l platform. Gordon Bell prize winner at Supercomputing 2006, Tampa FL.
[15]
FUSE Homepage. http://fuse.sourceforge.net/.
[16]
T. G. Mattson, D. Scott, and S. R. Wheat. A tera flop supercomputer in 1996: The ASCI t flop system. In IPPS '96: Proceedings of the 10th International Parallel Processing Symposium, pages 84--93, Washington, DC, USA, 1996. IEEE Computer Society.
[17]
J. Navarro, S. Iyer, P. Druschel, and A. Cox. Practical, transparent operating system support for superpages. SIGOPS Oper. Syst. Rev., 36(SI):89--104, 2002.
[18]
NAS Parallel Benchmarks. http://www.nas.nasa.gov/Software/NPB.
[19]
F. Petrini, D. J. Kerbyson, and S. Pakin. The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q. In SC '03: Proceedings of the 2003 ACM/IEEE conference on Supercomputing, page 55, Washington, DC, USA, 2003. IEEE Computer Society.

Cited By

View all
  • (2018)Multi-level load balancing with an integrated runtime approachProceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing10.1109/CCGRID.2018.00018(31-40)Online publication date: 1-May-2018
  • (2017)Jitter-TraceProceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers ROSS 201710.1145/3095770.3095772(1-8)Online publication date: 27-Jun-2017
  • (2017)Argo NodeOS: Toward Unified Resource Management for Exascale2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2017.25(153-162)Online publication date: May-2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICS '08: Proceedings of the 22nd annual international conference on Supercomputing
June 2008
390 pages
ISBN:9781605581583
DOI:10.1145/1375527
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CNK
  2. NPB
  3. bluegene
  4. kernel
  5. linux

Qualifiers

  • Research-article

Conference

ICS08
Sponsor:
ICS08: International Conference on Supercomputing
June 7 - 12, 2008
Island of Kos, Greece

Acceptance Rates

Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Multi-level load balancing with an integrated runtime approachProceedings of the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing10.1109/CCGRID.2018.00018(31-40)Online publication date: 1-May-2018
  • (2017)Jitter-TraceProceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers ROSS 201710.1145/3095770.3095772(1-8)Online publication date: 27-Jun-2017
  • (2017)Argo NodeOS: Toward Unified Resource Management for Exascale2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2017.25(153-162)Online publication date: May-2017
  • (2016)Lightweight Memory Management for High Performance Applications in Consolidated EnvironmentsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.239745227:2(468-480)Online publication date: 1-Feb-2016
  • (2016)Thread Assignment in Multicore/Multithreaded Processors: A Statistical ApproachIEEE Transactions on Computers10.1109/TC.2015.241753365:1(256-269)Online publication date: 1-Jan-2016
  • (2014)mOSProceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers10.1145/2612262.2612263(1-8)Online publication date: 10-Jun-2014
  • (2014)HPMMAPProceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium10.1109/IPDPS.2014.73(649-658)Online publication date: 19-May-2014
  • (2013)Thread Assignment of Multithreaded Network Applications in Multicore/Multithreaded ProcessorsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2012.31124:12(2513-2525)Online publication date: 1-Dec-2013
  • (2012)Salvaging chips with caches beyond repairProceedings of the Conference on Design, Automation and Test in Europe10.5555/2492708.2493018(1263-1268)Online publication date: 12-Mar-2012
  • (2012)Linux kernel co-scheduling and bulk synchronous parallelismThe International Journal of High Performance Computing Applications10.1177/109434201143352326:2(136-145)Online publication date: 16-Jan-2012
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media