
    Susan Coghlan

    We investigate operating system noise, which we identify as one of the main reasons for a lack of synchronicity in parallel applications. Using a microbenchmark, we measure the noise on several contemporary platforms and find that, even with a general-purpose operating system, noise can be limited if certain precautions are taken. We then inject artificially generated noise into a massively parallel system and measure its influence on the performance of collective operations. Our experiments indicate that on extreme-scale platforms, the performance is correlated with the largest interruption to the application, even if the probability of such an interruption on a single process is extremely small. We demonstrate that synchronizing the noise can significantly reduce its negative influence.
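    The measurement idea can be illustrated with a fixed-work-quantum probe: time a constant amount of computation many times and treat any sample that runs noticeably longer than the fastest one as an interruption. The C sketch below is illustrative only and is not the paper's microbenchmark; the sample count, the work size, and the use of the gap between minimum and maximum as a proxy for the largest interruption are all arbitrary choices.

```c
/* A minimal fixed-work-quantum style noise probe: repeatedly time a constant
 * amount of arithmetic; samples that take noticeably longer than the minimum
 * indicate interruptions (OS noise). Illustrative sketch only. */
#include <stdio.h>
#include <time.h>

#define SAMPLES 100000
#define WORK    20000          /* iterations of dummy work per sample */

static double now_ns(void) {
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec * 1e9 + ts.tv_nsec;
}

int main(void) {
    static double elapsed[SAMPLES];
    volatile double x = 1.0;

    for (int s = 0; s < SAMPLES; s++) {
        double t0 = now_ns();
        for (int i = 0; i < WORK; i++)      /* fixed work quantum */
            x = x * 1.0000001 + 0.0000001;
        elapsed[s] = now_ns() - t0;
    }

    double min = elapsed[0], max = elapsed[0];
    for (int s = 1; s < SAMPLES; s++) {
        if (elapsed[s] < min) min = elapsed[s];
        if (elapsed[s] > max) max = elapsed[s];
    }
    /* max - min approximates the largest single interruption observed */
    printf("min %.0f ns  max %.0f ns  largest detour %.0f ns\n",
           min, max, max - min);
    return 0;
}
```

    Pinning the process to a single core and repeating the run makes the detour statistic more reproducible.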
    Cloud resources promise to be an avenue to address new categories of scientific applications, including data-intensive science applications, on-demand/surge computing, and applications that require customized software environments. However, there is limited understanding of how to operate and use clouds for scientific applications. Magellan, a project funded through the Department of Energy's (DOE) Advanced Scientific Computing Research (ASCR) program, is investigating the use of cloud computing for science at the Argonne Leadership Computing Facility (ALCF) and the National Energy Research Scientific Computing Facility (NERSC). In this paper, we detail the experiences to date at both sites and identify the gaps and open challenges from both a resource provider and an application perspective.
    Argonne National Laboratory is owned by the United States Government and operated by The University of Chicago under the provisions of a contract with the Department of Energy. Disclaimer: This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor The University of Chicago, nor any of their employees or officers, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of document authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.
    The goal of Magellan, a project funded through the Department of Energy's (DOE) Advanced Scientific Computing Research (ASCR) program, was to investigate the potential role of cloud computing in addressing the computing needs of scientific applications.
    Mira, Argonne's petascale IBM Blue Gene/Q system, ushers in a new era of scientific supercomputing at the Argonne Leadership Computing Facility. An engineering marvel, the 10-petaflops supercomputer is capable of carrying out 10 quadrillion calculations per second. As a machine for open science, Mira is available to any researcher with a question that requires large-scale computing resources; researchers submit proposals for time on Mira, typically in allocations of millions of core-hours, to run programs for their experiments. This adds up to billions of hours of computing time per year.
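    For a sense of where the "billions of hours" figure comes from, the sketch below does the arithmetic under an assumed configuration of 49,152 nodes with 16 cores each (the commonly cited figures for Mira, not values stated in this text) running around the clock, which works out to roughly 6.9 billion core-hours per year.

```c
/* Back-of-the-envelope core-hour budget for a Mira-class machine.
 * Node and core counts are assumptions for illustration. */
#include <stdio.h>

int main(void) {
    const double nodes          = 49152;    /* assumed node count     */
    const double cores_per_node = 16;       /* assumed cores per node */
    const double hours_per_year = 24 * 365;

    double core_hours = nodes * cores_per_node * hours_per_year;
    printf("~%.1f billion core-hours per year\n", core_hours / 1e9);
    return 0;
}
```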
    The Advanced Computing Lab at Los Alamos National Laboratory is engaged in cluster-based systems development to support large-scale applications and to learn how future systems and software may be designed. In this paper we describe our large SGI cluster and our experimental Linux cluster, outline some important software available on both of them, and discuss some performance results.
    NeuroBuilder is a comprehensive package for building neurobiologically based networks. It gives the user a range of options for specifying both the single-unit dynamics (from neural-network-like neurons to complex dynamics) and the three-dimensional structure and connectivity needed to describe any biologically based system.
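    As a rough illustration of the two ingredients such a package combines, the sketch below pairs a trivial per-unit dynamical rule with a distance-based connectivity rule over units placed in 3-D space. It is not NeuroBuilder's API; all names and constants are invented for the example.

```c
/* Illustrative sketch (not NeuroBuilder's actual API): units with positions
 * in 3-D space, connected when within a radius, updated with simple leaky
 * dynamics. */
#include <math.h>
#include <stdio.h>

#define N_UNITS 50

typedef struct {
    double x, y, z;   /* position in 3-D space      */
    double v;         /* state of the unit dynamics */
} Unit;

static double dist(const Unit *a, const Unit *b) {
    return sqrt((a->x - b->x) * (a->x - b->x) +
                (a->y - b->y) * (a->y - b->y) +
                (a->z - b->z) * (a->z - b->z));
}

int main(void) {
    Unit u[N_UNITS];
    int  conn[N_UNITS][N_UNITS] = {{0}};

    /* place units on a coarse 3-D grid */
    for (int i = 0; i < N_UNITS; i++) {
        u[i].x = i % 5; u[i].y = (i / 5) % 5; u[i].z = i / 25;
        u[i].v = 0.0;
    }

    /* distance-based connectivity rule */
    for (int i = 0; i < N_UNITS; i++)
        for (int j = 0; j < N_UNITS; j++)
            conn[i][j] = (i != j && dist(&u[i], &u[j]) < 1.5);

    /* a few steps of leaky-integrator dynamics driven by connected units */
    for (int t = 0; t < 10; t++)
        for (int i = 0; i < N_UNITS; i++) {
            double input = 0.1;                 /* constant external drive */
            for (int j = 0; j < N_UNITS; j++)
                if (conn[j][i]) input += 0.01 * u[j].v;
            u[i].v += 0.1 * (input - u[i].v);   /* leaky integration */
        }

    printf("unit 0 state after 10 steps: %f\n", u[0].v);
    return 0;
}
```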
    Petascale HPC systems are among the largest systems in the world. Intrepid, one such system, is a 40,000 node, 556 teraflop Blue Gene/P system that has been deployed at Argonne National Laboratory. In this paper, we provide some background about the system and our administration experiences. In particular, due to the scale of the system, we have faced a variety of issues, some surprising to us, that are not common in the commodity world. We discuss our expectations, these issues, and approaches we have used to address them.
    This paper presents a framework to support transparent, live migration of virtual GPU accelerators in a virtualized execution environment. Migration is a critical capability in such environments because it provides support for fault tolerance, on-demand system maintenance, resource management, and load balancing in the mapping of virtual to physical GPUs. Techniques to increase responsiveness and reduce migration overhead are explored. The system is evaluated by using four application kernels and is demonstrated to provide low migration overheads. Through transparent load balancing, our system provides a speedup of 1.7 to 1.9 for three of the four application kernels.
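    At a conceptual level, live migration of a virtual GPU amounts to quiescing the virtual device, snapshotting its memory and context, re-creating it on another physical GPU, and resuming. The sketch below only simulates those steps with a plain struct; none of the names correspond to the framework described in the paper.

```c
/* Conceptual sketch of the steps in live migration of a virtual GPU.
 * The "device" here is just a struct in host memory. */
#include <stdio.h>
#include <string.h>

typedef struct {
    int    physical_gpu;      /* which physical GPU backs the virtual one */
    size_t mem_bytes;         /* size of device memory to transfer        */
    char   mem[1024];         /* stand-in for device memory contents      */
    int    paused;
} VirtualGPU;

/* Step 1: stop issuing new work and drain in-flight kernels (simulated). */
static void quiesce(VirtualGPU *v) { v->paused = 1; }

/* Steps 2-3: snapshot state and restore it on a different physical GPU. */
static void migrate(const VirtualGPU *src, VirtualGPU *dst, int target_gpu) {
    memcpy(dst, src, sizeof *dst);   /* copy memory image and context */
    dst->physical_gpu = target_gpu;  /* rebind to the new physical GPU */
    dst->paused = 0;                 /* Step 4: resume on the target   */
}

int main(void) {
    VirtualGPU a = { .physical_gpu = 0, .mem_bytes = sizeof a.mem };
    strcpy(a.mem, "application working set");

    quiesce(&a);
    VirtualGPU b;
    migrate(&a, &b, /*target_gpu=*/1);

    printf("resumed on GPU %d, paused=%d, mem=\"%s\"\n",
           b.physical_gpu, b.paused, b.mem);
    return 0;
}
```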
    Power consumption is becoming a critical factor as we continue our quest toward exascale computing. Yet the actual power utilization of a complete system is an insufficiently studied research area. Estimating the power consumption of a large-scale system is a nontrivial task because a large number of components are involved and because power requirements are affected by (unpredictable) workloads. Clearly needed is a power-monitoring infrastructure that can provide timely and accurate feedback to system developers and application writers so that they can optimize the use of this precious resource. Many existing large-scale installations do feature power-monitoring sensors; however, these are part of environmental- and health-monitoring subsystems and were not designed with application-level power consumption measurements in mind. In this paper, we evaluate the existing power monitoring of IBM Blue Gene systems, with the goal of understanding what capabilities are available and how they fare with respect to spatial and temporal resolution, accuracy, latency, and other characteristics. We find that with a careful choice of dedicated microbenchmarks, we can obtain meaningful power consumption data even on Blue Gene/P, where the interval between available data points is measured in minutes. We next evaluate the monitoring subsystem on Blue Gene/Q and are able to study the power characteristics of its FPU and memory subsystems. We find the monitoring subsystem capable of providing second-scale resolution of power data, conveniently separated between node components, with a latency of seven seconds. This represents a significant improvement in power-monitoring infrastructure, and we hope future systems will enable real-time power measurement in order to better understand application behavior at a finer granularity.
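    One way to attribute such coarse-grained samples to a specific workload, sketched below, is to run a micro-benchmark phase long enough to span several sensor readings and average the readings collected during the phase. The sensor-reading function here is a placeholder, and the ten-second update period is an assumption made for the example, not a property of the Blue Gene monitoring subsystem.

```c
/* Illustrative sketch: attribute coarse power samples to a benchmark phase
 * by running the kernel across several sensor readings and averaging them.
 * read_power_watts() is a placeholder for the real monitoring interface. */
#include <stdio.h>
#include <time.h>
#include <unistd.h>

static double read_power_watts(void) {
    /* Placeholder sensor: in practice this would query the system's
     * environmental/power monitoring interface. */
    return 2000.0;
}

int main(void) {
    const int phase_seconds   = 60;  /* run the kernel for a full minute      */
    const int sample_interval = 10;  /* assumed sensor update period, seconds */
    double    sum = 0.0;
    int       n   = 0;

    time_t start = time(NULL);
    while (time(NULL) - start < phase_seconds) {
        /* ... micro-benchmark kernel (e.g. FPU- or memory-bound loop) ... */
        sum += read_power_watts();
        n++;
        sleep(sample_interval);
    }
    printf("average power over phase: %.1f W (%d samples)\n", sum / n, n);
    return 0;
}
```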
    A varied collection of scientific and engineering codes has been adapted and enhanced to take advantage of the IBM Blue Gene®/Q architecture and thus enable research that was previously out of reach. Computational research teams from a number of disciplines collaborated with the staff of the Argonne Leadership Computing Facility to assess which of Blue Gene/Q's many novel features could be exploited for each application, equipping it to tackle existing problem classes with greater fidelity and, in some cases, to address new phenomena. The quad floating-point units and the five-dimensional torus interconnect are among the features that were demonstrated to be effective for a number of important applications. Furthermore, data obtained from the hardware counters provided insights that were valuable in guiding the code modifications. Hardware features and programming techniques that were effective across multiple codes are documented as well. We confirmed that no significant code rewrite is needed to run today's production codes with good performance on Mira, an IBM Blue Gene/Q supercomputer, and performance improvements are already demonstrated, even though our measurements were all made on pre-production software and hardware. The application domains included biology, materials science, combustion, chemistry, nuclear physics, and the industrial-scale design of nuclear reactors, jet engines, and the efficiency of transportation systems.
    Achieving high performance for distributed I/O on a wide-area network continues to be an elusive holy grail. Despite enhancements in network hardware and software stacks, achieving high performance remains a challenge. In this paper, our worldwide team took a completely new and non-traditional approach to distributed I/O, called ParaMEDIC (Parallel Metadata Environment for Distributed I/O and Computing), which uses application-specific transformation of data into orders-of-magnitude smaller metadata before ...
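    The core idea, as described here, is to trade bulk data movement for a much smaller, application-specific description from which the output can be regenerated at the destination. The sketch below illustrates that trade with a toy filter: only the indices of matching records cross the (notional) wide-area link, and the receiving side reconstructs the records from data it already holds. The record format and filter are invented for the example and are not ParaMEDIC's actual implementation.

```c
/* Conceptual sketch of the metadata-instead-of-data idea: ship compact
 * application-specific metadata (here, just indices of "interesting"
 * records) and regenerate the full output at the destination. */
#include <stdio.h>

#define N_RECORDS 1000000

/* Both ends are assumed to hold the same input; a record's content can be
 * regenerated from its index (here via a toy hash). */
static unsigned record_value(int i) { return (unsigned)i * 2654435761u % 1000u; }

int main(void) {
    /* Compute side: find interesting records, keep only their indices. */
    static int metadata[N_RECORDS];
    int n_meta = 0;
    for (int i = 0; i < N_RECORDS; i++)
        if (record_value(i) == 999u)        /* application-specific filter */
            metadata[n_meta++] = i;

    /* Only the index list crosses the WAN: a few kilobytes of metadata
     * instead of the full result set. */
    printf("shipping %d indices instead of %d records\n", n_meta, N_RECORDS);

    /* I/O side: regenerate the full output locally from the indices. */
    for (int k = 0; k < n_meta && k < 5; k++)
        printf("record %d -> value %u\n", metadata[k], record_value(metadata[k]));
    return 0;
}
```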