- research-article, January 2023
AtoMig: Automatically Migrating Millions Lines of Code from TSO to WMM
- Martin Beck,
- Koustubha Bhat,
- Lazar Stričević,
- Geng Chen,
- Diogo Behrens,
- Ming Fu,
- Viktor Vafeiadis,
- Haibo Chen,
- Hermann Härtig
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, Pages 61–73
https://doi.org/10.1145/3575693.3579849
CPUs with weak memory-consistency models (WMMs), such as Arm and RISC-V, are rapidly increasing their market share. Porting legacy x86 applications to such CPUs requires introducing extra synchronization to prevent WMM-related concurrency bugs---a ...
- research-article, July 2022, Best Paper
Porting Uintah to heterogeneous systems
PASC '22: Proceedings of the Platform for Advanced Scientific Computing Conference, Article No.: 11, Pages 1–10
https://doi.org/10.1145/3539781.3539794
The Uintah Computational Framework is being prepared to make portable use of forthcoming exascale systems, initially the DOE Aurora system through the Aurora Early Science Program. This paper describes the evolution of Uintah to be ready for such ...
- article, May 2012
Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), Volume 9, Issue 3, Pages 679–692
https://doi.org/10.1109/TCBB.2011.68
Markov clustering (MCL) is becoming a key algorithm within bioinformatics for determining clusters in networks. However, with increasing vast amount of data on biological networks, performance and scalability issues are becoming a critical limiting ...
- research-article, April 2012
A Cluster-on-a-Chip Architecture for High-Throughput Phylogeny Search
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 23, Issue 4, Pages 579–588
https://doi.org/10.1109/TPDS.2010.191
In this paper, we describe an FPGA-based coprocessor architecture that performs a high-throughput branch-and-bound search of the space of phylogenetic trees corresponding to the number of input taxa. Our coprocessor architecture is designed to ...
- research-article, December 2011
Prediction of Optimal Parallelism Level in Wide Area Data Transfers
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 22, Issue 12, Pages 2033–2045
https://doi.org/10.1109/TPDS.2011.228
Wide area data transfer may be a major bottleneck for the end-to-end performance of distributed applications. A practical way of increasing the wide area throughput at the application layer is using multiple parallel streams. Although increased number ...
- research-article, November 2011
Streamline Integration Using MPI-Hybrid Parallelism on a Large Multicore Architecture
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 17, Issue 11, Pages 1702–1713
https://doi.org/10.1109/TVCG.2010.259
Streamline computation in a very large vector field data set represents a significant challenge due to the nonlocal and data-dependent nature of streamline integration. In this paper, we conduct a study of the performance characteristics of hybrid ...
- poster, September 2011
Modeling growth and dynamics of neural networks via message passing in Erlang: neural models have a natural home in message passing functional programming languages
Erlang '11: Proceedings of the 10th ACM SIGPLAN workshop on Erlang, Pages 94–97
https://doi.org/10.1145/2034654.2034674
Erlang is well suited as a platform for modeling neural dynamics and development. We overview similarities between neural architecture and language paradigms in Erlang, specifically functional programming, message passing, distributed computing and ...
- research-article, May 2010
A Scalable Message Passing Interface Implementation of an Ad-Hoc Parallel I/O System
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 24, Issue 2, Pages 164–184
https://doi.org/10.1177/1094342009347890
In this paper we present the novel design, implementation, and evaluation of an ad-hoc parallel I/O system AHPIOS. AHPIOS is the first scalable parallel I/O system completely implemented in the Message Passing Interface MPI. The MPI implementation ...
- article, February 2010
Implementation and Evaluation of File Write-Back and Prefetching for MPI-IO Over GPFS
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 24, Issue 1, Pages 78–92
https://doi.org/10.1177/1094342009359015
In this paper we present the implementation of an open-source MPI-IO interface for the General Parallel File System (GPFS). Our solution includes the design and implementation of GPFS-based write-back and prefetching modules, which have been integrated ...
- research-article, February 2010
Scheduling Concurrent Bag-of-Tasks Applications on Heterogeneous Platforms
IEEE Transactions on Computers (ITCO), Volume 59, Issue 2, Pages 202–217
https://doi.org/10.1109/TC.2009.117
Scheduling problems are already difficult on traditional parallel machines, and they become extremely challenging on heterogeneous clusters. In this paper, we deal with the problem of scheduling multiple applications, made of collections of independent ...
- research-article, March 2006
The Monte Carlo Method in Science and Engineering
Computing in Science and Engineering (IEEECS_CISE-NEW), Volume 8, Issue 2, Pages 9–19
https://doi.org/10.1109/MCSE.2006.34
Since 1953, researchers have applied the Monte Carlo method to a wide range of areas. Specialized algorithms have also been developed to extend the method's applicability and efficiency. The author describes some of the algorithms that have been ...
- research-article, March 2023
Program construction by verifying specification
Journal of Computer Science and Technology (JCST), Volume 13, Issue 6, Pages 597–607
https://doi.org/10.1007/BF02946503
A program construction method based on Gamma language is proposed. The problem to be solved is specified by first-order predicate logic and a semantic verification program is constructed directly from the specification. Ways, for improving ...
- article, September 1997
Fast Motion Planning by Parallel Processing – a Review
Journal of Intelligent and Robotic Systems (JIRS), Volume 20, Issue 1, Pages 45–69
https://doi.org/10.1023/A:1007948727999
One of the many features needed to support the activities of autonomous systems is the ability to plan motion. This enables robots to move in their environment securely and to accomplish given tasks. Unfortunately, the control loop comprising sensing, ...