Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleAugust 2022
MARTINI: The Little Match and Replace Tool for Automatic Application Rewriting with Code Examples
AbstractRewriting code for cleanliness, API changes, and new programming models is a common, yet time-consuming task. Localized or syntax-based changes are often mechanical and can be automated with text-based rewriting tools, like sed. However, non-...
- ArticleSeptember 2021
SKaMPI-OpenSHMEM: Measuring OpenSHMEM Communication Routines
OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart NetworksPages 63–80https://doi.org/10.1007/978-3-031-04888-3_4AbstractBenchmarking is an important challenge in HPC, in particular, to be able to tune the basic blocks of the software environment used by applications. The communication library and distributed run-time environment are among the most critical ones. In ...
- ArticleSeptember 2021
Fault-Tolerant LU Factorization Is Low Cost
AbstractAt large scale, failures are statistically frequent and need to be taken into account. Tolerating failures has arisen as a major challenge in parallel computing as the size of the systems grow, failures become more common and some computation ...
- research-articleJuly 2021
- ArticleJune 2019
Parallel Parametric Linear Programming Solving, and Application to Polyhedral Computations
AbstractParametric linear programming is central in polyhedral computations and in certain control applications. We propose a task-based scheme for parallelizing it, with quasi-linear speedup over large problems.
- ArticleAugust 2018
One-Sided Communications for More Efficient Parallel State Space Exploration over RDMA Clusters
AbstractThis paper investigates the use of one-sided communications in the context of state space exploration. This operation is often the core component of model checking tools that explores a system state space to look for behaviours deviating from its ...
- research-articleSeptember 2014
Distributed Behavioral Cartography of Timed Automata
EuroMPI/ASIA '14: Proceedings of the 21st European MPI Users' Group MeetingPages 109–114https://doi.org/10.1145/2642769.2642784Real-time systems, characterized by a set of timings constants (internal delays, timers, clock speeds), need to be perfectly reliable. Formal methods can prove their correctness but, if one of the timing constants changes, verification needs to be ...
- ArticleSeptember 2012
Fault tolerance logical network properties of irregular graphs
ICA3PP'12: Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part IPages 377–391https://doi.org/10.1007/978-3-642-33078-0_27Assume a desktop grid middleware or a deployed cloud infrastructure that are both based on a large number of volunteers for computational-intensive applications or business applications. In this case, the Internet is the communication layer; hence, the ...
- ArticleMay 2011
A Model for Coherent Distributed Memory for Race Condition Detection
IPDPSW '11: Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD ForumPages 584–590https://doi.org/10.1109/IPDPS.2011.196We present a new model for distributed shared memory systems, based on remote data accesses. Such features are offered by network interface cards that allow one-sided operations, remote direct memory access and OS bypass. This model leads to new ...
- articleApril 2011
QCG-OMPI: MPI applications on grids
- Emmanuel Agullo,
- Camille Coti,
- Thomas Herault,
- Julien Langou,
- Sylvain Peyronnet,
- Ala Rezmerita,
- Franck Cappello,
- Jack Dongarra
Future Generation Computer Systems (FGCS), Volume 27, Issue 4Pages 357–369https://doi.org/10.1016/j.future.2010.11.015Computational grids present promising computational and storage capacities. They can be made by punctual aggregation of smaller resources (i.e., clusters) to obtain a large-scale supercomputer. Running general applications is challenging for several ...
- research-articleNovember 2010
- ArticleDecember 2009
Running Parallel Applications with Topology-Aware Grid Middleware
E-SCIENCE '09: Proceedings of the 2009 Fifth IEEE International Conference on e-SciencePages 292–299https://doi.org/10.1109/e-Science.2009.48The concept of topology-aware grid applications is derived from parallelized computational models of complex systems that are executed on heterogeneous resources, either because they require specialized hardware for certain calculations, or because ...
- ArticleDecember 2009
Kernels and learning curves for Gaussian process regression on random graphs
NIPS'09: Proceedings of the 22nd International Conference on Neural Information Processing SystemsPages 1723–1731We investigate how well Gaussian process regression can learn functions defined on graphs, using large regular random graphs as a paradigmatic example. Random-walk based kernels are shown to have some non-trivial properties: within the standard ...
- ArticleAugust 2009
MPI Applications on Grids: A Topology Aware Approach
Euro-Par '09: Proceedings of the 15th International Euro-Par Conference on Parallel ProcessingPages 466–477https://doi.org/10.1007/978-3-642-03869-3_45Porting on grids complex MPI applications involving collective communications requires significant program modification, usually dedicated to a single grid structure. The difficulty comes from the mismatch between programs organizations and grid ...
- ArticleMay 2008
Grid Services for MPI
CCGRID '08: Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the GridPages 417–424https://doi.org/10.1109/CCGRID.2008.106Institutional grids consist of the aggregation of clusters belonging to different administrative domains to build a single parallel machine. To run an MPI application over an institutional grid, one has to address many challenges. One of the first ...
- ArticleNovember 2006
Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI
- Camille Coti,
- Thomas Herault,
- Pierre Lemarinier,
- Laurence Pilard,
- Ala Rezmerita,
- Eric Rodriguez,
- Franck Cappello
SC '06: Proceedings of the 2006 ACM/IEEE conference on SupercomputingPages 127–eshttps://doi.org/10.1145/1188455.1188587A long-term trend in high-performance computing is the increasing number of nodes in parallel computing platforms, which entails a higher failure probability. Fault tolerant programming environments should be used to guarantee the safe execution of ...