Concurrency

Showing 1 - 20 of 547 Results

research-article
September 2024
Topo: Towards a fine-grained topological data processing framework on Tianhe-3 supercomputer
Journal of Parallel and Distributed Computing (JPDC), Volume 192, Issue C, https://doi.org/10.1016/j.jpdc.2024.104926
Abstract
Big data frameworks are widely deployed in supercomputers for analyzing large-scale datasets. Topological data processing is an emerging approach that focuses on analyzing the topological structures in high-dimensional scientific data. However, ...
Total Citations: 0
research-article
September 2024
PerfTop: Towards performance prediction of distributed learning over general topology
Journal of Parallel and Distributed Computing (JPDC), Volume 192, Issue C, https://doi.org/10.1016/j.jpdc.2024.104922
Abstract
Distributed learning with multiple GPUs has been widely adopted to accelerate the training process of large-scale deep neural networks. However, misconfiguration of the GPU clusters with various communication primitives and topologies could ...
Highlights

A new performance prediction framework (termed PerfTop) is proposed to accurately predict the execution time of distributed learning over general topologies.
The framework provides an in-depth analysis of the underlying mechanisms of ...
Total Citations: 0
research-article
March 2023
Parallel computing in finance for estimating risk-neutral densities through option prices
- Ana M. Monteiro,
- António A.F. Santos
Journal of Parallel and Distributed Computing (JPDC), Volume 173, Issue C, Pages 61–69, https://doi.org/10.1016/j.jpdc.2022.11.010
Abstract
Option pricing is one of the most active Financial Economics research fields. Black-Scholes-Merton option pricing theory states that risk-neutral density is lognormal. However, markets' pieces of evidence do not support that ...
Highlights

An example of such big data environments is information associated with financial options contracts.
Total Citations: 0
research-article
November 2022
Asynchronous simulated annealing on the placement problem: A beneficial race condition
Journal of Parallel and Distributed Computing (JPDC), Volume 169, Issue C, Pages 242–251, https://doi.org/10.1016/j.jpdc.2022.07.001
Abstract
Race conditions, which occur when compute workers do not synchronise correctly, are considered undesirable in parallel computing, as they introduce often-unintended stochastic behaviour. This study presents an asynchronous parallel ...
Highlights

Performance gains from asynchronous algorithms are worth the race condition.
...
Total Citations: 1
research-article
April 2022
Reachability in parallel programs is polynomial in the number of threads
- Alexander Malkis
Journal of Parallel and Distributed Computing (JPDC), Volume 162, Issue C, Pages 1–16, https://doi.org/10.1016/j.jpdc.2021.11.008
Highlights

The notions of the diameter and local diameter for not necessarily binary programs.
Abstract
Reachability in parallel finite-state programs equipped with interleaving semantics is an inherently difficult, important problem. Its complexity in the number of threads n, while keeping the thread-local–memory size and the shared-...
Total Citations: 0
research-article
March 2022
Communication lower-bounds for distributed-memory computations for mass spectrometry based omics data
Journal of Parallel and Distributed Computing (JPDC), Volume 161, Issue C, Pages 37–47, https://doi.org/10.1016/j.jpdc.2021.11.001
Highlights

We present a theoretical framework that can be used for analyzing, and quantifying the performance of parallel algorithms designed for MS based omics data.
We prove the lower communication bounds for the existing parallel algorithms.
Abstract
Mass spectrometry (MS) based omics data analysis requires significant time and resources. To date, few parallel algorithms have been proposed for deducing peptides from mass spectrometry-based data. However, these parallel algorithms were designed,...
Total Citations: 0
editorial
December 2019
Editorial on the Special Issue on Parallel Computing in Modelling and Simulation
Journal of Parallel and Distributed Computing (JPDC), Volume 134, Issue C, Pages 233–235, https://doi.org/10.1016/j.jpdc.2019.09.002
Total Citations: 0
research-article
October 2019
Cross-state events: A new approach to parallel discrete event simulation and its speculative runtime support
- Alessandro Pellegrini,
- Francesco Quaglia
Journal of Parallel and Distributed Computing (JPDC), Volume 132, Issue C, Pages 48–68, https://doi.org/10.1016/j.jpdc.2019.05.003
Abstract
We present a new approach to Parallel Discrete Event Simulation (PDES), where we enable the execution of so-called cross-state events. During their processing, the state of multiple concurrent simulation objects can be accessed in read/...
Highlights

Introduction of a new concept of “event” in parallel discrete event simulation, called cross-state event.
Total Citations: 6
research-article
February 2018
Parallel algorithms for computing the smallest binary tree size in unit simplex refinement
Journal of Parallel and Distributed Computing (JPDC), Volume 112, Issue P2, Pages 166–178, https://doi.org/10.1016/j.jpdc.2017.05.016

Refinement of the unit simplex by iterative longest edge bisection (LEB) up to sub-simplices have a size smaller or equal to a given accuracy, generates a binary tree. For a dimension higher than three, the size of the generated tree depends on the ...
Total Citations: 0
research-article
November 2016
Fair synchronization
Journal of Parallel and Distributed Computing (JPDC), Volume 97, Issue C, Pages 1–10, https://doi.org/10.1016/j.jpdc.2016.06.007

Most published concurrent data structures which avoid locking do not provide any fairness guarantees. That is, they allow processes to access a data structure and complete their operations arbitrarily many times before some other trying process can ...
Total Citations: 2
research-article
July 2016
Read/write shared memory abstraction on top of asynchronous Byzantine message-passing systems
Journal of Parallel and Distributed Computing (JPDC), Volume 93, Issue C, Pages 1–9, https://doi.org/10.1016/j.jpdc.2016.03.012

This paper is on the construction and use of a shared memory abstraction on top of an asynchronous message-passing system in which up to t processes may commit Byzantine failures. This abstraction consists of arrays of n single-writer/multi-reader ...
Total Citations: 2
research-article
January 2016
A highly scalable parallel algorithm for solving Toeplitz tridiagonal systems of linear equations
- Andrew V. Terekhov
Journal of Parallel and Distributed Computing (JPDC), Volume 87, Issue C, Pages 102–108, https://doi.org/10.1016/j.jpdc.2015.10.004

Based on a modification of the dichotomy algorithm, we propose a novel parallel procedure for solving tridiagonal systems of equations with Toeplitz matrices. Taking the structure of the Toeplitz matrices, we may substantially reduce the number of the ...
Total Citations: 2
article
April 2014
Accelerating sequential programs on commodity multi-core processors
Journal of Parallel and Distributed Computing (JPDC), Volume 74, Issue 4, Pages 2257–2265, https://doi.org/10.1016/j.jpdc.2013.12.009

A recently proposed pipelined multithreading (PMT) technique exhibits wide applicability in parallelizing general sequential programs on multi-core processors. However, significant inter-core communication overhead limits PMT performance and prevents ...
Total Citations: 4
research-article
January 2014
Bitonic sort on a chained-cubic tree interconnection network
- Sherenaz W. Al-Haj Baddar,
- Basel A. Mahafzah
Journal of Parallel and Distributed Computing (JPDC), Volume 74, Issue 1, Pages 1744–1761, https://doi.org/10.1016/j.jpdc.2013.09.008

Bitonic sort is one of the fastest oblivious parallel sorting algorithms known so far. Due to its high modularity, bitonic sort can be mapped to different interconnection networks. In this paper, the bitonic sort algorithm is mapped to the chained-cubic ...
Total Citations: 5
article
June 2013
Estimating parallel performance
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 6, Pages 876–887, https://doi.org/10.1016/j.jpdc.2013.01.011

In this paper we introduce our estimation method for parallel execution times, based on identifying separate "parts" of the work done by parallel programs. Our run time analysis works without any source code inspection. The time of parallel program ...
Total Citations: 3
article
February 2013
Research note: Revisiting parallel cyclic reduction and parallel prefix-based algorithms for block tridiagonal systems of equations
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 2, Pages 273–280, https://doi.org/10.1016/j.jpdc.2012.10.003

Direct solvers based on prefix computation and cyclic reduction algorithms exploit the special structure of tridiagonal systems of equations to deliver better parallel performance compared to those designed for more general systems of equations. This ...
Total Citations: 5
article
February 2013
Programming support and scheduling for communicating parallel tasks
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 2, Pages 220–234, https://doi.org/10.1016/j.jpdc.2012.09.017

Task-based programming models are beneficial for the development of parallel programs for several reasons. They provide a decoupling of the specification of parallelism from the scheduling and mapping to execution resources of a specific hardware ...
Total Citations: 1
article
November 2010
Unified parallel encoding and decoding algorithms for Dandelion-like codes
- Saverio Caminiti,
- Rossella Petreschi
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 11, Pages 1119–1127, https://doi.org/10.1016/j.jpdc.2010.07.003

The Dandelion-like codes are eight bijections between labeled trees and strings of node labels. The literature contains optimal sequential algorithms for these bijections, but no parallel algorithms have been reported. In this paper the first parallel ...
Total Citations: 0
article
June 2010
An efficient parallel algorithm for building the separating tree
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 6, Pages 625–629, https://doi.org/10.1016/j.jpdc.2010.01.007

We present an efficient parallel algorithm for building the separating tree for a separable permutation. Our algorithm runs in O(log^2 n) time using O(n log^1.5 n) operations on the CREW PRAM and O(log^2 n) time using O(n log n log log n) operations on the ...
Total Citations: 1
article
May 2010
Iterative computations with ordered read-write locks
- Pierre-Nicolas Clauss,
- Jens Gustedt
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 5, Pages 496–504, https://doi.org/10.1016/j.jpdc.2009.09.002

We introduce the framework of ordered read-write locks, ORWL, that are characterized by two main features: a strict FIFO policy for access, and the attribution of access to lock-handles instead of processes or threads. These two properties together ...
Total Citations: 3
