Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleSeptember 2024
Topo: Towards a fine-grained topological data processing framework on Tianhe-3 supercomputer
Journal of Parallel and Distributed Computing (JPDC), Volume 192, Issue Chttps://doi.org/10.1016/j.jpdc.2024.104926AbstractBig data frameworks are widely deployed in supercomputers for analyzing large-scale datasets. Topological data processing is an emerging approach that focuses on analyzing the topological structures in high-dimensional scientific data. However, ...
- research-articleSeptember 2024
PerfTop: Towards performance prediction of distributed learning over general topology
Journal of Parallel and Distributed Computing (JPDC), Volume 192, Issue Chttps://doi.org/10.1016/j.jpdc.2024.104922AbstractDistributed learning with multiple GPUs has been widely adopted to accelerate the training process of large-scale deep neural networks. However, misconfiguration of the GPU clusters with various communication primitives and topologies could ...
Highlights- A new performance prediction framework (termed PerfTop) is proposed to accurately predict the execution time of distributed learning over general topologies.
- The framework provides an in-depth analysis of the underlying mechanisms of ...
- research-articleMarch 2023
Parallel computing in finance for estimating risk-neutral densities through option prices
Journal of Parallel and Distributed Computing (JPDC), Volume 173, Issue CPages 61–69https://doi.org/10.1016/j.jpdc.2022.11.010AbstractOption pricing is one of the most active Financial Economics research fields. Black-Scholes-Merton option pricing theory states that risk-neutral density is lognormal. However, markets' pieces of evidence do not support that ...
Highlights- An example of such big data environments is information associated with financial options contracts.
- research-articleNovember 2022
Asynchronous simulated annealing on the placement problem: A beneficial race condition
Journal of Parallel and Distributed Computing (JPDC), Volume 169, Issue CPages 242–251https://doi.org/10.1016/j.jpdc.2022.07.001AbstractRace conditions, which occur when compute workers do not synchronise correctly, are considered undesirable in parallel computing, as they introduce often-unintended stochastic behaviour. This study presents an asynchronous parallel ...
Highlights- Performance gains from asynchronous algorithms are worth the race condition.
- ...
- research-articleApril 2022
Reachability in parallel programs is polynomial in the number of threads
Journal of Parallel and Distributed Computing (JPDC), Volume 162, Issue CPages 1–16https://doi.org/10.1016/j.jpdc.2021.11.008Highlights- The notions of the diameter and local diameter for not necessarily binary programs.
Reachability in parallel finite-state programs equipped with interleaving semantics is an inherently difficult, important problem. Its complexity in the number of threads n, while keeping the thread-local–memory size and the shared-...
-
- research-articleMarch 2022
Communication lower-bounds for distributed-memory computations for mass spectrometry based omics data
Journal of Parallel and Distributed Computing (JPDC), Volume 161, Issue CPages 37–47https://doi.org/10.1016/j.jpdc.2021.11.001Highlights- We present a theoretical framework that can be used for analyzing, and quantifying the performance of parallel algorithms designed for MS based omics data.
- We prove the lower communication bounds for the existing parallel algorithms.
Mass spectrometry (MS) based omics data analysis require significant time and resources. To date, few parallel algorithms have been proposed for deducing peptides from mass spectrometry-based data. However, these parallel algorithms were designed,...
- research-articleOctober 2019
Cross-state events: A new approach to parallel discrete event simulation and its speculative runtime support
Journal of Parallel and Distributed Computing (JPDC), Volume 132, Issue CPages 48–68https://doi.org/10.1016/j.jpdc.2019.05.003AbstractWe present a new approach to Parallel Discrete Event Simulation (PDES), where we enable the execution of so-called cross-state events. During their processing, the state of multiple concurrent simulation objects can be accessed in read/...
Highlights- Introduction of a new concept of “event” in parallel discrete event simulation, called cross-state event.
- research-articleFebruary 2018
Parallel algorithms for computing the smallest binary tree size in unit simplex refinement
Journal of Parallel and Distributed Computing (JPDC), Volume 112, Issue P2Pages 166–178https://doi.org/10.1016/j.jpdc.2017.05.016Refinement of the unit simplex by iterative longest edge bisection (LEB) up to sub-simplices have a size smaller or equal to a given accuracy, generates a binary tree. For a dimension higher than three, the size of the generated tree depends on the ...
- research-articleNovember 2016
Fair synchronization
Journal of Parallel and Distributed Computing (JPDC), Volume 97, Issue CPages 1–10https://doi.org/10.1016/j.jpdc.2016.06.007Most published concurrent data structures which avoid locking do not provide any fairness guarantees. That is, they allow processes to access a data structure and complete their operations arbitrarily many times before some other trying process can ...
- research-articleJuly 2016
Read/write shared memory abstraction on top of asynchronous Byzantine message-passing systems
Journal of Parallel and Distributed Computing (JPDC), Volume 93, Issue CPages 1–9https://doi.org/10.1016/j.jpdc.2016.03.012This paper is on the construction and use of a shared memory abstraction on top of an asynchronous message-passing system in which up to t processes may commit Byzantine failures. This abstraction consists of arrays of n single-writer/multi-reader ...
- research-articleJanuary 2016
A highly scalable parallel algorithm for solving Toeplitz tridiagonal systems of linear equations
Journal of Parallel and Distributed Computing (JPDC), Volume 87, Issue CPages 102–108https://doi.org/10.1016/j.jpdc.2015.10.004Based on a modification of the dichotomy algorithm, we propose a novel parallel procedure for solving tridiagonal systems of equations with Toeplitz matrices. Taking the structure of the Toeplitz matrices, we may substantially reduce the number of the ...
- articleApril 2014
Accelerating sequential programs on commodity multi-core processors
Journal of Parallel and Distributed Computing (JPDC), Volume 74, Issue 4Pages 2257–2265https://doi.org/10.1016/j.jpdc.2013.12.009A recently proposed pipelined multithreading (PMT) technique exhibits wide applicability in parallelizing general sequential programs on multi-core processors. However, significant inter-core communication overhead limits PMT performance and prevents ...
- research-articleJanuary 2014
Bitonic sort on a chained-cubic tree interconnection network
Journal of Parallel and Distributed Computing (JPDC), Volume 74, Issue 1Pages 1744–1761https://doi.org/10.1016/j.jpdc.2013.09.008Bitonic sort is one of the fastest oblivious parallel sorting algorithms known so far. Due to its high modularity, bitonic sort can be mapped to different interconnection networks. In this paper, the bitonic sort algorithm is mapped to the chained-cubic ...
- articleJune 2013
Estimating parallel performance
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 6Pages 876–887https://doi.org/10.1016/j.jpdc.2013.01.011In this paper we introduce our estimation method for parallel execution times, based on identifying separate ''parts'' of the work done by parallel programs. Our run time analysis works without any source code inspection. The time of parallel program ...
- articleFebruary 2013
Research note: Revisiting parallel cyclic reduction and parallel prefix-based algorithms for block tridiagonal systems of equations
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 2Pages 273–280https://doi.org/10.1016/j.jpdc.2012.10.003Direct solvers based on prefix computation and cyclic reduction algorithms exploit the special structure of tridiagonal systems of equations to deliver better parallel performance compared to those designed for more general systems of equations. This ...
- articleFebruary 2013
Programming support and scheduling for communicating parallel tasks
Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 2Pages 220–234https://doi.org/10.1016/j.jpdc.2012.09.017Task-based programming models are beneficial for the development of parallel programs for several reasons. They provide a decoupling of the specification of parallelism from the scheduling and mapping to execution resources of a specific hardware ...
- articleNovember 2010
Unified parallel encoding and decoding algorithms for Dandelion-like codes
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 11Pages 1119–1127https://doi.org/10.1016/j.jpdc.2010.07.003The Dandelion-like codes are eight bijections between labeled trees and strings of node labels. The literature contains optimal sequential algorithms for these bijections, but no parallel algorithms have been reported. In this paper the first parallel ...
- articleJune 2010
An efficient parallel algorithm for building the separating tree
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 6Pages 625–629https://doi.org/10.1016/j.jpdc.2010.01.007We present an efficient parallel algorithm for building the separating tree for a separable permutation. Our algorithm runs in O(log^2n) time using O(nlog^1^.^5n) operations on the CREW PRAM and O(log^2n) time using O(nlognloglogn) operations on the ...
- articleMay 2010
Iterative computations with ordered read-write locks
Journal of Parallel and Distributed Computing (JPDC), Volume 70, Issue 5Pages 496–504https://doi.org/10.1016/j.jpdc.2009.09.002We introduce the framework of ordered read-write locks, ORWL, that are characterized by two main features: a strict FIFO policy for access, and the attribution of access to lock-handles instead of processes or threads. These two properties together ...