Dependable and fault-tolerant systems and networks

12 Results for: Book/Issue: ICPP '22: Proceedings of the 51st International Conference on Parallel Processing

research-article
Open Access
January 2023
Efficient Phase-Functioned Real-time Character Control in Mobile Games: A TVM Enabled Approach
- Haidong Lan,
- Wenxi Zhu,
- Du Wu,
- Qian Qiu,
- Honglin Zhu,
- Jingjing Zhao,
- Xinghui Fu,
- Liu Wei,
- Jintao Meng,
- Minwen Deng
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 15, Pages 1–9, https://doi.org/10.1145/3545008.3545095

In this paper, we propose a highly efficient computing method for game character control with phase-functioned neural networks (PFNN). The primary challenge in accelerating PFNN on mobile platforms is that PFNN dynamically produces weight matrices with an ...
Metrics: Total Citations: 0, Total Downloads: 421, Last 12 Months: 230, Last 6 weeks: 17

research-article
January 2023
DSSA: Dual-Side Sparse Systolic Array Architecture for Accelerating Convolutional Neural Network Training
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 17, Pages 1–10, https://doi.org/10.1145/3545008.3545086

Ever-growing CNN size incurs a significant amount of redundancy in model parameters, which in turn puts a considerable burden on hardware. Unstructured pruning is widely used to reduce this redundancy. However, the irregularity introduced by unstructured ...
Metrics: Total Citations: 0, Total Downloads: 148, Last 12 Months: 44, Last 6 weeks: 2

research-article
Open Access
January 2023
MG-GCN: A Scalable multi-GPU GCN Training Framework
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 79, Pages 1–11, https://doi.org/10.1145/3545008.3545082

Full batch training of Graph Convolutional Network (GCN) models is not feasible on a single GPU for large graphs containing tens of millions of vertices or more. Recent work has shown that, for the graphs used in the machine learning community, ...
Metrics: Total Citations: 3, Total Downloads: 721, Last 12 Months: 452, Last 6 weeks: 44

research-article
January 2023
DRAM Cache Management with Request Granularity for NAND-based SSDs
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 29, Pages 1–10, https://doi.org/10.1145/3545008.3545081

Most flash-based solid-state drives (SSDs) employ an on-board Dynamic Random Access Memory (DRAM) to cache hot data at the SSD page granularity. This can significantly reduce the number of flush operations to the underlying arrays of SSDs given that ...
Metrics: Total Citations: 3, Total Downloads: 246, Last 12 Months: 94, Last 6 weeks: 5

research-article
Open Access
January 2023
NCC: Neighbor-aware Congestion Control based on Reinforcement Learning for Datacenter Networks
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 62, Pages 1–10, https://doi.org/10.1145/3545008.3545074

The challenges of low-latency, high-throughput datacenter networks create new traffic management problems that require new congestion control mechanisms. Generally, the proposals to solve this problem have focused either on refining existing window-...
Metrics: Total Citations: 1, Total Downloads: 392, Last 12 Months: 217, Last 6 weeks: 34

research-article
January 2023
An Online Learning Approach for Client Selection in Federated Edge Learning under Budget Constraint
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 72, Pages 1–11, https://doi.org/10.1145/3545008.3545062

Federated learning (FL) has emerged as a new paradigm that enables distributed mobile devices to learn a global model collaboratively. Since mobile devices (a.k.a. clients) exhibit diversity in model training quality, client selection (CS) becomes ...
Metrics: Total Citations: 2, Total Downloads: 187, Last 12 Months: 59, Last 6 weeks: 2

research-article
Open Access
January 2023
Acuerdo: Fast Atomic Broadcast over RDMA
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 59, Pages 1–11, https://doi.org/10.1145/3545008.3545041

Atomic broadcast protocols ensure that messages are delivered to a group of machines in some total order, even when some of these machines can fail. These protocols are key to making distributed services fault-tolerant, as their total order guarantee ...
Metrics: Total Citations: 1, Total Downloads: 622, Last 12 Months: 336, Last 6 weeks: 42

research-article
January 2023
Repair-Optimal Data Placement for Locally Repairable Codes with Optimal Minimum Hamming Distance
- Shuang Ma,
- Si Wu,
- Cheng Li,
- Yinlong Xu
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 23, Pages 1–11, https://doi.org/10.1145/3545008.3545038

Modern clustered storage systems increasingly adopt erasure coding to realize reliable data storage at low storage redundancy. Locally Repairable Codes (LRC) are a family of practical erasure codes with high repair efficiency. Among various LRC ...
Metrics: Total Citations: 3, Total Downloads: 131, Last 12 Months: 51, Last 6 weeks: 2

research-article
January 2023
Mlog: Multi-log Write Buffer upon Ultra-fast SSD RAID
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 24, Pages 1–11, https://doi.org/10.1145/3545008.3545034

Parity-based RAID suffers from the partial-stripe write penalty and therefore has to introduce a write buffer to quickly absorb and merge incoming writes, and then flush them to the RAID array in batches. However, we experimentally observe that the popular buffering mechanism as ...
Metrics: Total Citations: 4, Total Downloads: 160, Last 12 Months: 80, Last 6 weeks: 2

research-article
January 2023
Boosting Cross-rack Multi-stripe Repair in Heterogeneous Erasure-coded Clusters
- Hai Zhou,
- Dan Feng
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 22, Pages 1–11, https://doi.org/10.1145/3545008.3545029

Large-scale distributed storage systems have introduced erasure coding to guarantee high data reliability, yet inevitably at the expense of high repair costs. In practice, storage nodes are usually divided into different racks, and data blocks in storage ...
Metrics: Total Citations: 2, Total Downloads: 133, Last 12 Months: 46, Last 6 weeks: 7

research-article
January 2023
TileSpMSpV: A Tiled Algorithm for Sparse Matrix-Sparse Vector Multiplication on GPUs
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 9, Pages 1–11, https://doi.org/10.1145/3545008.3545028

Sparse matrix-sparse vector multiplication (SpMSpV) is an important primitive for graph algorithms and machine learning applications. The sparsity of the input and output vectors makes its floating point efficiency in general lower than sparse matrix-...
Metrics: Total Citations: 5, Total Downloads: 225, Last 12 Months: 83, Last 6 weeks: 5

research-article
January 2023
Regularizing Sparse and Imbalanced Communications for Voxel-based Brain Simulations on Supercomputers
- Yuhao Liu,
- Xin Du,
- Zhihui Lu,
- Qiang Duan,
- Jianfeng Feng,
- Minglong Wang,
- Jie Wu
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing, Article No.: 81, Pages 1–11, https://doi.org/10.1145/3545008.3545019

Inter-process communications form a performance bottleneck for large-scale brain simulations. The sparse and imbalanced communication patterns of the human brain make it particularly challenging to design a communication system for supporting large-scale ...
Metrics: Total Citations: 3, Total Downloads: 113, Last 12 Months: 37, Last 6 weeks: 3
