Architectures

Applied Filters

People

Publications

Conferences

Reproducibility Badges

Publication Date

15 Results for: Book/Issue: ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Edit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,766,572 records)|Limit your search to The ACM Full-Text Collection (759,386 records)

Showing 1 - 15of15 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
February 2024
Explainable-DSE: An Agile and Explainable Exploration of Efficient HW/SW Codesigns of Deep Learning Accelerators Using Bottleneck Analysis
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 87–107https://doi.org/10.1145/3623278.3624772

Effective design space exploration (DSE) is paramount for hardware/software codesigns of deep learning accelerators that must meet strict execution constraints. For their vast search space, existing DSE techniques can require excessive trials to obtain a ...
0
441
Metrics
Total Citations0
Total Downloads441
Last 12 Months441
Last 6 weeks60
Get Access
research-article
February 2024
Artifacts Available / v1.1
Artifacts Evaluated & Functional / v1.1
Flame: A Centralized Cache Controller for Serverless Computing
- Yanan Yang,
- Laiping Zhao,
- Yiming Li,
- Shihao Wu,
- Yuechan Hao,
- Yuchi Ma,
- Keqiu Li
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 153–168https://doi.org/10.1145/3623278.3624769

Caching function is a promising way to mitigate coldstart overhead in serverless computing. However, as caching also increases the resource cost significantly, how to make caching decisions is still challenging. We find that the prior "local cache ...
0
420
Metrics
Total Citations0
Total Downloads420
Last 12 Months420
Last 6 weeks58
Get Access
research-article
Open Access
February 2024
HIR: An MLIR-based Intermediate Representation for Hardware Accelerator Description
- Kingshuk Majumder,
- Uday Bondhugula
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 189–201https://doi.org/10.1145/3623278.3624767

The emergence of machine learning, image and audio processing on edge devices has motivated research towards power-efficient custom hardware accelerators. Though FPGAs are an ideal target for custom accelerators, the difficulty of hardware design and the ...
1
756
Metrics
Total Citations1
Total Downloads756
Last 12 Months756
Last 6 weeks125
View online with eReader
PDF
research-article
Open Access
February 2024
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 394–411https://doi.org/10.1145/3623278.3624765

The metadata service (MDS) sits on the critical path for distributed file system (DFS) operations, and therefore it is key to the overall performance of a large-scale DFS. Common "serverful" MDS architectures, such as a single server or cluster of ...
0
525
Metrics
Total Citations0
Total Downloads525
Last 12 Months525
Last 6 weeks81
View online with eReader
PDF
research-article
Open Access
February 2024
VarSaw: Application-tailored Measurement Error Mitigation for Variational Quantum Algorithms
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 362–377https://doi.org/10.1145/3623278.3624764

For potential quantum advantage, Variational Quantum Algorithms (VQAs) need high accuracy beyond the capability of today's NISQ devices, and thus will benefit from error mitigation. In this work we are interested in mitigating measurement errors which ...
1
224
Metrics
Total Citations1
Total Downloads224
Last 12 Months224
Last 6 weeks27
View online with eReader
PDF
research-article
February 2024
CPS: A Cooperative Para-virtualized Scheduling Framework for Manycore Machines
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 43–56https://doi.org/10.1145/3623278.3624762

Today's cloud platforms offer large virtual machine (VM) instances with multiple virtual CPUs (vCPU) on manycore machines. These machines typically have a deep memory hierarchy to enhance communication between cores. Although previous researches have ...
1
321
Metrics
Total Citations1
Total Downloads321
Last 12 Months321
Last 6 weeks35
Get Access
research-article
February 2024
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns
- Zaifeng Pan,
- Zhen Zheng,
- Feng Zhang,
- Ruofan Wu,
- Hao Liang,
- Dalin Wang,
- Xiafei Qiu,
- Junjie Bai,
- Wei Lin,
- Xiaoyong Du
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 268–286https://doi.org/10.1145/3623278.3624761

Embedding columns are important for deep recommendation models to achieve high accuracy, but they can be very time-consuming during inference. Machine learning (ML) compilers are used broadly in real businesses to optimize ML models automatically. ...
0
342
Metrics
Total Citations0
Total Downloads342
Last 12 Months342
Last 6 weeks49
Get Access
research-article
February 2024
Sleuth: A Trace-Based Root Cause Analysis System for Large-Scale Microservices with Graph Neural Networks
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 324–337https://doi.org/10.1145/3623278.3624758

Cloud microservices are being scaled up due to the rising demand for new features and the convenience of cloud-native technologies. However, the growing scale of microservices complicates the remote procedure call (RPC) dependency graph, exacerbates the ...
0
367
Metrics
Total Citations0
Total Downloads367
Last 12 Months367
Last 6 weeks32
Get Access
research-article
Open Access
February 2024
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
LightRidge: An End-to-end Agile Design Framework for Diffractive Optical Neural Networks
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 202–218https://doi.org/10.1145/3623278.3624757

To lower the barrier to diffractive optical neural networks (DONNs) design, exploration, and deployment, we propose LightRidge, the first end-to-end optical ML compilation framework, which consists of (1) precise and differentiable optical physics ...
0
266
Metrics
Total Citations0
Total Downloads266
Last 12 Months266
Last 6 weeks52
View online with eReader
PDF
research-article
Open Access
February 2024
Predict; Don't React for Enabling Efficient Fine-Grain DVFS in GPUs
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 253–267https://doi.org/10.1145/3623278.3624756

With the continuous improvement of on-chip integrated voltage regulators (IVRs) and fast, adaptive frequency control, dynamic voltage-frequency scaling (DVFS) transition times have shrunk from the microsecond to the nanosecond regime, providing immense ...
1
450
Metrics
Total Citations1
Total Downloads450
Last 12 Months450
Last 6 weeks115
View online with eReader
PDF
research-article
February 2024
Artifacts Available / v1.1
DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration
- Zijun Li,
- Chuhao Xu,
- Quan Chen,
- Jieru Zhao,
- Chen Chen,
- Minyi Guo
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 57–72https://doi.org/10.1145/3623278.3624755

Serverless computing that runs functions with auto-scaling is a popular task execution pattern in the cloud-native era. By connecting serverless functions into workflows, tenants can achieve complex functionality. Prior research adopts the control-flow ...
0
427
Metrics
Total Citations0
Total Downloads427
Last 12 Months427
Last 6 weeks54
Get Access
research-article
February 2024
Results Reproduced / v1.1
Artifacts Evaluated & Functional / v1.1
Artifacts Available / v1.1
Supporting Descendants in SIMD-Accelerated JSONPath
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 338–361https://doi.org/10.1145/3623278.3624754

Harnessing the power of SIMD can bring tremendous performance gains in data processing. In querying streamed JSON data, the state of the art leverages SIMD to fast forward significant portions of the document. However, it does not provide support for ...
0
126
Metrics
Total Citations0
Total Downloads126
Last 12 Months126
Last 6 weeks6
Get Access
research-article
Open Access
February 2024
DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 73–86https://doi.org/10.1145/3623278.3624753

Emerging real-time multi-model ML (RTMM) workloads such as AR/VR and drone control involve dynamic behaviors in various granularity; task, model, and layers within a model. Such dynamic behaviors introduce new challenges to the system software in an ML ...
0
922
Metrics
Total Citations0
Total Downloads922
Last 12 Months922
Last 6 weeks156
View online with eReader
PDF
research-article
Open Access
February 2024
Exploiting the Regular Structure of Modern Quantum Architectures for Compiling and Optimizing Programs with Permutable Operators
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 108–124https://doi.org/10.1145/3623278.3624751

A critical feature in today's quantum circuit is that they have permutable two-qubit operators. The flexibility in ordering the permutable two-qubit gates leads to more compiler optimization opportunities. However, it also imposes significant challenges ...
1
333
Metrics
Total Citations1
Total Downloads333
Last 12 Months333
Last 6 weeks41
View online with eReader
PDF
research-article
Open Access
February 2024
Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism
ASPLOS '23: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4Pages 219–237https://doi.org/10.1145/3623278.3624750

The demise of Moore's Law and Dennard Scaling has revived interest in specialized computer architectures and accelerators. Verification and testing of this hardware depend heavily upon cycle-accurate simulation of register-transfer-level (RTL) designs. ...
0
520
Metrics
Total Citations0
Total Downloads520
Last 12 Months520
Last 6 weeks65
View online with eReader
PDF

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Reproducibility Badges

Publication Date

Explainable-DSE: An Agile and Explainable Exploration of Efficient HW/SW Codesigns of Deep Learning Accelerators Using Bottleneck Analysis

Flame: A Centralized Cache Controller for Serverless Computing

HIR: An MLIR-based Intermediate Representation for Hardware Accelerator Description

λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions

VarSaw: Application-tailored Measurement Error Mitigation for Variational Quantum Algorithms

CPS: A Cooperative Para-virtualized Scheduling Framework for Manycore Machines

RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns

Sleuth: A Trace-Based Root Cause Analysis System for Large-Scale Microservices with Graph Neural Networks

LightRidge: An End-to-end Agile Design Framework for Diffractive Optical Neural Networks

Predict; Don't React for Enabling Efficient Fine-Grain DVFS in GPUs

DataFlower: Exploiting the Data-flow Paradigm for Serverless Workflow Orchestration

Supporting Descendants in SIMD-Accelerated JSONPath

DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads

Exploiting the Regular Structure of Modern Quantum Architectures for Compiling and Optimizing Programs with Permutable Operators

Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism