Adaptive Data Placement in Multi-Cloud Storage: A Non-Stationary Combinatorial Bandit Approach
Multi-cloud storage has recently become a viable approach to solving the vendor lock-in, reliability, and security issues in cloud storage systems. As a key concern, data placement influences the cost and performance of storage services. Yet, in practice it remains ...
Multi-SP Network Slicing Parallel Relieving Edge Network Conflict
Network slicing is rapidly prevailing in the edge network, which provides computing, network, and storage resources for various services. When multiple service providers (SPs) respond to their tenants in parallel, individual decisions on the dynamic ...
SketchINT: Empowering INT With TowerSketch for Per-Flow Per-Switch Measurement
Network measurement is indispensable to network operations. INT solutions that can provide fine-grained per-switch per-packet information serve as promising solutions for per-flow per-switch measurement. The main shortcoming of INT is its high network ...
TDTA: Topology-Based Real-Time DAG Task Allocation on Identical Multiprocessor Platforms
Modern real-time systems contain complex workloads, which are usually modeled as directed acyclic graph (DAG) tasks and deployed on multiprocessor platforms. The complex execution logic of DAG tasks results in excessive schedulability analysis overhead, ...
Back to Homogeneous Computing: A Tightly-Coupled Neuromorphic Processor With Neuromorphic ISA
- Zhijie Yang,
- Lei Wang,
- Wei Shi,
- Yao Wang,
- Junbo Tie,
- Feng Wang,
- Xiang Yu,
- Linghui Peng,
- Chao Xiao,
- Xun Xiao,
- Yao Yao,
- Gan Zhou,
- Xuhu Yu,
- Rui Gong,
- Xia Zhao,
- Yuhua Tang,
- Weixia Xu
In recent years, neuromorphic processors have been widely used in many scenarios, showing extreme energy efficiency over traditional architectures. However, almost all existing neuromorphic hardware follows the heterogeneous computing methodology without ...
Divide&Content: A Fair OS-Level Resource Manager for Contention Balancing on NUMA Multicores
Chip multicore processors (CMPs) constitute the cherry-picked architecture for high-performance servers employed in supercomputers and cloud datacenters. In the last few years, Non-Uniform Memory Access (NUMA) multicore systems have become the dominant ...
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms
- Rong Gu,
- Zhihao Xu,
- Yang Che,
- Xu Wang,
- Haipeng Dai,
- Kai Zhang,
- Bin Fan,
- Haojun Hou,
- Li Yi,
- Yu Ding,
- Yihua Huang,
- Guihai Chen
Nowadays, it is prevalent to train deep learning models in cloud-native platforms that actively leverage containerization and orchestration technologies for high elasticity, low and flexible operation cost, and many other benefits. However, it also faces ...
Automatic Multi-Parameter Performance Modeling of HPC Applications on a New Sunway Supercomputer
As the successor to Sunway TaihuLight, the new Sunway supercomputer has ultra-high computing capacity, but the unique heterogeneous architecture presents performance optimization challenges for High Performance Computing (HPC) applications. Performance ...
UMA-MF: A Unified Multi-CPU/GPU Asynchronous Computing Framework for SGD-Based Matrix Factorization
Recent research has shown that collaborative computing of CPUs and GPUs in the same system can effectively accelerate large-scale SGD-based matrix factorization (MF), but it faces the problem of limited scalability due to parameter synchronization in the ...
Joint Deployment and Request Routing for Microservice Call Graphs in Data Centers
Microservices are an architectural and organizational paradigm for Internet application development. In cloud data centers, delay-sensitive applications receive massive user requests, which are fed into multiple queues and subsequently served by multiple ...