No abstract available.
Proceeding Downloads
Enhancing Effective Bidirectional Isolation for Function Fusion in Serverless Architectures
Serverless computing has emerged as a popular paradigm in modern cloud environments, offering flexibility and scalability to tenants. A serverless function might handle sensitive tenant data. Employing Trusted Execution Environment (TEE) techniques to ...
On the Semantic Overlap of Operators in Stream Processing Engines
Stream Processing Engines (SPEs) extract value from data streams in the Edge-to-Cloud continuum through graphs of operators that progressively transform data.
State-of-the-art SPEs are bridged into shared models based on their overlapping APIs. The ...
STRATA: Random Forests going Serverless
Serverless computing has received growing interest in recent years for supporting large-scale machine learning tasks. However, training a machine learning model in a serverless environment is a nontrivial procedure and several challenges still need to be ...
Optimal Resource Efficiency with Fairness in Heterogeneous GPU Clusters
Ensuring the highest training throughput to maximize resource efficiency, while maintaining fairness among users, is critical for deep learning (DL) training in heterogeneous GPU clusters. However, current DL schedulers provide only limited fairness ...
L3: Latency-aware Load Balancing in Multi-Cluster Service Mesh
Microservice architectures and service meshes have become highly popular and face increasingly stringent scalability and dependability requirements. To achieve low-latency service execution and maximize performance, service providers of large-scale ...
sMVX: Multi-Variant Execution on Selected Code Paths
Multi-Variant Execution (MVX) is an effective way to detect memory corruption vulnerabilities, intrusions, or live software updates. A traditional MVX system concurrently runs multiple copies of functionally identical, layout-different program variants. ...
SpotVerse: Optimizing Bioinformatics Workflows with Multi-Region Spot Instances in Galaxy and Beyond
As demand for cloud computing in bioinformatics increases, various studies have explored options for running large-scale workloads with reduced costs, often leveraging spot instances in multi-region deployments. For example, spot instances offer lower ...
FLEdge: Benchmarking Federated Learning Applications in Edge Computing Systems
Federated Learning (FL) has become a viable technique for realizing privacy-enhancing distributed deep learning on the network edge. Heterogeneous hardware, unreliable client devices, and energy constraints often characterize edge computing systems. In ...
UTwinVM: Reliable hints on the effects of hypervisor updates on VMs in the Cloud
We investigate the problem of getting hints on the effects of virtualization system (aka hypervisor) updates impact on virtual machines (VMs). System administrators can be reluctant to apply updates due to vague hints regarding the updates' impact on ...
Dexter: A Performance-Cost Efficient Resource Allocation Manager for Serverless Data Analytics
- Anna Maria Nestorov,
- Diego Marrón,
- Alberto Gutierrez-Torre,
- Chen Wang,
- Claudia Misale,
- Alaa Youssef,
- David Carrera,
- Josep Lluís Berral
Leveraging serverless platforms for the efficient execution of distributed data analytics frameworks, such as Apache Spark [3], has gained substantial interest since early 2022. The elasticity, free-of-management, and on-demand scalability of serverless ...
BASS: A Resource Orchestrator to Account for Vagaries in Network Conditions in Community Wi-Fi Mesh
- Manasvini Sethuraman,
- Anirudh Sarma,
- Netra Ghaisas,
- Adwait Bauskar,
- Ashutosh Dhekne,
- Anand Sivasubramaniam,
- Kishore Ramachandran
We investigate the issue of deploying applications on a set of loosely coupled compute devices, connected through a wireless mesh, typical in community networks. Wireless mesh networks experience significant temporal and spatial variations in link ...
Near-Storage Processing in FaaS Environments with Funclets
Serverless computing has disrupted how computation is performed in the Cloud. The ability to write Functions, and not care about infrastructure brings many benefits, including significantly lower deployment costs, improved developer workflow, scalability,...
Chasing Lightspeed Consensus: Fast Wide-Area Byzantine Replication with Mercury
Blockchain technology sparked renewed interest in planetary-scale Byzantine fault-tolerant (BFT) state machine replication (SMR). While recent works predominantly focused on improving the scalability and throughput of these protocols, few of them ...
In Serverless, OS Scheduler Choice Costs Money: A Hybrid Scheduling Approach for Cheaper FaaS
In Function-as-a-Service (FaaS) serverless, large applications are split into short-lived stateless functions. Deploying functions is mutually profitable: users need not be concerned with resource management, while providers can keep their servers at ...
Menos: Split Fine-Tuning Large Language Models with Efficient GPU Memory Sharing
Fine-tuning of pre-trained large language models has become increasingly popular, yet existing fine-tuning methods are typically centralized, requiring users to send local data to centralized servers, or model owners to open-source their models. However, ...
Privagic: automatic code partitioning with explicit secure typing
Partitioning a multi-threaded application between a secure and a non-secure memory zone remains a challenge. The current tools rely on data flow analysis techniques, which are unable to handle multi-threaded C or C++ applications. To avoid this ...
Towards SLO-Compliant and Cost-Effective Serverless Computing on Emerging GPU Architectures
- Vivek M. Bhasi,
- Aakash Sharma,
- Rishabh Jain,
- Jashwant Raj Gunasekaran,
- Ashutosh Pattnaik,
- Mahmut Taylan Kandemir,
- Chita Das
Serverless platforms are supporting an increasing variety of applications (apps). Among these, apps such as Machine Learning (ML) inference serving can benefit significantly from leveraging accelerators like GPUs. Yet, major serverless providers, despite ...
B-Side: Binary-Level Static System Call Identification
System call filtering is widely used to secure programs in multi-tenant environments, and to sandbox applications in modern desktop software deployment and package management systems. Filtering rules are hard to write and maintain manually, hence ...
Amalgam: A Framework for Obfuscated Neural Network Training on the Cloud
Training a proprietary Neural Network (NN) model with a proprietary dataset on the cloud comes at the risk of exposing the model architecture and the dataset to the cloud service provider. To tackle this problem, in this paper, we present an NN ...
zkStream: a Framework for Trustworthy Stream Processing
In stream processing, managing sensitive information in a timely manner while ensuring trust remains a significant challenge. When parties without a priori trust cooperate to execute a streaming application, it is difficult to ensure that sensitive data ...
QuickDrop: Efficient Federated Unlearning via Synthetic Data Generation
Federated Unlearning (FU) aims to delete specific training data from an ML model trained using Federated Learning (FL). However, existing FU methods suffer from inefficiencies due to the high costs associated with gradient recomputation and storage. This ...
RoleML: a Role-Oriented Programming Model for Customizable Distributed Machine Learning on Edges
Edge AI aims to enable distributed machine learning (DML) on edge resources to fulfill the need for data privacy and low latency. Meanwhile, the challenge of device heterogeneity and discrepancy in data distribution requires more sophisticated DML ...
SeqCDC: Hashless Content-Defined Chunking for Data Deduplication
Data deduplication is critical to cloud storage providers and is widely employed to conserve server-side storage space. Data chunking is an important aspect of deduplication, being directly responsible for storage space savings and end-to-end system ...
Cannikin: Optimal Adaptive Distributed DNN Training over Heterogeneous Clusters
Adjusting batch sizes and adaptively tuning other hyperparameters can significantly speed up deep neural network (DNN) training. Despite the ubiquity of heterogeneous clusters, existing adaptive DNN training techniques solely consider homogeneous ...
Guardian: Safe GPU Sharing in Multi-Tenant Environments
- Manos Pavlidakis,
- Giorgos Vasiliadis,
- Stelios Mavridis,
- Anargyros Argyros,
- Antony Chazapis,
- Angelos Bilas
Modern GPU applications, such as machine learning (ML), can only partially utilize GPUs, leading to GPU underutilization in cloud environments. Sharing GPUs across multiple applications from different tenants can improve resource utilization and ...
Targeting Tail Latency in Replicated Systems with Proactive Rejection
When put under stress, traditional state-machine replication protocols typically exhibit response times that by far exceed the average level of normal-case operation. The common way to mitigate such overload-induced tail latency is to overprovision ...
Consensus-Agnostic State-Machine Replication
State-machine replication (SMR) is a popular fault-tolerance technique for building highly-available services. Usually, consensus protocols are used to enforce a deterministic service-request ordering among replicas, in order to prevent their state from ...
Ripple: Large-Scale Service and Configuration Management in the Cloud
Microservice architectures backed by container technology have been widely used in many real-world cloud-native applications. By enabling customers to manage their services and configurations in the cloud in a centralized, externalized, and dynamic ...
Spyker: Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients
Federated learning (FL) systems enable multiple clients to train a machine learning model iteratively through synchronously exchanging the intermediate model weights with a single server. The scalability of such FL systems can be limited by two factors: ...
PvCC: A vCPU Scheduling Policy for DPDK-applied Systems at Multi-Tenant Edge Data Centers
This paper explores a practical means to employ Data Plane Development Kit (DPDK), a kernel-bypassing framework for packet processing, in resource-limited multi-tenant edge data centers. The problem is that the traditional virtual CPU (vCPU) schedulers ...
Index Terms
- Proceedings of the 25th International Middleware Conference
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
Middleware '22 | 21 | 8 | 38% |
Middleware '17 | 85 | 20 | 24% |
Middleware '17 | 20 | 7 | 35% |
Middleware '17 | 17 | 12 | 71% |
Middleware Industry '15 | 20 | 4 | 20% |
Middleware '15 | 118 | 23 | 19% |
Middleware '14 | 144 | 27 | 19% |
Middleware '12 | 18 | 13 | 72% |
Middleware '08 | 117 | 21 | 18% |
Middleware '07 | 108 | 22 | 20% |
Middleware '06 | 122 | 21 | 17% |
Middleware '03 | 158 | 25 | 16% |
Overall | 948 | 203 | 21% |