-
Multi-FedLS: a Framework for Cross-Silo Federated Learning Applications on Multi-Cloud Environments
Authors:
Rafaela C. Brum,
Maria Clicia Stelling de Castro,
Luciana Arantes,
Lúcia Maria de A. Drummond,
Pierre Sens
Abstract:
Federated Learning (FL) is a distributed Machine Learning (ML) technique that can benefit from cloud environments while preserving data privacy. We propose Multi-FedLS, a framework that manages multi-cloud resources, reducing the execution time and financial costs of Cross-Silo Federated Learning applications by using preemptible VMs, which are cheaper than on-demand ones but can be revoked at any time. Our framework comprises four modules: Pre-Scheduling, Initial Mapping, Fault Tolerance, and Dynamic Scheduler. This paper extends our previous work \cite{brum2022sbac} by formally describing the Multi-FedLS resource manager framework and its modules. Experiments were conducted with three Cross-Silo FL applications on CloudLab, and a proof-of-concept confirms that Multi-FedLS can be executed on a multi-cloud composed of AWS and GCP, two commercial cloud providers. Results show that the problem of executing Cross-Silo FL applications in multi-cloud environments with preemptible VMs can be efficiently solved using a mathematical formulation, fault tolerance techniques, and a simple heuristic to choose a new VM in case of revocation.
Submitted 17 August, 2023;
originally announced August 2023.
-
Efficient Consensus-Free Weight Reassignment for Atomic Storage (Extended Version)
Authors:
Hasan Heydari,
Guthemberg Silvestre,
Luciana Arantes
Abstract:
Weighted voting is a conventional approach to improving the performance of replicated systems based on commonly-used majority quorum systems in heterogeneous environments. In long-lived systems, a weight reassignment protocol is required to reassign weights over time in order to accommodate performance variations accordingly. The weight reassignment protocol should be consensus-free in asynchronous failure-prone systems because of the impossibility of solving consensus in such systems. This paper presents an efficient consensus-free weight reassignment protocol for atomic storage systems in heterogeneous, dynamic, and asynchronous message-passing systems. An experimental evaluation shows that the proposed protocol improves the performance of atomic read/write storage implemented by majority quorum systems compared with previous solutions.
Submitted 2 December, 2021; v1 submitted 20 October, 2021;
originally announced October 2021.
-
Scheduling Bag-of-Tasks in Clouds using Spot and Burstable Virtual Machines
Authors:
Luan Teylo,
Luciana Arantes,
Pierre Sens,
Lúcia Maria de A. Drummond
Abstract:
Leading Cloud providers offer several types of Virtual Machines (VMs) in diverse contract models, with different guarantees in terms of availability and reliability. Among them, the most popular contract models are the on-demand and the spot models. In the former, on-demand VMs are allocated for a fixed cost per time unit, and their availability is ensured during the whole execution. On the other hand, in the spot market, VMs are offered at a large discount compared to on-demand VMs, but their availability fluctuates according to the cloud's current demand, and a spot VM can be terminated or hibernated at any time. Furthermore, in order to cope with workload variations, cloud providers have also introduced the concept of burstable VMs, which are able to burst above their respective baseline CPU performance during a limited period of time, at a discount of up to 20% compared to an equivalent non-burstable on-demand VM. In the current work, we present the Burst Hibernation-Aware Dynamic Scheduler (Burst-HADS), a framework that schedules and executes tasks of Bag-of-Tasks applications with deadline constraints by exploiting spot and on-demand burstable VMs, aiming at minimizing both the monetary cost and the execution time. Based on ILS metaheuristics, Burst-HADS defines an initial scheduling map of tasks to VMs, which can then be dynamically altered by migrating tasks of a hibernated spot VM or by performing work-stealing when VMs become idle. Performance results on the Amazon EC2 cloud with different applications show that, when compared to a solution that uses only regular on-demand instances, Burst-HADS reduces the monetary cost of the execution and meets the application deadline even in scenarios with high spot hibernation rates. It also reduces the total execution time when compared to a solution that uses only spot and non-burstable on-demand instances.
Submitted 10 November, 2020;
originally announced November 2020.
-
A Bag-of-Tasks Scheduler Tolerant to Temporal Failures in Clouds
Authors:
Luan Teylo,
Lúcia Maria de A. Drummond,
Luciana Arantes,
Pierre Sens
Abstract:
Cloud platforms have emerged as a prominent environment to execute high performance computing (HPC) applications, providing on-demand resources as well as scalability. They usually offer different classes of Virtual Machines (VMs) which ensure different guarantees in terms of availability and volatility, provisioning the same resource through multiple pricing models. For instance, in the Amazon EC2 cloud, the user pays per hour for on-demand VMs, while spot VMs are unused instances available at a lower price. Despite the monetary advantages, a spot VM can be terminated, stopped, or hibernated by EC2 at any moment.
Using both hibernation-prone spot VMs (for the sake of cost) and on-demand VMs, we propose in this paper a static scheduling for HPC applications which are composed of independent tasks (bag-of-tasks) with deadline constraints. However, if a spot VM hibernates and does not resume within a time which guarantees the application's deadline, a temporal failure takes place. Our scheduling thus aims at minimizing the monetary costs of bag-of-tasks applications in the EC2 cloud, respecting their deadlines and avoiding temporal failures. To this end, our algorithm statically creates two scheduling maps: (i) the first one contains, for each task, its starting time and on which VM (i.e., an available spot or on-demand VM with the current lowest price) the task should execute; (ii) the second one contains, for each task allocated on a spot VM in the first map, its starting time and on which on-demand VM it should be executed to meet the application deadline in order to avoid temporal failures. The latter will be used whenever the hibernation period of a spot VM exceeds a time limit.
Performance results from simulation with task execution traces, configuration of Amazon EC2 VM classes, and VM market history confirm the effectiveness of our scheduling and that it tolerates temporal failures.
Submitted 24 October, 2018;
originally announced October 2018.
-
VCube-PS: A Causal Broadcast Topic-based Publish/Subscribe System
Authors:
João Paulo de Araujo,
Luciana Arantes,
Elias P. Duarte Jr.,
Luiz A. Rodrigues,
Pierre Sens
Abstract:
In this work we present VCube-PS, a topic-based Publish/Subscribe system built on top of a virtual hypercube-like topology. Membership information and published messages are broadcast to subscribers (members) of a topic group over dynamically built spanning trees rooted at the publisher. For a given topic, the delivery of published messages respects the causal order. VCube-PS was implemented on the PeerSim simulator, and experiments are reported, including a comparison with the traditional Publish/Subscribe approach that employs a single-rooted static spanning tree for message distribution. Results confirm the efficiency of VCube-PS in terms of scalability, latency, and number and size of messages.
Submitted 4 July, 2018; v1 submitted 26 June, 2017;
originally announced June 2017.
-
The Impact Failure Detector
Authors:
Anubis G. M. Rossetto,
Cláudio F. R. Geyer,
Luciana Arantes,
Pierre Sens
Abstract:
This work proposes a new and flexible unreliable failure detector whose output is related to the trust level of a set of processes. By expressing the relevance of each process of the set by an impact factor value, our approach allows the tuning of the detector output, making possible a softer or stricter monitoring. The idea behind our proposal is that, according to an acceptable margin of failures and the impact factor assigned to processes, in some scenarios, the failure of some low impact processes may not change the user confidence in the set of processes, while the crash of a high impact factor process may seriously affect it. We outline the application scenarios and the proposed unreliable failure detector, giving a detailed account of the concept on which it is based.
Submitted 25 April, 2014;
originally announced April 2014.
-
Requirements Engineering of a Web Portal using Organizational Semiotics Artifacts and Participatory Practices
Authors:
Flávia Linhalis Arantes
Abstract:
Software requirements are key elements that contribute to the quality and user satisfaction of the final system. In this work, Requirements Engineering (RE) of web sites is presented from an organizational semiotics perspective. Web sites are shown as being part of an organization, with particular practices, rules and views, considering stakeholders' differences and opinions. The main contribution of this paper is to report an experience, from elicitation to validation, showing how organizational semiotics artifacts were exploited in a collaborative and participatory way for the RE of a web portal. A case study is described in order to demonstrate the feasibility of using such artifacts for RE when we think about the system as being part of a social organization.
Submitted 14 May, 2013;
originally announced May 2013.
-
Asynchronous Implementation of Failure Detectors with partial connectivity and unknown participants
Authors:
Pierre Sens,
Luciana Arantes,
Mathieu Bouillaguet,
Véronique Martin,
Fabiola Greve
Abstract:
We consider the problem of failure detection in dynamic networks such as MANETs. Unreliable failure detectors are classical mechanisms which provide information about process failures. However, most current implementations assume that the network is fully connected and that the initial number of nodes of the system is known. This assumption is not applicable to dynamic environments. Furthermore, such implementations are usually timer-based, while in dynamic networks there is no upper bound for communication delays since nodes can move. This paper presents an asynchronous implementation of a failure detector for unknown and mobile networks. Our approach does not rely on timers, and neither the composition nor the number of nodes in the system is known. We prove that our algorithm can implement failure detectors of class ◇S when behavioral properties and connectivity conditions are satisfied by the underlying system.
Submitted 21 December, 2007; v1 submitted 3 January, 2007;
originally announced January 2007.
-
Chiral Breit-Wigner
Authors:
L. O. Arantes,
M. R. Robilotta
Abstract:
Chiral symmetry and unitarization are combined into generalized Breit-Wigner expressions describing scalar resonances, which contain free parameters and allow flexible descriptions of masses, widths and pole positions. This theoretical tool is especially designed to be used in analyses of low-energy data.
Submitted 17 October, 2005;
originally announced October 2005.
-
Chiral symmetry and parametrization of scalar resonances
Authors:
L. O. Arantes,
M. R. Robilotta
Abstract:
The linear $\sigma$-model is used to study the effects of chiral symmetry in unitarized amplitudes incorporating scalar resonances. When just a single resonance is present, we show that the iteration of a chiral tree amplitude by means of regularized two-pion loops preserves the smallness of the $\pi\pi$ interaction at low energies and estimate the importance of pion off-shell contributions. The inclusion of a second resonance is performed by means of a chiral extension of the linear $\sigma$-model Lagrangian. The new $\pi\pi$ amplitude at tree level complies with low-energy theorems, depends on a mixing angle and has a zero for a given energy between the resonance masses. The unitarization of this amplitude by means of two-pion loops preserves both its chiral low-energy behavior and the position of this zero, confirming, in a Lagrangian framework, conclusions drawn previously by Törnqvist. Finally, we approximate and generalize our results and give a friendly expression that can be used in the parametrization of $N$ coupled scalar resonances.
Submitted 1 February, 2005;
originally announced February 2005.