Search | arXiv e-print repository

Multi-FedLS: a Framework for Cross-Silo Federated Learning Applications on Multi-Cloud Environments

Authors: Rafaela C. Brum, Maria Clicia Stelling de Castro, Luciana Arantes, Lúcia Maria de A. Drummond, Pierre Sens

Abstract: Federated Learning (FL) is a distributed Machine Learning (ML) technique that can benefit from cloud environments while preserving data privacy. We propose Multi-FedLS, a framework that manages multi-cloud resources, reducing execution time and financial costs of Cross-Silo Federated Learning applications by using preemptible VMs, cheaper than on-demand ones but that can be revoked at any time. Ou… ▽ More Federated Learning (FL) is a distributed Machine Learning (ML) technique that can benefit from cloud environments while preserving data privacy. We propose Multi-FedLS, a framework that manages multi-cloud resources, reducing execution time and financial costs of Cross-Silo Federated Learning applications by using preemptible VMs, cheaper than on-demand ones but that can be revoked at any time. Our framework encloses four modules: Pre-Scheduling, Initial Mapping, Fault Tolerance, and Dynamic Scheduler. This paper extends our previous work \cite{brum2022sbac} by formally describing the Multi-FedLS resource manager framework and its modules. Experiments were conducted with three Cross-Silo FL applications on CloudLab and a proof-of-concept confirms that Multi-FedLS can be executed on a multi-cloud composed by AWS and GCP, two commercial cloud providers. Results show that the problem of executing Cross-Silo FL applications in multi-cloud environments with preemptible VMs can be efficiently resolved using a mathematical formulation, fault tolerance techniques, and a simple heuristic to choose a new VM in case of revocation. △ Less

Submitted 17 August, 2023; originally announced August 2023.

Comments: In review by Journal of Parallel and Distributed Computing

arXiv:2110.10666 [pdf, other]

Efficient Consensus-Free Weight Reassignment for Atomic Storage (Extended Version)

Authors: Hasan Heydari, Guthemberg Silvestre, Luciana Arantes

Abstract: Weighted voting is a conventional approach to improving the performance of replicated systems based on commonly-used majority quorum systems in heterogeneous environments. In long-lived systems, a weight reassignment protocol is required to reassign weights over time in order to accommodate performance variations accordingly. The weight reassignment protocol should be consensus-free in asynchronou… ▽ More Weighted voting is a conventional approach to improving the performance of replicated systems based on commonly-used majority quorum systems in heterogeneous environments. In long-lived systems, a weight reassignment protocol is required to reassign weights over time in order to accommodate performance variations accordingly. The weight reassignment protocol should be consensus-free in asynchronous failure-prone systems because of the impossibility of solving consensus in such systems. This paper presents an efficient consensus-free weight reassignment protocol for atomic storage systems in heterogeneous, dynamic, and asynchronous message-passing systems. An experimental evaluation shows that the proposed protocol improves the performance of atomic read/write storage implemented by majority quorum systems compared with previous solutions. △ Less

Submitted 2 December, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2011.05042 [pdf, other]

Scheduling Bag-of-Tasks in Clouds using Spot and Burstable Virtual Machines

Authors: Luan Teylo, Luciana Arantes, Pierre Sens, Lúcia Maria de A. Drummond

Abstract: Leading Cloud providers offer several types of Virtual Machines (VMs) in diverse contract models, with different guarantees in terms of availability and reliability. Among them, the most popular contract models are the on-demand and the spot models. In the former, on-demand VMs are allocated for a fixed cost per time unit, and their availability is ensured during the whole execution. On the other… ▽ More Leading Cloud providers offer several types of Virtual Machines (VMs) in diverse contract models, with different guarantees in terms of availability and reliability. Among them, the most popular contract models are the on-demand and the spot models. In the former, on-demand VMs are allocated for a fixed cost per time unit, and their availability is ensured during the whole execution. On the other hand, in the spot market, VMs are offered with a huge discount when compared to the on-demand VMs, but their availability fluctuates according to the cloud's current demand that can terminate or hibernate a spot VM at any time. Furthermore, in order to cope with workload variations, cloud providers have also introduced the concept of burstable VMs which are able to burst up their respective baseline CPU performance during a limited period of time with an up to 20% discount when compared to an equivalent non-burstable on-demand VMs. In the current work, we present the Burst Hibernation-Aware Dynamic Scheduler (Burst-HADS), a framework that schedules and executes tasks of Bag-of-Tasks applications with deadline constraints by exploiting spot and on-demand burstable VMs, aiming at minimizing both the monetary cost and the execution time. Based on ILS metaheuristics, Burst-HADS defines an initial scheduling map of tasks to VMs which can then be dynamically altered by migrating tasks of a hibernated spot VM or by performing work-stealing when VMs become idle. Performance results on Amazon EC2 cloud with different applications show that, when compared to a solution that uses only regular on-demand instances, Burst-HADS reduces the monetary cost of the execution and meet the application deadline even in scenarios with high spot hibernation rates. It also reduces the total execution time when compared to a solution that uses only spot and non-burstable on-demand instances. △ Less

Submitted 10 November, 2020; originally announced November 2020.

arXiv:1810.10279 [pdf, other]

A Bag-of-Tasks Scheduler Tolerant to Temporal Failures in Clouds

Authors: Luan Teylo, Lúcia Maria de A. Drummond, Luciana Arantes, Pierre Sens

Abstract: Cloud platforms have emerged as a prominent environment to execute high performance computing (HPC) applications providing on-demand resources as well as scalability. They usually offer different classes of Virtual Machines (VMs) which ensure different guarantees in terms of availability and volatility, provisioning the same resource through multiple pricing models. For instance, in Amazon EC2 clo… ▽ More Cloud platforms have emerged as a prominent environment to execute high performance computing (HPC) applications providing on-demand resources as well as scalability. They usually offer different classes of Virtual Machines (VMs) which ensure different guarantees in terms of availability and volatility, provisioning the same resource through multiple pricing models. For instance, in Amazon EC2 cloud, the user pays per hour for on-demand VMs while spot VMs are unused instances available for lower price. Despite the monetary advantages, a spot VM can be terminated, stopped, or hibernated by EC2 at any moment. Using both hibernation-prone spot VMs (for cost sake) and on-demand VMs, we propose in this paper a static scheduling for HPC applications which are composed by independent tasks (bag-of-task) with deadline constraints. However, if a spot VM hibernates and it does not resume within a time which guarantees the application's deadline, a temporal failure takes place. Our scheduling, thus, aims at minimizing monetary costs of bag-of-tasks applications in EC2 cloud, respecting its deadline and avoiding temporal failures. To this end, our algorithm statically creates two scheduling maps: (i) the first one contains, for each task, its starting time and on which VM (i.e., an available spot or on-demand VM with the current lowest price) the task should execute; (ii) the second one contains, for each task allocated on a VM spot in the first map, its starting time and on which on-demand VM it should be executed to meet the application deadline in order to avoid temporal failures. The latter will be used whenever the hibernation period of a spot VM exceeds a time limit. Performance results from simulation with task execution traces, configuration of Amazon EC2 VM classes, and VMs market history confirms the effectiveness of our scheduling and that it tolerates temporal failures. △ Less

Submitted 24 October, 2018; originally announced October 2018.

arXiv:1706.08302 [pdf, other]

VCube-PS: A Causal Broadcast Topic-based Publish/Subscribe System

Authors: João Paulo de Araujo, Luciana Arantes, Elias P. Duarte Jr., Luiz A. Rodrigues, Pierre Sens

Abstract: In this work we present VCube-PS, a topic-based Publish/Subscribe system built on the top of a virtual hypercube-like topology. Membership information and published messages are broadcast to subscribers (members) of a topic group over dynamically built spanning trees rooted at the publisher. For a given topic, the delivery of published messages respects the causal order. VCube-PS was implemented o… ▽ More In this work we present VCube-PS, a topic-based Publish/Subscribe system built on the top of a virtual hypercube-like topology. Membership information and published messages are broadcast to subscribers (members) of a topic group over dynamically built spanning trees rooted at the publisher. For a given topic, the delivery of published messages respects the causal order. VCube-PS was implemented on the PeerSim simulator, and experiments are reported including a comparison with the traditional Publish/Subscribe approach that employs a single rooted static spanning-tree for message distribution. Results confirm the efficiency of VCube-PS in terms of scalability, latency, number and size of messages. △ Less

Submitted 4 July, 2018; v1 submitted 26 June, 2017; originally announced June 2017.

Comments: Improved text and performance evaluation. Added proof for the algorithms (Section 3.4)

ACM Class: C.2.4

arXiv:1404.6415 [pdf]

The Impact Failure Detector

Authors: Anubis G. M. Rossetto, Cláudio F. R. Geyer, Luciana Arantes, Pierre Sens

Abstract: This work proposes a new and flexible unreliable failure detector whose output is related to the trust level of a set of processes. By expressing the relevance of each process of the set by an impact factor value, our approach allows the tuning of the detector output, making possible a softer or stricter monitoring. The idea behind our proposal is that, according to an acceptable margin of failure… ▽ More This work proposes a new and flexible unreliable failure detector whose output is related to the trust level of a set of processes. By expressing the relevance of each process of the set by an impact factor value, our approach allows the tuning of the detector output, making possible a softer or stricter monitoring. The idea behind our proposal is that, according to an acceptable margin of failures and the impact factor assigned to processes, in some scenarios, the failure of some low impact processes may not change the user confidence in the set of processes, while the crash of a high impact factor process may seriously affect it. We outline the application scenarios and the proposed unreliable failure detector, giving a detailed account of the concept on which it is based. △ Less

Submitted 25 April, 2014; originally announced April 2014.

Comments: EDCC-2014, Fast-Abstracts, failure detector, impact factor, trust level

arXiv:1305.3255 [pdf]

doi 10.5121/ijcsit.2013.5212

Requirements Engineering of a Web Portal using Organizational Semiotics Artifacts and Participatory Practices

Authors: Flávia Linhalis Arantes

Abstract: Software requirements are key elements that contribute to the quality and users satisfaction of the final system. In this work, Requirements Engineering (RE) of web sites is presented using an organizational semiotics perspective. They are shown as being part of an organization, with particular practices, rules and views considering stakeholders several differences and opinions. The main contribut… ▽ More Software requirements are key elements that contribute to the quality and users satisfaction of the final system. In this work, Requirements Engineering (RE) of web sites is presented using an organizational semiotics perspective. They are shown as being part of an organization, with particular practices, rules and views considering stakeholders several differences and opinions. The main contribution of this paper is to relate an experience, from elicitation to validation, showing how organizational semiotics artifacts were exploited in a collaborative and participatory way to RE of a web portal. A case study is described in order to demonstrate the feasibility of using such artifacts to RE when we think about the system as being part of a social organization. △ Less

Submitted 14 May, 2013; originally announced May 2013.

Journal ref: International Journal of Computer Science & Information Technology (IJCSIT) Vol 5, No 2, April 2013

arXiv:cs/0701015 [pdf, ps, other]

Asynchronous Implementation of Failure Detectors with partial connectivity and unknown participants

Authors: Pierre Sens, Luciana Arantes, Mathieu Bouillaguet, Véronique Martin, Fabiola Greve

Abstract: We consider the problem of failure detection in dynamic networks such as MANETs. Unreliable failure detectors are classical mechanisms which provide information about process failures. However, most of current implementations consider that the network is fully connected and that the initial number of nodes of the system is known. This assumption is not applicable to dynamic environments. Further… ▽ More We consider the problem of failure detection in dynamic networks such as MANETs. Unreliable failure detectors are classical mechanisms which provide information about process failures. However, most of current implementations consider that the network is fully connected and that the initial number of nodes of the system is known. This assumption is not applicable to dynamic environments. Furthermore, such implementations are usually timer-based while in dynamic networks there is no upper bound for communication delays since nodes can move. This paper presents an asynchronous implementation of a failure detector for unknown and mobile networks. Our approach does not rely on timers and neither the composition nor the number of nodes in the system are known. We prove that our algorithm can implement failure detectors of class <>S when behavioral properties and connectivity conditions are satisfied by the underlying system. △ Less

Submitted 21 December, 2007; v1 submitted 3 January, 2007; originally announced January 2007.

arXiv:hep-ph/0510223 [pdf, ps, other]

doi 10.1063/1.2176565

Chiral Breit-Wigner

Authors: L. O. Arantes, M. R. Robilotta

Abstract: Chiral symmetry and unitarization are combined into generalized Breit-Wigner expressions describing scalar resonances, which contain free parameters and allow flexible descriptions of masses, widths and pole positions. This theoretical tool is especially designed to be used in analyses of low-energy data. Chiral symmetry and unitarization are combined into generalized Breit-Wigner expressions describing scalar resonances, which contain free parameters and allow flexible descriptions of masses, widths and pole positions. This theoretical tool is especially designed to be used in analyses of low-energy data. △ Less

Submitted 17 October, 2005; originally announced October 2005.

Comments: Talk given at the XI International Conference on Hadron Spectroscopy, Rio de Janeiro, Brazil, August 2005

arXiv:nucl-th/0502001 [pdf, ps, other]

Chiral symmetry and parametrization of scalar resonances

Authors: L. O. Arantes, M. R. Robilotta

Abstract: The linear $\s$-model is used to study the effects of chiral symmetry in unitarized amplitudes incorporating scalar resonances. When just a single resonance is present, we show that the iteration of a chiral tree amplitude by means of regularized two-pion loops preserves the smallness of $\p\p$ interaction at low energies and estimate the importance of pion off-shell contributions. The inclusion… ▽ More The linear $\s$-model is used to study the effects of chiral symmetry in unitarized amplitudes incorporating scalar resonances. When just a single resonance is present, we show that the iteration of a chiral tree amplitude by means of regularized two-pion loops preserves the smallness of $\p\p$ interaction at low energies and estimate the importance of pion off-shell contributions. The inclusion of a second resonance is performed by means of a chiral extension of the linear $\s$-model lagrangian. The new $\p\p$ ampitude at tree level complies with low-energy theorems, depends on a mixing angle and has a zero for a given energy between the resonance masses. The unitarization of this amplitude by means of two-pion loops preserves both its chiral low energy behavior and the position of this zero confirming, in a lagrangian framework, conclusions drawn previously by Törnqvist. Finally, we approximate and generalize our results and give a friendly expression that can be used in the parametrization of $N$ coupled scalar resonances. △ Less

Submitted 1 February, 2005; originally announced February 2005.

Comments: 22 pages, 12 figures

Showing 1–10 of 10 results for author: Arantes, L