
FLEE: A Hierarchical Federated Learning Framework for Distributed Deep Neural Network over Cloud, Edge, and End Device

Published: 13 October 2022

Abstract

With the development of smart devices, the computing capabilities of portable end devices such as mobile phones have been greatly enhanced. Meanwhile, traditional cloud computing faces serious challenges from privacy leakage and time delay, so there is a trend toward pushing models down to edges and end devices. However, because of their limited computing resources, it is difficult for end devices to complete complex computing tasks alone. Therefore, this article divides the model into two parts and deploys them on multiple end devices and edges, respectively. An early exit is also set to reduce computing resource overhead, forming a hierarchical distributed architecture. To enable the distributed model to evolve continuously using new data generated by end devices, we comprehensively consider the various data distributions on end devices and edges and propose a hierarchical federated learning framework, FLEE, which realizes dynamic model updates without redeployment. Through image and sentence classification experiments, we verify that FLEE improves model performance under all kinds of data distributions, and show that, compared with other frameworks, models trained by FLEE consume less global computing resource in the inference stage.
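The split-with-early-exit inference described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the weight names, layer shapes, and the confidence threshold are all hypothetical.

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    e = np.exp(z - z.max())
    return e / e.sum()

def device_head(x, w_head, w_exit):
    """End device: shallow layers plus an early-exit classifier."""
    h = np.tanh(x @ w_head)          # intermediate features
    probs = softmax(h @ w_exit)      # early-exit prediction
    return h, probs

def edge_tail(h, w_tail):
    """Edge: deeper layers applied to the offloaded features."""
    return softmax(np.tanh(h @ w_tail[0]) @ w_tail[1])

def infer(x, params, threshold=0.8):
    """Exit on the device if the early classifier is confident,
    otherwise offload the intermediate features to the edge."""
    h, probs = device_head(x, params["head"], params["exit"])
    if probs.max() >= threshold:
        return int(probs.argmax()), "device"
    return int(edge_tail(h, params["tail"]).argmax()), "edge"
```

A sample that the early exit handles confidently never leaves the device, which is how the architecture saves global computing resource at inference time.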

Appendix

Algorithm FedAvg illustrates the aggregation process. Its inputs are the number of clients \(M\), the number of local iterations \(E\), the training batch size \(B\), the learning rate \(\eta\), and the total number of aggregation rounds \(K\). First, each client downloads the initialized model \(G_0\) from the server. Next, each client trains the model locally for \(E\) iterations with \(ClientUpdate\) and sends the result to the server for aggregation. The server then computes a weighted sum of the client models according to each client's share of the data, and distributes the aggregated model \(G^{j+1}\) back to the clients for further training. This process repeats for \(K\) rounds. It is worth noting that only a subset of clients participates in each round, and the whole training process uses gradient descent.
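The aggregation loop described above can be sketched as follows. This is a hedged toy version, assuming a simple linear least-squares model so that local training fits in a few lines; the function names and hyperparameter defaults are illustrative, not taken from the paper.

```python
import numpy as np

def client_update(weights, data, epochs=1, batch_size=32, lr=0.1):
    """Local mini-batch SGD on one client (toy least-squares model)."""
    w = weights.copy()
    X, y = data
    for _ in range(epochs):
        order = np.random.permutation(len(X))
        for start in range(0, len(X), batch_size):
            b = order[start:start + batch_size]
            grad = X[b].T @ (X[b] @ w - y[b]) / len(b)
            w -= lr * grad
    return w

def fedavg(global_w, client_data, rounds=5, frac=0.5, epochs=1):
    """Server loop: sample a subset of clients each round, then average
    their models weighted by each client's share of the selected data."""
    for _ in range(rounds):
        m = max(1, int(frac * len(client_data)))          # partial participation
        chosen = np.random.choice(len(client_data), m, replace=False)
        n_sel = sum(len(client_data[k][0]) for k in chosen)
        new_w = np.zeros_like(global_w)
        for k in chosen:
            local_w = client_update(global_w, client_data[k], epochs)
            new_w += (len(client_data[k][0]) / n_sel) * local_w
        global_w = new_w                                   # distribute G^{j+1}
    return global_w
```

The data-proportional weights in the averaging step correspond to the "sums up all the models according to the data proportion of each client" step in the algorithm.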




        Published In

        ACM Transactions on Intelligent Systems and Technology, Volume 13, Issue 5
        October 2022, 424 pages
        ISSN: 2157-6904
        EISSN: 2157-6912
        DOI: 10.1145/3542930
        Editor: Huan Liu

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 13 October 2022
        Online AM: 17 May 2022
        Accepted: 30 January 2022
        Revised: 25 January 2022
        Received: 01 May 2021
        Published in TIST Volume 13, Issue 5


        Author Tags

        1. Federated learning
        2. early exit of inference
        3. distributed neural network

        Qualifiers

        • Research-article
        • Refereed

        Funding Sources

        • National Natural Science Foundation of China
        • Scientific Research Project of National University of Defense Technology

        Article Metrics

        • Downloads (Last 12 months)316
        • Downloads (Last 6 weeks)27
        Reflects downloads up to 23 Feb 2025

        Cited By

        • (2024) ATHENA-FL: Avoiding Statistical Heterogeneity with One-versus-All in Federated Learning. Journal of Internet Services and Applications 15, 1, 273–288. DOI: 10.5753/jisa.2024.3826. Online: 14 Aug 2024.
        • (2024) Advancements in Federated Learning: Models, Methods, and Privacy. ACM Computing Surveys. DOI: 10.1145/3664650. Online: 1 Jun 2024.
        • (2024) Low-Latency Hierarchical Federated Learning in Wireless Edge Networks. IEEE Internet of Things Journal 11, 4, 6943–6960. DOI: 10.1109/JIOT.2023.3314743. Online: 15 Feb 2024.
        • (2024) QuAsyncFL: Asynchronous Federated Learning With Quantization for Cloud–Edge–Terminal Collaboration Enabled AIoT. IEEE Internet of Things Journal 11, 1, 59–69. DOI: 10.1109/JIOT.2023.3290818. Online: 1 Jan 2024.
        • (2024) Toward a Scalable and Energy-Efficient Framework for Industrial Cloud-Edge-IoT Continuum. IEEE Internet of Things Magazine 7, 5, 14–20. DOI: 10.1109/IOTM.001.2300229. Online: Sep 2024.
        • (2024) Hydra. Journal of Systems Architecture 147, C. DOI: 10.1016/j.sysarc.2023.103052. Online: 17 Apr 2024.
        • (2024) Federated fusion learning with attention mechanism for multi-client medical image analysis. Information Fusion 108, 102364. DOI: 10.1016/j.inffus.2024.102364. Online: Aug 2024.
        • (2024) Streaming traffic classification: a hybrid deep learning and big data approach. Cluster Computing 27, 4, 5165–5193. DOI: 10.1007/s10586-023-04234-0. Online: 18 Jan 2024.
        • (2023) Deep neural networks in the cloud. Neurocomputing 545, C. DOI: 10.1016/j.neucom.2023.126327. Online: 7 Aug 2023.
