Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Bai, Sikai; Li, Shuaicheng; Zhuang, Weiming; Zhang, Jie; Guo, Song; Yang, Kunlin; Hou, Jun; Zhang, Shuai; Gao, Junyu; Yi, Shuai

Computer Science > Machine Learning

arXiv:2307.05358 (cs)

[Submitted on 11 Jul 2023 (v1), last revised 11 Mar 2024 (this version, v3)]

Title:Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Authors:Sikai Bai, Shuaicheng Li, Weiming Zhuang, Jie Zhang, Song Guo, Kunlin Yang, Jun Hou, Shuai Zhang, Junyu Gao, Shuai Yi

View PDF HTML (experimental)

Abstract:Federated learning has become a popular method to learn from decentralized heterogeneous data. Federated semi-supervised learning (FSSL) emerges to train models from a small fraction of labeled data due to label scarcity on decentralized clients. Existing FSSL methods assume independent and identically distributed (IID) labeled data across clients and consistent class distribution between labeled and unlabeled data within a client. This work studies a more practical and challenging scenario of FSSL, where data distribution is different not only across clients but also within a client between labeled and unlabeled data. To address this challenge, we propose a novel FSSL framework with dual regulators, FedDure. FedDure lifts the previous assumption with a coarse-grained regulator (C-reg) and a fine-grained regulator (F-reg): C-reg regularizes the updating of the local model by tracking the learning effect on labeled data distribution; F-reg learns an adaptive weighting scheme tailored for unlabeled instances in each client. We further formulate the client model training as bi-level optimization that adaptively optimizes the model in the client with two regulators. Theoretically, we show the convergence guarantee of the dual regulators. Empirically, we demonstrate that FedDure is superior to the existing methods across a wide range of settings, notably by more than 11 on CIFAR-10 and CINIC-10 datasets.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2307.05358 [cs.LG]
	(or arXiv:2307.05358v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.05358
Journal reference:	The 38th Annual AAAI Conference on Artificial Intelligence, 2024

Submission history

From: Sikai Bai [view email]
[v1] Tue, 11 Jul 2023 15:45:03 UTC (1,599 KB)
[v2] Sun, 16 Jul 2023 10:10:10 UTC (1,599 KB)
[v3] Mon, 11 Mar 2024 15:48:08 UTC (365 KB)

Computer Science > Machine Learning

Title:Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators