Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

Li, Minglei; Ye, Peng; Huang, Yongqi; Zhang, Lin; Chen, Tao; He, Tong; Fan, Jiayuan; Ouyang, Wanli

arXiv:2406.03051 (cs)

[Submitted on 5 Jun 2024 (v1), last revised 6 Jun 2024 (this version, v2)]

Abstract: Parameter-efficient fine-tuning (PEFT) has become increasingly important as foundation models continue to grow in both popularity and size. Adapters have been particularly well received due to their potential for parameter reduction and their adaptability across diverse tasks. However, striking a balance between high efficiency and robust generalization across tasks remains a challenge for adapter-based methods. We analyze existing methods and find that: 1) parameter sharing is the key to reducing redundancy; 2) more tunable parameters, dynamic allocation, and block-specific design are the keys to improving performance. Unfortunately, no previous work considers all of these factors together. Inspired by this insight, we introduce a novel framework named Adapter-X. First, a Sharing Mixture of Adapters (SMoA) module is proposed to achieve token-level dynamic allocation, increased tunable parameters, and inter-block sharing at the same time. Second, block-specific designs such as the Prompt Generator (PG) are introduced to further enhance adaptation ability. Extensive experiments across 2D image and 3D point cloud modalities demonstrate that Adapter-X represents a significant milestone: it is the first to outperform full fine-tuning in both modalities with significantly fewer parameters, i.e., only 0.20% and 1.88% of the original trainable parameters for 2D and 3D classification tasks, respectively. Our code will be publicly available.
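
The abstract does not spell out how SMoA is built, so the following is only a minimal sketch of the three ideas it names (token-level dynamic allocation, a larger pool of tunable parameters, and inter-block sharing), not the authors' implementation. Everything here is an assumption made for illustration: the bottleneck adapter shape, the ReLU activation, top-1 routing, the per-block router, and all function names (`make_adapter`, `route`, `smoa_block`, `count_params`). It uses only the Python standard library.

```python
import random


def make_adapter(dim, rank, rng):
    """Hypothetical bottleneck adapter: dim -> rank -> dim (biases omitted)."""
    down = [[rng.gauss(0.0, 0.02) for _ in range(rank)] for _ in range(dim)]
    # Zero-initialised up-projection, so the adapter initially adds nothing.
    up = [[0.0 for _ in range(dim)] for _ in range(rank)]
    return {"down": down, "up": up}


def matvec(vec, mat):
    """Multiply a row vector by a matrix stored as a list of rows."""
    return [sum(v * row[j] for v, row in zip(vec, mat))
            for j in range(len(mat[0]))]


def apply_adapter(adapter, token):
    """Down-project, apply ReLU, up-project; returns the residual update."""
    hidden = [max(0.0, h) for h in matvec(token, adapter["down"])]
    return matvec(hidden, adapter["up"])


def route(router, token):
    """Assumed top-1 routing: index of the expert with the largest logit."""
    logits = matvec(token, router)
    return max(range(len(logits)), key=lambda i: logits[i])


def smoa_block(tokens, shared_experts, router):
    """Each token picks one adapter from a pool shared by all blocks."""
    out = []
    for tok in tokens:
        expert = shared_experts[route(router, tok)]
        delta = apply_adapter(expert, tok)
        out.append([t + d for t, d in zip(tok, delta)])
    return out


def count_params(dim, rank, num_experts, num_blocks):
    """Compare a shared expert pool against a separate pool per block."""
    adapter = 2 * dim * rank                 # down + up projection weights
    routers = num_blocks * dim * num_experts  # one (assumed) router per block
    shared = num_experts * adapter + routers
    per_block = num_blocks * num_experts * adapter
    return shared, per_block
```

Under these assumptions, `count_params(768, 8, 4, 12)` returns `(86016, 589824)`: sharing one pool of four experts across twelve blocks costs roughly a seventh of giving each block its own pool, while every token can still choose among four adapters. The specific dimensions are illustrative and are not taken from the paper.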

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.03051 [cs.CV]
	(or arXiv:2406.03051v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.03051

Submission history

From: Minglei Li
[v1] Wed, 5 Jun 2024 08:26:44 UTC (5,940 KB)
[v2] Thu, 6 Jun 2024 02:18:41 UTC (5,941 KB)
