LoRA-Pro: Are Low-Rank Adapters Properly Optimized?

Wang, Zhengbo; Liang, Jian; He, Ran; Wang, Zilei; Tan, Tieniu

arXiv:2407.18242 (cs)

[Submitted on 25 Jul 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Abstract: Low-rank adaptation, also known as LoRA, has emerged as a prominent method for parameter-efficient fine-tuning of foundation models. Despite its computational efficiency, LoRA still yields inferior performance compared to full fine-tuning. In this paper, we first uncover a fundamental connection between the optimization processes of LoRA and full fine-tuning: using LoRA for optimization is mathematically equivalent to full fine-tuning using a low-rank gradient for parameter updates, and this low-rank gradient can be expressed in terms of the gradients of the two low-rank matrices in LoRA. Leveraging this insight, we introduce LoRA-Pro, a method that enhances LoRA's performance by strategically adjusting the gradients of these low-rank matrices. This adjustment allows the low-rank gradient to approximate the full fine-tuning gradient more accurately, thereby narrowing the performance gap between LoRA and full fine-tuning. Furthermore, we theoretically derive the optimal solutions for adjusting the gradients of the low-rank matrices and apply them during fine-tuning in LoRA-Pro. We conduct extensive experiments across natural language understanding, dialogue generation, mathematical reasoning, code generation, and image classification tasks, demonstrating that LoRA-Pro substantially improves LoRA's performance, effectively narrowing the gap with full fine-tuning. Code is publicly available.
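
The equivalence stated in the abstract can be illustrated with a small numerical sketch. It assumes the usual LoRA parameterization W = W0 + s * B @ A and plain gradient descent; the names g, A, B, s, lr and the helper lora_equivalent_gradient are hypothetical and chosen for illustration, not taken from the paper's code. The sketch only shows that one LoRA step moves W along a gradient of rank at most 2r, up to a second-order term in the learning rate. It does not implement the LoRA-Pro gradient adjustment itself.

```python
import numpy as np

def lora_equivalent_gradient(g, A, B, s):
    """For W = W0 + s * B @ A and full gradient g = dL/dW, return the
    chain-rule gradients of A and B and the equivalent gradient on W."""
    g_B = s * g @ A.T                    # dL/dB, shape (m, r)
    g_A = s * B.T @ g                    # dL/dA, shape (r, n)
    g_tilde = s * (g_B @ A + B @ g_A)    # low-rank gradient acting on W
    return g_A, g_B, g_tilde

rng = np.random.default_rng(0)
m, n, r, s = 8, 6, 2, 0.5                # illustrative sizes (assumption)
A = rng.normal(size=(r, n))
B = rng.normal(size=(m, r))
g = rng.normal(size=(m, n))              # stand-in for the full fine-tuning gradient

g_A, g_B, g_tilde = lora_equivalent_gradient(g, A, B, s)

# One gradient-descent step on A and B, and the change it induces in W.
lr = 1e-4
delta_W = s * ((B - lr * g_B) @ (A - lr * g_A)) - s * (B @ A)

# To first order in lr, delta_W equals -lr * g_tilde.
first_order_error = np.linalg.norm(delta_W + lr * g_tilde)
rank = np.linalg.matrix_rank(g_tilde)

print("rank of equivalent gradient:", rank, "(at most", 2 * r, ")")
print("first-order mismatch:", first_order_error)
print("gap to full gradient:", np.linalg.norm(g_tilde - g))
```

The last printed quantity is the distance between the low-rank gradient and the full gradient, which is the gap the abstract describes LoRA-Pro as reducing by adjusting the gradients of A and B.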

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.18242 [cs.LG]
	(or arXiv:2407.18242v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.18242

Submission history

From: Zhengbo Wang
[v1] Thu, 25 Jul 2024 17:57:12 UTC (18 KB)
[v2] Tue, 15 Oct 2024 17:58:24 UTC (196 KB)
