MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

Ma, Chi; Huang, Mincong; Wang, Chao; Wang, Yujie; Yu, Lei; Liu, Chuan; Lin, Wei

Computer Science > Machine Learning

arXiv:2406.12569v1 (cs)

[Submitted on 18 Jun 2024 (this version), latest version 28 Jun 2024 (v2)]

Title:MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

Authors:Chi Ma, Mincong Huang, Chao Wang, Yujie Wang, Lei Yu, Chuan Liu, Wei Lin

View PDF HTML (experimental)

Abstract:Massive Over-activation Yielded Uplifts(MOYU) is an inherent property of large language models, and dynamic activation(DA) based on the MOYU property is a clever yet under-explored strategy designed to accelerate inference in these models. Existing methods that utilize MOYU often face a significant 'Impossible Trinity': struggling to simultaneously maintain model performance, enhance inference speed, and extend applicability across various architectures. Due to the theoretical ambiguities surrounding MOYU, this paper elucidates the root cause of the MOYU property and outlines the mechanisms behind two primary limitations encountered by current DA methods: 1) history-related activation uncertainty, and 2) semantic-irrelevant activation inertia. Our analysis not only underscores the limitations of current dynamic activation strategies within large-scale LLaMA models but also proposes opportunities for refining the design of future sparsity schemes.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.12569 [cs.LG]
	(or arXiv:2406.12569v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.12569

Submission history

From: Yujie Wang [view email]
[v1] Tue, 18 Jun 2024 12:57:33 UTC (2,027 KB)
[v2] Fri, 28 Jun 2024 07:23:16 UTC (2,027 KB)

Computer Science > Machine Learning

Title:MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:MOYU: A Theoretical Study on Massive Over-activation Yielded Uplifts in LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators