default search action
Shwai He
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Hongtai Jing, Zhengtao Gao, Sheng Xu, Tao Shen, Zhangzhi Peng, Shwai He, Tao You, Shuang Ye, Wei Lin, Siqi Sun:
Accurate prediction of antibody function and structure using bio-inspired antibody language model. Briefings Bioinform. 25(4) (2024) - [c9]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. ACL (1) 2024: 14255-14273 - [c8]Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou:
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning. ACL (Findings) 2024: 16189-16211 - [c7]Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu:
Reformatted Alignment. EMNLP (Findings) 2024: 574-597 - [i16]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. CoRR abs/2402.00530 (2024) - [i15]Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou:
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning. CoRR abs/2402.10110 (2024) - [i14]Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu:
Reformatted Alignment. CoRR abs/2402.12219 (2024) - [i13]Shwai He, Tianlong Chen:
RESSA: Repair Sparse Vision-Language Models via Sparse Cross-Modality Adaptation. CoRR abs/2404.02424 (2024) - [i12]Shwai He, Daize Dong, Liang Ding, Ang Li:
Demystifying the Compression of Mixture-of-Experts Through a Unified Framework. CoRR abs/2406.02500 (2024) - [i11]Prajwal Singhania, Siddharth Singh, Shwai He, Soheil Feizi, Abhinav Bhatele:
Loki: Low-Rank Keys for Efficient Sparse Attention. CoRR abs/2406.02542 (2024) - [i10]Shwai He, Guoheng Sun, Zheyu Shen, Ang Li:
What Matters in Transformers? Not All Attention is Needed. CoRR abs/2406.15786 (2024) - [i9]Shwai He, Tao Ge, Guoheng Sun, Bowei Tian, Xiaoyang Wang, Ang Li, Dong Yu:
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers. CoRR abs/2410.13184 (2024) - 2023
- [c6]Shwai He, Liang Ding, Daize Dong, Boan Liu, Fuqiang Yu, Dacheng Tao:
PAD-Net: An Efficient Framework for Dynamic Networks. ACL (1) 2023: 14354-14366 - [c5]Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao:
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts. EMNLP 2023: 14685-14691 - [c4]Chenbo Jiang, Jie Yang, Shwai He, Yu-Kun Lai, Lin Gao:
NeuralSlice: Neural 3D Triangle Mesh Reconstruction via Slicing 4D Tetrahedral Meshes. ICML 2023: 15170-15185 - [c3]Shwai He, Chenbo Jiang, Daize Dong, Liang Ding:
SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution. WACV 2023: 6443-6452 - [i8]Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao:
MerA: Merging Pretrained Adapters For Few-Shot Learning. CoRR abs/2308.15982 (2023) - [i7]Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao:
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts. CoRR abs/2310.09832 (2023) - [i6]Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Heng Huang, Jiuxiang Gu, Tianyi Zhou:
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning. CoRR abs/2310.11716 (2023) - 2022
- [c2]Shwai He, Liang Ding, Daize Dong, Jeremy Zhang, Dacheng Tao:
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters. EMNLP (Findings) 2022: 2184-2190 - [c1]Changtong Zan, Keqin Peng, Liang Ding, Baopu Qiu, Boan Liu, Shwai He, Qingyu Lu, Zheng Zhang, Chuang Liu, Weifeng Liu, Yibing Zhan, Dacheng Tao:
Vega-MT: The JD Explore Academy Machine Translation System for WMT22. WMT 2022: 411-422 - [i5]Shwai He, Yuhang Li, Chenbo Jiang, Shi Gu:
When Sparsity Meets Dynamic Convolution. CoRR abs/2204.02227 (2022) - [i4]Changtong Zan, Keqin Peng, Liang Ding, Baopu Qiu, Boan Liu, Shwai He, Qingyu Lu, Zheng Zhang, Chuang Liu, Weifeng Liu, Yibing Zhan, Dacheng Tao:
Vega-MT: The JD Explore Academy Translation System for WMT22. CoRR abs/2209.09444 (2022) - [i3]Shwai He, Liang Ding, Daize Dong, Miao Zhang, Dacheng Tao:
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters. CoRR abs/2210.04284 (2022) - [i2]Shwai He, Liang Ding, Daize Dong, Boan Liu, Fuqiang Yu, Dacheng Tao:
Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks. CoRR abs/2211.05528 (2022) - 2021
- [i1]Shwai He, Shi Gu:
Multi-modal Attention Network for Stock Movements Prediction. CoRR abs/2112.13593 (2021)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-13 19:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint