Conghui He

Cited by

	All	Since 2019
Citations	3694	3620
h-index	28	28
i10-index	40	39

Citations per year

Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	22	43	84	86	121	181	498	2578

Public access

16 articles, 5 articles (available, not available)

Based on funding mandates

Co-authors

Dahua Lin - The Chinese University of Hong Kong - Verified email at ie.cuhk.edu.hk
Weijia Li - Associate Professor, Sun Yat-Sen University - Verified email at mail.sysu.edu.cn
Jiaqi Wang - Shanghai AI Laboratory - Verified email at pjlab.org.cn
Haohuan Fu - Tsinghua University - Verified email at tsinghua.edu.cn
Yu Qiao - Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS - Verified email at siat.ac.cn
Kai Chen - Shanghai AI Laboratory - Verified email at pjlab.org.cn
Jiarui Fang (方佳瑞) - Tencent - Verified email at tencent.com
Juepeng Zheng (郑珏鹏) - Assistant Professor, Sun Yat-Sen University - Verified email at mail.sysu.edu.cn
Wayne Luk - Professor of Computer Engineering, Imperial College London - Verified email at imperial.ac.uk
Wei Xue - Tsinghua University - Verified email at tsinghua.edu.cn
Lin Gan - Tsinghua University - Verified email at tsinghua.edu.cn
Peng Gong - The University of Hong Kong, http://orcid.org/0000-0003-1513-3765 - Verified email at hku.hk
Xiaomeng Huang - Tsinghua University - Verified email at tsinghua.edu.cn

Conghui He

Shanghai AI Laboratory

Verified email at pjlab.org.cn

Data-centric AI, LLM


Title	Authors	Venue	Cited by	Year
MMBench: Is your multi-modal model an all-around player?	Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...	European Conference on Computer Vision, 216-233, 2025	506	2025
LLaMA-Adapter V2: Parameter-efficient visual instruction model	P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ...	arXiv preprint arXiv:2304.15010, 2023	466	2023
ShareGPT4V: Improving large multi-modal models with better captions	L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin	arXiv preprint arXiv:2311.12793, 2023	281	2023
Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data	W Li, C He, J Fang, J Zheng, H Fu, L Yu	Remote Sensing 11 (4), 403, 2019	238	2019
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites	Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...	arXiv preprint arXiv:2404.16821, 2024	162	2024
PersFormer: 3D lane detection via perspective transformer and the OpenLane benchmark	L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng, H Li, C He, J Shi, Y Qiao, ...	European Conference on Computer Vision, 550-567, 2022	154	2022
9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios	H Fu, C He, B Chen, Z Yin, Z Zhang, W Zhang, T Zhang, W Xue, W Liu, ...	Proceedings of the International Conference for High Performance Computing …, 2017	148	2017
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model	X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...	arXiv preprint arXiv:2401.16420, 2024	147	2024
InternVid: A large-scale video-text dataset for multimodal understanding and generation	Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...	arXiv preprint arXiv:2307.06942, 2023	146	2023
InternLM-XComposer: A vision-language large model for advanced text-image comprehension and composition	P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...	arXiv preprint arXiv:2309.15112, 2023	135	2023
InternLM2 technical report	Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...	arXiv preprint arXiv:2403.17297, 2024	128	2024
Influence selection for active learning	Z Liu, H Ding, H Zhong, W Li, J Dai, C He	Proceedings of the IEEE/CVF international conference on computer vision …, 2021	89	2021
OPERA: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation	Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	76	2024
SPHINX-X: Scaling data and parameters for a family of multi-modal large language models	D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ...	arXiv preprint arXiv:2402.05935, 2024	73	2024
Think twice before driving: Towards scalable decoders for end-to-end autonomous driving	X Jia, P Wu, L Chen, J Xie, C He, J Yan, H Li	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	65	2023
InternLM-XComposer2-4KHD: A pioneering large vision-language model handling resolutions from 336 pixels to 4K HD	X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...	arXiv preprint arXiv:2404.06512, 2024	60	2024
Beyond hallucinations: Enhancing LVLMs through hallucination-aware direct preference optimization	Z Zhao, B Wang, L Ouyang, X Dong, J Wang, C He	arXiv preprint arXiv:2311.16839, 2023	56	2023
VIGC: Visual instruction generation and correction	B Wang, F Wu, X Han, J Peng, H Zhong, P Zhang, X Dong, W Li, W Li, ...	Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5309-5317, 2024	52	2024
Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution	N Clinton, L Yu, H Fu, C He, P Gong	Remote Sensing 6 (8), 7320-7338, 2014	51	2014
Semantic segmentation based building extraction method using multi-source GIS map datasets and satellite imagery	W Li, C He, J Fang, H Fu	Proceedings of the IEEE conference on computer vision and pattern …, 2018	48	2018
