Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Follow
Conghui He
Conghui He
Shanghai AI Laboratory
Verified email at pjlab.org.cn - Homepage
Title
Cited by
Cited by
Year
Mmbench: Is your multi-modal model an all-around player?
Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ...
European Conference on Computer Vision, 216-233, 2025
5062025
Llama-adapter v2: Parameter-efficient visual instruction model
P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ...
arXiv preprint arXiv:2304.15010, 2023
4662023
Sharegpt4v: Improving large multi-modal models with better captions
L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin
arXiv preprint arXiv:2311.12793, 2023
2812023
Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data
W Li, C He, J Fang, J Zheng, H Fu, L Yu
Remote Sensing 11 (4), 403, 2019
2382019
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
1622024
Persformer: 3d lane detection via perspective transformer and the openlane benchmark
L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng, H Li, C He, J Shi, Y Qiao, ...
European Conference on Computer Vision, 550-567, 2022
1542022
9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios
H Fu, C He, B Chen, Z Yin, Z Zhang, W Zhang, T Zhang, W Xue, W Liu, ...
Proceedings of the International Conference for High Performance Computing …, 2017
1482017
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
1472024
Internvid: A large-scale video-text dataset for multimodal understanding and generation
Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...
arXiv preprint arXiv:2307.06942, 2023
1462023
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ...
arXiv preprint arXiv:2309.15112, 2023
1352023
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
1282024
Influence selection for active learning
Z Liu, H Ding, H Zhong, W Li, J Dai, C He
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
892021
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation
Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
762024
Sphinx-x: Scaling data and parameters for a family of multi-modal large language models
D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ...
arXiv preprint arXiv:2402.05935, 2024
732024
Think twice before driving: Towards scalable decoders for end-to-end autonomous driving
X Jia, P Wu, L Chen, J Xie, C He, J Yan, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
652023
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
arXiv preprint arXiv:2404.06512, 2024
602024
Beyond hallucinations: Enhancing lvlms through hallucination-aware direct preference optimization
Z Zhao, B Wang, L Ouyang, X Dong, J Wang, C He
arXiv preprint arXiv:2311.16839, 2023
562023
Vigc: Visual instruction generation and correction
B Wang, F Wu, X Han, J Peng, H Zhong, P Zhang, X Dong, W Li, W Li, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5309-5317, 2024
522024
Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution
N Clinton, L Yu, H Fu, C He, P Gong
Remote Sensing 6 (8), 7320-7338, 2014
512014
Semantic segmentation based building extraction method using multi-source gis map datasets and satellite imagery
W Li, C He, J Fang, H Fu
Proceedings of the IEEE conference on computer vision and pattern …, 2018
482018
The system can't perform the operation now. Try again later.
Articles 1–20