default search action
Hongsheng Li 0001
Person information
- affiliation: Chinese University of Hong Kong, Department of Electrical Engineering, CUHK-SenseTime Joint Laboratory, Hong Kong
- affiliation (former): Lehigh University, Department of Computer Science and Engineering, PA, USA
Other persons with the same name
- Hongsheng Li — disambiguation page
- Hongsheng Li 0002 — Southeast University, School of Instrument Science and Engineering, Nanjing, China
- Hongsheng Li 0003 — Xidian University, School of Computer Science and Technology, Xi'an, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j57]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. Int. J. Comput. Vis. 132(2): 581-595 (2024) - [j56]Peng Gao, Ziyi Lin, Renrui Zhang, Rongyao Fang, Hongyang Li, Hongsheng Li, Yu Qiao:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. Int. J. Comput. Vis. 132(5): 1546-1556 (2024) - [j55]Keqiang Sun, Shangzhe Wu, Ning Zhang, Zhaoyang Huang, Quan Wang, Hongsheng Li:
CGOF++: Controllable 3D Face Synthesis With Conditional Generative Occupancy Fields. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 913-926 (2024) - [j54]Fangzhou Hong, Lingdong Kong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu:
Unified 3D and 4D Panoptic Segmentation via Dynamic Shifting Networks. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3480-3495 (2024) - [j53]Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 4669-4683 (2024) - [j52]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6402-6415 (2024) - [j51]Jihao Liu, Jinliang Zheng, Boxiao Liu, Yu Liu, Hongsheng Li:
Enhancing Vision-Language Model with Unmasked Token Alignment. Trans. Mach. Learn. Res. 2024 (2024) - [j50]Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao:
LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation. IEEE Trans. Multim. 26: 1158-1168 (2024) - [j49]Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li:
Pyramid Fusion Transformer for Semantic Segmentation. IEEE Trans. Multim. 26: 9630-9643 (2024) - [j48]Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, Xiaogang Wang, Hongsheng Li:
Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID. IEEE Trans. Neural Networks Learn. Syst. 35(1): 258-271 (2024) - [c205]Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li:
Empowering Character-level Text Infilling by Eliminating Sub-Tokens. ACL (1) 2024: 3253-3267 - [c204]Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li:
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models. ACL (1) 2024: 6159-6172 - [c203]Xiaoliang Ju, Zhaoyang Huang, Yijiin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffInDScene: Diffusion-Based High-Quality 3D Indoor Scene Generation. CVPR 2024: 4526-4535 - [c202]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CVPR 2024: 5652-5661 - [c201]Hao Shao, Yuxuan Hu, Letian Wang, Guanglu Song, Steven L. Waslander, Yu Liu, Hongsheng Li:
LMDrive: Closed-Loop End-to-End Driving with Large Language Models. CVPR 2024: 15120-15130 - [c200]Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu:
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction. CVPR 2024: 15281-15290 - [c199]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CVPR 2024: 16426-16435 - [c198]Jihao Liu, Jinliang Zheng, Yu Liu, Hongsheng Li:
GLID: Pre-training a Generalist Encoder-Decoder Vision Model. CVPR 2024: 22851-22860 - [c197]Yijin Li, Yichen Shen, Zhaoyang Huang, Shuo Chen, Weikang Bian, Xiaoyu Shi, Fu-Yun Wang, Keqiang Sun, Hujun Bao, Zhaopeng Cui, Guofeng Zhang, Hongsheng Li:
BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation Using RGB Frames and Events. ECCV (67) 2024: 19-36 - [c196]Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. ECCV (62) 2024: 36-55 - [c195]Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang:
GiT: Towards Generalist Vision Transformer Through Universal Language Interface. ECCV (29) 2024: 55-73 - [c194]Keqiang Sun, Dor Litvak, Yunzhi Zhang, Hongsheng Li, Jiajun Wu, Shangzhe Wu:
Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos. ECCV (1) 2024: 100-119 - [c193]Xiaoshi Wu, Yiming Hao, Manyuan Zhang, Keqiang Sun, Zhaoyang Huang, Guanglu Song, Yu Liu, Hongsheng Li:
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models. ECCV (83) 2024: 108-124 - [c192]Benjin Zhu, Zhe Wang, Hongsheng Li:
nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding. ECCV (5) 2024: 125-141 - [c191]Manyuan Zhang, Guanglu Song, Xiaoyu Shi, Yu Liu, Hongsheng Li:
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks. ECCV (42) 2024: 128-145 - [c190]Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li:
Be-Your-Outpainter: Mastering Video Outpainting Through Input-Specific Adaptation. ECCV (44) 2024: 153-168 - [c189]Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Yu Qiao, Peng Gao, Hongsheng Li:
MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? ECCV (8) 2024: 169-186 - [c188]Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li:
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis. ECCV (12) 2024: 196-212 - [c187]Fu-Yun Wang, Zhaoyang Huang, Qiang Ma, Guanglu Song, Xudong Lu, Weikang Bian, Yijin Li, Yu Liu, Hongsheng Li:
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model. ECCV (45) 2024: 329-345 - [c186]Yiwen Tang, Ray Zhang, Jiaming Liu, Zoey Guo, Bin Zhao, Zhigang Wang, Peng Gao, Hongsheng Li, Dong Wang, Xuelong Li:
Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding. ECCV (36) 2024: 456-473 - [c185]Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu:
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process. ICLR 2024 - [c184]Ke Wang, Houxing Ren, Aojun Zhou, Zimu Lu, Sichun Luo, Weikang Shi, Renrui Zhang, Linqi Song, Mingjie Zhan, Hongsheng Li:
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning. ICLR 2024 - [c183]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. ICLR 2024 - [c182]Renrui Zhang, Jiaming Han, Chris Liu, Aojun Zhou, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention. ICLR 2024 - [c181]Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song, Mingjie Zhan, Hongsheng Li:
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification. ICLR 2024 - [c180]Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Yu Qiao, Hongsheng Li, Peng Gao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. ICML 2024 - [c179]Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. ICML 2024 - [c178]Tao Ma, Zhiwei Zheng, Hongbin Zhou, Xinyu Cai, Xuemeng Yang, Yikang Li, Botian Shi, Hongsheng Li:
VeloVox: A Low-Cost and Accurate 4D Object Detector with Single-Frame Point Cloud of Livox LiDAR. ICRA 2024: 1992-1998 - [c177]Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling. SIGGRAPH (Conference Paper Track) 2024: 111 - [c176]Fu-Yun Wang, Zhaoyang Huang, Weikang Bian, Xiaoyu Shi, Keqiang Sun, Guanglu Song, Yu Liu, Hongsheng Li:
AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data. SIGGRAPH Asia Technical Communications 2024: 23:1-23:5 - [i268]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CoRR abs/2401.06197 (2024) - [i267]Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie Zhou, Hongsheng Li, Yu Qiao, Jifeng Dai:
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer. CoRR abs/2401.10208 (2024) - [i266]Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling. CoRR abs/2401.15977 (2024) - [i265]Fu-Yun Wang, Zhaoyang Huang, Xiaoyu Shi, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li:
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning. CoRR abs/2402.00769 (2024) - [i264]Peng Gao, Renrui Zhang, Chris Liu, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao:
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models. CoRR abs/2402.05935 (2024) - [i263]Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li:
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models. CoRR abs/2402.14800 (2024) - [i262]Ke Wang, Junting Pan, Weikang Shi, Zimu Lu, Mingjie Zhan, Hongsheng Li:
Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset. CoRR abs/2402.14804 (2024) - [i261]Zimu Lu, Aojun Zhou, Houxing Ren, Ke Wang, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li:
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs. CoRR abs/2402.16352 (2024) - [i260]Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang:
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures. CoRR abs/2403.02308 (2024) - [i259]Haiyang Wang, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang:
GiT: Towards Generalist Vision Transformer through Universal Language Interface. CoRR abs/2403.09394 (2024) - [i258]Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong:
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models. CoRR abs/2403.11289 (2024) - [i257]Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu:
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction. CoRR abs/2403.11492 (2024) - [i256]Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li:
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis. CoRR abs/2403.12963 (2024) - [i255]Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li:
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation. CoRR abs/2403.13745 (2024) - [i254]Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li:
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? CoRR abs/2403.14624 (2024) - [i253]Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li:
Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models. CoRR abs/2403.16999 (2024) - [i252]Sicheng Li, Keqiang Sun, Zhixin Lai, Xiaoshi Wu, Feng Qiu, Haoran Xie, Kazunori Miyata, Hongsheng Li:
ECNet: Effective Controllable Text-to-Image Diffusion Models. CoRR abs/2403.18417 (2024) - [i251]Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024) - [i250]Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li:
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching. CoRR abs/2404.03653 (2024) - [i249]Fan Lu, Kwan-Yee Lin, Yan Xu, Hongsheng Li, Guang Chen, Changjun Jiang:
Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior. CoRR abs/2404.06780 (2024) - [i248]Jihao Liu, Jinliang Zheng, Yu Liu, Hongsheng Li:
GLID: Pre-training a Generalist Encoder-Decoder Vision Model. CoRR abs/2404.07603 (2024) - [i247]Zhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu:
MoVA: Adapting Mixture of Vision Experts to Multimodal Context. CoRR abs/2404.13046 (2024) - [i246]Xiaoshi Wu, Yiming Hao, Manyuan Zhang, Keqiang Sun, Zhaoyang Huang, Guanglu Song, Yu Liu, Hongsheng Li:
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models. CoRR abs/2405.00760 (2024) - [i245]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024) - [i244]Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li:
TerDiT: Ternary Diffusion Models with Transformers. CoRR abs/2405.14854 (2024) - [i243]Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li:
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models. CoRR abs/2405.16057 (2024) - [i242]Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Aojun Zhou, Junting Pan, Hongsheng Li:
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation. CoRR abs/2405.17057 (2024) - [i241]Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li:
Empowering Character-level Text Infilling by Eliminating Sub-Tokens. CoRR abs/2405.17103 (2024) - [i240]Fu-Yun Wang, Zhaoyang Huang, Alexander William Bergman, Dazhong Shen, Peng Gao, Michael Lingelbach, Keqiang Sun, Weikang Bian, Guanglu Song, Yu Liu, Hongsheng Li, Xiaogang Wang:
Phased Consistency Model. CoRR abs/2405.18407 (2024) - [i239]Jihao Liu, Jinliang Zheng, Boxiao Liu, Yu Liu, Hongsheng Li:
Enhancing Vision-Language Model with Unmasked Token Alignment. CoRR abs/2405.19009 (2024) - [i238]Chenxin Tao, Xizhou Zhu, Shiqian Su, Lewei Lu, Changyao Tian, Xuan Luo, Gao Huang, Hongsheng Li, Yu Qiao, Jie Zhou, Jifeng Dai:
Learning 1D Causal Visual Representation with De-focus Attention Networks. CoRR abs/2406.04342 (2024) - [i237]Siyuan Huang, Haonan Chang, Yuhan Liu, Yimeng Zhu, Hao Dong, Peng Gao, Abdeslam Boularias, Hongsheng Li:
A3VLM: Actionable Articulation-Aware Vision Language Model. CoRR abs/2406.07549 (2024) - [i236]Yuan Pu, Yazhe Niu, Jiyuan Ren, Zhenjie Yang, Hongsheng Li, Yu Liu:
UniZero: Generalized and Efficient Planning with Scalable Latent World Models. CoRR abs/2406.10667 (2024) - [i235]Bingqi Ma, Zhuofan Zong, Guanglu Song, Hongsheng Li, Yu Liu:
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models. CoRR abs/2406.11831 (2024) - [i234]Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024) - [i233]Zimu Lu, Aojun Zhou, Ke Wang, Houxing Ren, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li:
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning. CoRR abs/2407.00782 (2024) - [i232]Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024) - [i231]Yuxiang Chai, Siyuan Huang, Yazhe Niu, Han Xiao, Liang Liu, Dingyu Zhang, Peng Gao, Shuai Ren, Hongsheng Li:
AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents. CoRR abs/2407.17490 (2024) - [i230]Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. CoRR abs/2408.02657 (2024) - [i229]Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation. CoRR abs/2408.15881 (2024) - [i228]Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanmin Wu, Jiayi Lei, Pengshuo Qiu, Pan Lu, Zehui Chen, Guanglu Song, Peng Gao, Yu Liu, Chunyuan Li, Hongsheng Li:
MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines. CoRR abs/2409.12959 (2024) - [i227]Weifeng Lin, Xinyu Wei, Renrui Zhang, Le Zhuo, Shitian Zhao, Siyuan Huang, Junlin Xi, Yu Qiao, Peng Gao, Hongsheng Li:
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions. CoRR abs/2409.15278 (2024) - [i226]Xin Li, Siyuan Huang, Qiaojun Yu, Zhengkai Jiang, Ce Hao, Yimeng Zhu, Hongsheng Li, Peng Gao, Cewu Lu:
SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation. CoRR abs/2409.18082 (2024) - [i225]Lijian Xu, Hao Sun, Ziyu Ni, Hongsheng Li, Shaoting Zhang:
MedViLaM: A multimodal large language model with advanced generalizability and explainability for medical data understanding and generation. CoRR abs/2409.19684 (2024) - [i224]Qiaojun Yu, Siyuan Huang, Xibin Yuan, Zhengkai Jiang, Ce Hao, Xin Li, Haonan Chang, Junbo Wang, Liu Liu, Hongsheng Li, Peng Gao, Cewu Lu:
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models. CoRR abs/2409.20551 (2024) - [i223]Wei Huang, Yue Liao, Jianhui Liu, Ruifei He, Haoru Tan, Shiming Zhang, Hongsheng Li, Si Liu, Xiaojuan Qi:
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More. CoRR abs/2410.06270 (2024) - [i222]Xiangyu Wang, Donglin Yang, Ziqin Wang, Hohin Kwan, Jinyu Chen, Wenjun Wu, Hongsheng Li, Yue Liao, Si Liu:
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology. CoRR abs/2410.07087 (2024) - [i221]Fu-Yun Wang, Ling Yang, Zhaoyang Huang, Mengdi Wang, Hongsheng Li:
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow. CoRR abs/2410.07303 (2024) - [i220]Ruoyi Du, Dongyang Liu, Le Zhuo, Qin Qi, Hongsheng Li, Zhanyu Ma, Peng Gao:
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow. CoRR abs/2410.07536 (2024) - [i219]Guankun Wang, Han Xiao, Huxin Gao, Renrui Zhang, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li, Hongliang Ren:
CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection. CoRR abs/2410.07540 (2024) - [i218]Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu:
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction. CoRR abs/2410.08669 (2024) - [i217]Lijian Xu, Ziyu Ni, Hao Sun, Hongsheng Li, Shaoting Zhang:
A foundation model for generalizable disease diagnosis in chest X-ray images. CoRR abs/2410.08861 (2024) - [i216]Rongyao Fang, Chengqi Duan, Kun Wang, Hao Li, Hao Tian, Xingyu Zeng, Rui Zhao, Jifeng Dai, Hongsheng Li, Xihui Liu:
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation. CoRR abs/2410.13861 (2024) - [i215]Yijin Li, Yichen Shen, Zhaoyang Huang, Shuo Chen, Weikang Bian, Xiaoyu Shi, Fu-Yun Wang, Keqiang Sun, Hujun Bao, Zhaopeng Cui, Guofeng Zhang, Hongsheng Li:
BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events. CoRR abs/2410.20451 (2024) - [i214]Yitong Dong, Yijin Li, Zhaoyang Huang, Weikang Bian, Jingbo Liu, Hujun Bao, Zhaopeng Cui, Hongsheng Li, Guofeng Zhang:
A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding. CoRR abs/2411.01893 (2024) - 2023
- [j47]Changjuan Tao, Difei Gu, Rui Huang, Ling Zhou, Zhiqiang Hu, Yuanyuan Chen, Xiaofan Zhang, Hongsheng Li:
Hippocampus segmentation after brain tumor resection via postoperative region synthesis. BMC Medical Imaging 23(1): 142 (2023) - [j46]Shaoshuai Shi, Li Jiang, Jiajun Deng, Zhe Wang, Chaoxu Guo, Jianping Shi, Xiaogang Wang, Hongsheng Li:
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection. Int. J. Comput. Vis. 131(2): 531-551 (2023) - [j45]Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
3D Object Detection for Autonomous Driving: A Comprehensive Survey. Int. J. Comput. Vis. 131(8): 1909-1963 (2023) - [j44]Peipei Zhao, Qiguang Miao, Hongsheng Li, Ruyi Liu, Yi-Ning Quan, Jianfeng Song:
Refined probability distribution module for fine-grained visual categorization. Neurocomputing 518: 533-544 (2023) - [j43]Xianying He, Jiahui Li, Fang Yan, Linlin Wang, Wen Chen, Xiaodi Huang, Zhiqiang Hu, Qi Duan, Hongsheng Li, Shaoting Zhang, Jie Zhao:
Predicting cancer outcomes from whole slide images via hybrid supervision learning. Neurocomputing 557: 126736 (2023) - [j42]Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi:
ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6354-6371 (2023) - [j41]Jianbo Liu, Junjun He, Yuanjie Zheng, Shuai Yi, Xiaogang Wang, Hongsheng Li:
A Holistically-Guided Decoder for Deep Representation Learning With Applications to Semantic Segmentation and Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 11390-11406 (2023) - [j40]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-Attention for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12581-12600 (2023) - [j39]Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li:
Teach-DETR: Better Training DETR With Teachers. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15759-15771 (2023) - [c175]Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. CVPR 2023: 1599-1610 - [c174]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CVPR 2023: 2691-2700 - [c173]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CVPR 2023: 5344-5353 - [c172]Jihao Liu, Xin Huang, Jinliang Zheng, Yu Liu, Hongsheng Li:
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers. CVPR 2023: 6252-6261 - [c171]Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li:
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CVPR 2023: 7031-7040 - [c170]Benjin Zhu, Zhe Wang, Shaoshuai Shi, Hang Xu, Lanqing Hong, Hongsheng Li:
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection. CVPR 2023: 9296-9305 - [c169]Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
A Simple Baseline for Video Restoration with Grouped Spatial-Temporal Shift. CVPR 2023: 9822-9832 - [c168]Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu:
ReasonNet: End-to-End Driving with Temporal and Global Reasoning. CVPR 2023: 13723-13733 - [c167]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CVPR 2023: 14408-14419 - [c166]Chen Gao, Xingyu Peng, Mi Yan, He Wang, Lirong Yang, Haibing Ren, Hongsheng Li, Si Liu:
Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation. CVPR 2023: 14911-14920 - [c165]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Yu Qiao, Peng Gao, Hongsheng Li:
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners. CVPR 2023: 15211-15222 - [c164]Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
PATS: Patch Area Transportation with Subdivision for Local Feature Matching. CVPR 2023: 17776-17786 - [c163]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-Trained Models via Image-to-Point Masked Autoencoders. CVPR 2023: 21769-21780 - [c162]Jingqiu Zhou, Linjiang Huang, Liang Wang, Si Liu, Hongsheng Li:
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels. CVPR 2023: 23003-23012 - [c161]Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang:
Urban Radiance Field Representation with Deformable Neural Mesh Primitives. ICCV 2023: 465-476 - [c160]Xiaoshi Wu, Keqiang Sun, Feng Zhu, Rui Zhao, Hongsheng Li:
Human Preference Score: Better Aligning Text-to-image Models with Human Preference. ICCV 2023: 2096-2105 - [c159]Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu:
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction. ICCV 2023: 3758-3767 - [c158]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection. ICCV 2023: 6578-6587 - [c157]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. ICCV 2023: 6713-6724 - [c156]Renrui Zhang, Han Qiu, Tai Wang, Ziyu Guo, Ziteng Cui, Yu Qiao, Hongsheng Li, Peng Gao:
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection. ICCV 2023: 9121-9132 - [c155]Jiawei Yao, Chuming Li, Keqiang Sun, Yingjie Cai, Hao Li, Wanli Ouyang, Hongsheng Li:
NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space. ICCV 2023: 9421-9431 - [c154]Jinyu Chen, Wenguan Wang, Si Liu, Hongsheng Li, Yi Yang:
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation. ICCV 2023: 10959-10969 - [c153]Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation. ICCV 2023: 12435-12446 - [c152]Siming Fan, Jingtan Piao, Chen Qian, Hongsheng Li, Kwan-Yee Lin:
Simulating Fluids in Real-World Still Images. ICCV 2023: 15876-15885 - [c151]Aojun Zhou, Yang Li, Zipeng Qin, Jianbo Liu, Junting Pan, Renrui Zhang, Rui Zhao, Peng Gao, Hongsheng Li:
SparseMAE: Sparse Training Meets Masked Autoencoders. ICCV 2023: 16130-16140 - [c150]Jihao Liu, Tai Wang, Boxiao Liu, Qihang Zhang, Yu Liu, Hongsheng Li:
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding. ICCV 2023: 17793-17803 - [c149]Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li:
TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses. ICCV 2023: 18481-18490 - [c148]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. ICCV (Workshops) 2023: 272-283 - [c147]Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
BlinkFlow: A Dataset to Push the Limits of Event-Based Optical Flow Estimation. IROS 2023: 3881-3888 - [c146]Siyuan Huang, Bo Zhang, Botian Shi, Hongsheng Li, Yikang Li, Peng Gao:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. ACM Multimedia 2023: 8644-8652 - [c145]Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yitong Dong, Yijin Li, Hongsheng Li:
Context-PIPs: Persistent Independent Particles Demands Context Features. NeurIPS 2023 - [c144]Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu:
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios. NeurIPS 2023 - [c143]Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023 - [c142]Yi Zhang, Xiaoyu Shi, Dasong Li, Xiaogang Wang, Jian Wang, Hongsheng Li:
A Unified Conditional Framework for Diffusion-based Image Restoration. NeurIPS 2023 - [c141]Ziyu Ni, Linda Wei, Lijian Xu, Qing Xia, Hongsheng Li, Shaoting Zhang, Dimitris N. Metaxas:
Voxel2Hemodynamics: An End-to-End Deep Learning Method for Predicting Coronary Artery Hemodynamics. STACOM@MICCAI 2023: 15-24 - [c140]Xiaoyu Yang, Lijian Xu, Simon Yu, Qing Xia, Hongsheng Li, Shaoting Zhang:
Geometry-Based End-to-End Segmentation of Coronary Artery in Computed Tomography Angiography. TML4H 2023: 190-196 - [i213]Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. CoRR abs/2303.01237 (2023) - [i212]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023) - [i211]Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao:
Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners. CoRR abs/2303.02151 (2023) - [i210]Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu:
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking. CoRR abs/2303.05475 (2023) - [i209]Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
PATS: Patch Area Transportation with Subdivision for Local Feature Matching. CoRR abs/2303.07700 (2023) - [i208]Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang:
BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation. CoRR abs/2303.07716 (2023) - [i207]Renrui Zhang, Liuhui Wang, Yali Wang, Peng Gao, Hongsheng Li, Jianbo Shi:
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis. CoRR abs/2303.08134 (2023) - [i206]Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation. CoRR abs/2303.08340 (2023) - [i205]Jihao Liu, Tai Wang, Boxiao Liu, Qihang Zhang, Yu Liu, Hongsheng Li:
Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding. CoRR abs/2303.11325 (2023) - [i204]Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li:
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CoRR abs/2303.13076 (2023) - [i203]Xiaoshi Wu, Keqiang Sun, Feng Zhu, Rui Zhao, Hongsheng Li:
Better Aligning Text-to-Image Models with Human Preference. CoRR abs/2303.14420 (2023) - [i202]Renrui Zhang, Jiaming Han, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Peng Gao, Yu Qiao:
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention. CoRR abs/2303.16199 (2023) - [i201]Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu:
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction. CoRR abs/2304.00967 (2023) - [i200]Jingqiu Zhou, Linjiang Huang, Liang Wang, Si Liu, Hongsheng Li:
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels. CoRR abs/2304.07978 (2023) - [i199]Xiaoliang Ju, Yiyang Sun, Yiming Hao, Yikang Li, Yu Qiao, Hongsheng Li:
Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles. CoRR abs/2304.09365 (2023) - [i198]Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao:
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model. CoRR abs/2304.15010 (2023) - [i197]Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li:
Personalize Segment Anything Model with One Shot. CoRR abs/2305.03048 (2023) - [i196]Xiaoyu Yang, Lijian Xu, Simon Yu, Qing Xia, Hongsheng Li, Shaoting Zhang:
Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network. CoRR abs/2305.04208 (2023) - [i195]Siyuan Huang, Bo Zhang, Botian Shi, Peng Gao, Yikang Li, Hongsheng Li:
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. CoRR abs/2305.09160 (2023) - [i194]Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu:
ReasonNet: End-to-End Driving with Temporal and Global Reasoning. CoRR abs/2305.10507 (2023) - [i193]Siyuan Huang, Zhengkai Jiang, Hao Dong, Yu Qiao, Peng Gao, Hongsheng Li:
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model. CoRR abs/2305.11176 (2023) - [i192]Fu-Yun Wang, Wenshuo Chen, Guanglu Song, Han-Jia Ye, Yu Liu, Hongsheng Li:
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising. CoRR abs/2305.18264 (2023) - [i191]Ziyu Ni, Linda Wei, Lijian Xu, Simon Yu, Qing Xia, Hongsheng Li, Shaoting Zhang:
Voxel2Hemodynamics: An End-to-end Deep Learning Method for Predicting Coronary Artery Hemodynamics. CoRR abs/2305.19107 (2023) - [i190]Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li:
DiffRoom: Diffusion-based High-Quality 3D Room Reconstruction and Generation with Occupancy Prior. CoRR abs/2306.00519 (2023) - [i189]Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang:
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling. CoRR abs/2306.01721 (2023) - [i188]Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yitong Dong, Yijin Li, Hongsheng Li:
Context-TAP: Tracking Any Point Demands Spatial Context Features. CoRR abs/2306.02000 (2023) - [i187]Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu:
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process. CoRR abs/2306.05423 (2023) - [i186]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li:
FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow. CoRR abs/2306.05442 (2023) - [i185]Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li:
TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses. CoRR abs/2306.05888 (2023) - [i184]Tao Ma, Xuemeng Yang, Hongbin Zhou, Xin Li, Botian Shi, Junjie Liu, Yuchen Yang, Zhizheng Liu, Liang He, Yu Qiao, Yikang Li, Hongsheng Li:
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds. CoRR abs/2306.06023 (2023) - [i183]Xiaoshi Wu, Yiming Hao, Keqiang Sun, Yixiong Chen, Feng Zhu, Rui Zhao, Hongsheng Li:
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis. CoRR abs/2306.09341 (2023) - [i182]Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li:
Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models. CoRR abs/2306.11732 (2023) - [i181]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023) - [i180]Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang:
Urban Radiance Field Representation with Deformable Neural Mesh Primitives. CoRR abs/2307.10776 (2023) - [i179]Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue:
Meta-Transformer: A Unified Framework for Multimodal Learning. CoRR abs/2307.10802 (2023) - [i178]Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023) - [i177]Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song, Mingjie Zhan, Hongsheng Li:
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification. CoRR abs/2308.07921 (2023) - [i176]Jinyu Chen, Wenguan Wang, Si Liu, Hongsheng Li, Yi Yang:
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation. CoRR abs/2308.10306 (2023) - [i175]Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng:
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following. CoRR abs/2309.00615 (2023) - [i174]Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao:
ImageBind-LLM: Multi-modality Instruction Tuning. CoRR abs/2309.03905 (2023) - [i173]Jiawei Yao, Chuming Li, Keqiang Sun, Yingjie Cai, Hao Li, Wanli Ouyang, Hongsheng Li:
NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space. CoRR abs/2309.14616 (2023) - [i172]Ke Wang, Houxing Ren, Aojun Zhou, Zimu Lu, Sichun Luo, Weikang Shi, Renrui Zhang, Linqi Song, Mingjie Zhan, Hongsheng Li:
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning. CoRR abs/2310.03731 (2023) - [i171]Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu:
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios. CoRR abs/2310.08348 (2023) - [i170]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection. CoRR abs/2310.15955 (2023) - [i169]Manyuan Zhang, Bingqi Ma, Guanglu Song, Yunxiao Wang, Hongsheng Li, Yu Liu:
Towards Large-scale Masked Face Recognition. CoRR abs/2310.16364 (2023) - [i168]Lijian Xu, Ziyu Ni, Xinglong Liu, Xiaosong Wang, Hongsheng Li, Shaoting Zhang:
Learning A Multi-Task Transformer Via Unified And Customized Instruction Tuning For Chest Radiograph Interpretation. CoRR abs/2311.01092 (2023) - [i167]Ziyi Lin, Chris Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Chen Lin, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Hongsheng Li, Yu Qiao:
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models. CoRR abs/2311.07575 (2023) - [i166]Xiaoyu Yang, Lijian Xu, Hongsheng Li, Shaoting Zhang:
ViLaM: A Vision-Language Model with Enhanced Visual Grounding and Generalization Capability. CoRR abs/2311.12327 (2023) - [i165]Rongyao Fang, Shilin Yan, Zhaoyang Huang, Jingqiu Zhou, Hao Tian, Jifeng Dai, Hongsheng Li:
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation. CoRR abs/2311.18835 (2023) - [i164]Hao Shao, Yuxuan Hu, Letian Wang, Steven L. Waslander, Yu Liu, Hongsheng Li:
LMDrive: Closed-Loop End-to-End Driving with Large Language Models. CoRR abs/2312.07488 (2023) - [i163]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CoRR abs/2312.09238 (2023) - [i162]Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun:
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise. CoRR abs/2312.12436 (2023) - [i161]Keqiang Sun, Dor Litvak, Yunzhi Zhang, Hongsheng Li, Jiajun Wu, Shangzhe Wu:
Ponymation: Learning 3D Animal Motions from Unlabeled Online Videos. CoRR abs/2312.13604 (2023) - 2022
- [j38]Yiwei Yang, Rui Huang, Guofeng Lv, Zhiqiang Hu, Guoping Shan, Jie Zhang, Xue Bai, Peng Liu, Hongsheng Li, Ming Chen:
Automatic segmentation of the clinical target volume and organs at risk for rectal cancer radiotherapy using structure-contextual representations based on 3D high-resolution network. Biomed. Signal Process. Control. 73: 103362 (2022) - [j37]Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network. Int. J. Comput. Vis. 130(8): 2060-2080 (2022) - [j36]Qian Da, Xiaodi Huang, Zhongyu Li, Yanfei Zuo, Chenbin Zhang, Jingxin Liu, Wen Chen, Jiahui Li, Dou Xu, Zhiqiang Hu, Hongmei Yi, Yan Guo, Zhe Wang, Ling Chen, Li Zhang, Xianying He, Xiaofan Zhang, Ke Mei, Chuang Zhu, Weizeng Lu, Linlin Shen, Jun Shi, Jun Li, Sreehari S, Ganapathy Krishnamurthi, Jiangcheng Yang, Tiancheng Lin, Qingyu Song, Xuechen Liu, Simon Graham, Raja Muhammad Saad Bashir, Canqian Yang, Shaofei Qin, Xinmei Tian, Baocai Yin, Jie Zhao, Dimitris N. Metaxas, Hongsheng Li, Chaofu Wang, Shaoting Zhang:
DigestPath: A benchmark dataset with challenge review for the pathological detection and segmentation of digestive-system. Medical Image Anal. 80: 102485 (2022) - [j35]Yuanjie Zheng, Xiaodan Sui, Yanyun Jiang, Tongtong Che, Shaoting Zhang, Jie Yang, Hongsheng Li:
SymReg-GAN: Symmetric Image Registration With Generative Adversarial Networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5631-5646 (2022) - [j34]Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Wei Li, Yuexin Ma, Hongsheng Li, Ruigang Yang, Dahua Lin:
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6807-6822 (2022) - [j33]Yan Xu, Junyi Lin, Jianping Shi, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
Robust Self-Supervised LiDAR Odometry Via Representative Structure Discovery and 3D Inherent Error Modeling. IEEE Robotics Autom. Lett. 7(2): 1651-1658 (2022) - [j32]Linjiang Huang, Liang Wang, Hongsheng Li:
Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization. IEEE Trans. Image Process. 31: 1504-1519 (2022) - [j31]Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li:
NeuralMarker: A Framework for Learning General Marker Correspondence. ACM Trans. Graph. 41(6): 271:1-271:10 (2022) - [c139]Lin Ma, Weiming Li, Hongsheng Li, Qiang Wang, Ji-Yeon Kim:
Task Generalizable Spatial and Texture Aware Image Downsizing Network. BMVC 2022: 315 - [c138]Teli Ma, Shijie Geng, Mengmeng Wang, Sheng Xu, Hongsheng Li, Baochang Zhang, Peng Gao, Yu Qiao:
Unleashing the Potential of Vision-Language Models for Long-Tailed Visual Recognition. BMVC 2022: 481 - [c137]Hao Shao, Letian Wang, Ruobing Chen, Hongsheng Li, Yu Liu:
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer. CoRL 2022: 726-737 - [c136]Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. CVPR 2022: 999-1008 - [c135]Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang:
RBGNet: Ray-based Grouping for 3D Object Detection. CVPR 2022: 1100-1109 - [c134]Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
IDR: Self-Supervised Image Denoising via Iterative Data Refinement. CVPR 2022: 2088-2097 - [c133]Linjiang Huang, Liang Wang, Hongsheng Li:
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation. CVPR 2022: 3262-3271 - [c132]Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li:
Learning a Structured Latent Space for Unsupervised Point Cloud Completion. CVPR 2022: 5533-5543 - [c131]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CVPR 2022: 8542-8552 - [c130]Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization. CVPR 2022: 14860-14870 - [c129]Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Hongsheng Li, Xiaohua Wang, Jifeng Dai:
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks. CVPR 2022: 16783-16794 - [c128]Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu:
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP. ECCV (21) 2022: 33-49 - [c127]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers. ECCV (11) 2022: 294-311 - [c126]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c125]Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu:
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers. ECCV (26) 2022: 455-471 - [c124]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c123]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. ECCV (17) 2022: 668-685 - [c122]Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li:
MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection. ECCV (8) 2022: 680-697 - [c121]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Towards Robust Face Recognition with Comprehensive Search. ECCV (12) 2022: 720-736 - [c120]Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Learning Degradation Representations for Image Deblurring. ECCV (18) 2022: 736-753 - [c119]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning. ICLR 2022 - [c118]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c117]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning. NeurIPS 2022 - [c116]Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, Hongsheng Li:
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. NeurIPS 2022 - [c115]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. NeurIPS 2022 - [c114]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. NeurIPS 2022 - [i160]Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li:
Pyramid Fusion Transformer for Semantic Segmentation. CoRR abs/2201.04019 (2022) - [i159]Kunchang Li, Yali Wang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning. CoRR abs/2201.04676 (2022) - [i158]Kunchang Li, Yali Wang, Junhao Zhang, Peng Gao, Guanglu Song, Yu Liu, Hongsheng Li, Yu Qiao:
UniFormer: Unifying Convolution and Self-attention for Visual Recognition. CoRR abs/2201.09450 (2022) - [i157]Kexue Fu, Peng Gao, Renrui Zhang, Hongsheng Li, Yu Qiao, Manning Wang:
Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning. CoRR abs/2202.04241 (2022) - [i156]Jihao Liu, Boxiao Liu, Hongsheng Li, Yu Liu:
Meta Knowledge Distillation. CoRR abs/2202.07940 (2022) - [i155]Yan Xu, Junyi Lin, Jianping Shi, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
Robust Self-Supervised LiDAR Odometry via Representative Structure Discovery and 3D Inherent Error Modeling. CoRR abs/2202.13353 (2022) - [i154]Linjiang Huang, Liang Wang, Hongsheng Li:
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation. CoRR abs/2203.02925 (2022) - [i153]Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu:
LiDAR-based 4D Panoptic Segmentation via Dynamic Shifting Network. CoRR abs/2203.07186 (2022) - [i152]Yan Xu, Kwan-Yee Lin, Guofeng Zhang, Xiaogang Wang, Hongsheng Li:
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization. CoRR abs/2203.12870 (2022) - [i151]Renrui Zhang, Han Qiu, Tai Wang, Xuanzhuo Xu, Ziyu Guo, Yu Qiao, Peng Gao, Hongsheng Li:
MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection. CoRR abs/2203.13310 (2022) - [i150]Yingjie Cai, Kwan-Yee Lin, Chao Zhang, Qiang Wang, Xiaogang Wang, Hongsheng Li:
Learning a Structured Latent Space for Unsupervised Point Cloud Completion. CoRR abs/2203.15580 (2022) - [i149]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. CoRR abs/2203.16194 (2022) - [i148]Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang:
RBGNet: Ray-based Grouping for 3D Object Detection. CoRR abs/2204.02251 (2022) - [i147]Siming Fan, Jingtan Piao, Chen Qian, Kwan-Yee Lin, Hongsheng Li:
Simulating Fluids in Real-World Still Images. CoRR abs/2204.11335 (2022) - [i146]Wei Cheng, Su Xu, Jingtan Piao, Chen Qian, Wayne Wu, Kwan-Yee Lin, Hongsheng Li:
Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis. CoRR abs/2204.11798 (2022) - [i145]Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martínez:
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers. CoRR abs/2205.03436 (2022) - [i144]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i143]Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network. CoRR abs/2205.04721 (2022) - [i142]Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li:
MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection. CoRR abs/2205.05979 (2022) - [i141]Jihao Liu, Xin Huang, Yu Liu, Hongsheng Li:
MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning. CoRR abs/2205.13137 (2022) - [i140]Renrui Zhang, Ziyu Guo, Peng Gao, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li:
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training. CoRR abs/2205.14401 (2022) - [i139]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. CoRR abs/2206.04674 (2022) - [i138]Keqiang Sun, Shangzhe Wu, Zhaoyang Huang, Ning Zhang, Quan Wang, Hongsheng Li:
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. CoRR abs/2206.08361 (2022) - [i137]Jiageng Mao, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
3D Object Detection for Autonomous Driving: A Review and New Outlooks. CoRR abs/2206.09474 (2022) - [i136]Dasong Li, Xiaoyu Shi, Yi Zhang, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
No Attention is Needed: Grouped Spatial-temporal Shift for Simple and Efficient Video Restorers. CoRR abs/2206.10810 (2022) - [i135]Junting Pan, Ziyi Lin, Xiatian Zhu, Jing Shao, Hongsheng Li:
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition. CoRR abs/2206.13559 (2022) - [i134]Jihao Liu, Xin Huang, Guanglu Song, Yu Liu, Hongsheng Li:
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP. CoRR abs/2207.05420 (2022) - [i133]Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu:
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers. CoRR abs/2207.08409 (2022) - [i132]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i131]Hao Shao, Letian Wang, Ruobing Chen, Hongsheng Li, Yu Liu:
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer. CoRR abs/2207.14024 (2022) - [i130]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i129]Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
Learning Degradation Representations for Image Deblurring. CoRR abs/2208.05244 (2022) - [i128]Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li:
Towards Robust Face Recognition with Comprehensive Search. CoRR abs/2208.13600 (2022) - [i127]Zhe Wang, Hongsheng Li, Qinwei Zhang, Jing Yuan, Xiaogang Wang:
Magnetic Resonance Fingerprinting with compressed sensing and distance metric learning. CoRR abs/2209.08734 (2022) - [i126]Zhaoyang Huang, Xiaokun Pan, Weihong Pan, Weikang Bian, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li:
NeuralMarker: A Framework for Learning General Marker Correspondence. CoRR abs/2209.08896 (2022) - [i125]Renrui Zhang, Hanqiu Deng, Bohao Li, Wei Zhang, Hao Dong, Hongsheng Li, Peng Gao, Yu Qiao:
Collaboration of Pre-trained Models Makes Better Few-shot Learner. CoRR abs/2209.12255 (2022) - [i124]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CoRR abs/2211.05778 (2022) - [i123]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CoRR abs/2211.09808 (2022) - [i122]Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li:
Teach-DETR: Better Training DETR with Teachers. CoRR abs/2211.11953 (2022) - [i121]Keqiang Sun, Shangzhe Wu, Ning Zhang, Zhaoyang Huang, Quan Wang, Hongsheng Li:
CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields. CoRR abs/2211.13251 (2022) - [i120]Renrui Zhang, Liuhui Wang, Yu Qiao, Peng Gao, Hongsheng Li:
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders. CoRR abs/2212.06785 (2022) - [i119]Benjin Zhu, Zhe Wang, Shaoshuai Shi, Hang Xu, Lanqing Hong, Hongsheng Li:
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection. CoRR abs/2212.07289 (2022) - 2021
- [j30]Hongsheng Li, Shaoting Zhang, Dimitris N. Metaxas:
Guest editorial: Deep learning for medical image analysis. Neurocomputing 438: 209-210 (2021) - [j29]Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, Yuanyuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen:
FocusNetv2: Imbalanced large and small organ segmentation with adversarial shape constraint for head and neck CT images. Medical Image Anal. 67: 101831 (2021) - [j28]Yantao Shen, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang, Hongsheng Li:
Person Re-Identification With Deep Kronecker-Product Matching and Group-Shuffling Random Walk. IEEE Trans. Pattern Anal. Mach. Intell. 43(5): 1649-1665 (2021) - [j27]Shaoshuai Shi, Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li:
From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network. IEEE Trans. Pattern Anal. Mach. Intell. 43(8): 2647-2664 (2021) - [c113]Xuesong Chen, Canmiao Fu, Feng Zheng, Yong Zhao, Hongsheng Li, Ping Luo, Guo-Jun Qi:
A Unified Multi-Scenario Attacking Network for Visual Object Tracking. AAAI 2021: 1097-1104 - [c112]Shijie Geng, Peng Gao, Moitreya Chatterjee, Chiori Hori, Jonathan Le Roux, Yongfeng Zhang, Hongsheng Li, Anoop Cherian:
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers. AAAI 2021: 1415-1423 - [c111]Jiawei Ren, Cunjun Yu, Zhongang Cai, Mingyuan Zhang, Chongsong Chen, Haiyu Zhao, Shuai Yi, Hongsheng Li:
REFINE: Prediction Fusion Network for Panoptic Segmentation. AAAI 2021: 2477-2485 - [c110]Minghang Zheng, Peng Gao, Renrui Zhang, Kunchang Li, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. BMVC 2021: 226 - [c109]Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li:
Semantic Scene Completion via Integrating Instances and Scene In-the-Loop. CVPR 2021: 324-333 - [c108]Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li:
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization. CVPR 2021: 464-474 - [c107]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification. CVPR 2021: 3436-3445 - [c106]Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li:
VS-Net: Voting With Segmentation for Visual Localization. CVPR 2021: 6101-6111 - [c105]Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin:
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation. CVPR 2021: 9939-9948 - [c104]Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi:
ST3D: Self-Training for Unsupervised Domain Adaptation on 3D Object Detection. CVPR 2021: 10368-10378 - [c103]Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu:
LiDAR-Based Panoptic Segmentation via Dynamic Shifting Network. CVPR 2021: 13090-13099 - [c102]Jingtan Piao, Keqiang Sun, Quan Wang, Kwan-Yee Lin, Hongsheng Li:
Inverting Generative Adversarial Renderer for Face Reconstruction. CVPR 2021: 15619-15628 - [c101]Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li:
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network. CVPR 2021: 16377-16386 - [c100]Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector. ICCV 2021: 3133-3143 - [c99]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. ICCV 2021: 3601-3610 - [c98]Yi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li:
Rethinking Noise Synthesis and Modeling in Raw Denoising. ICCV 2021: 4573-4581 - [c97]Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann:
Progressive Correspondence Pruning by Consensus Learning. ICCV 2021: 6444-6453 - [c96]Linjiang Huang, Liang Wang, Hongsheng Li:
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization. ICCV 2021: 7982-7991 - [c95]Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu:
Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency. ICCV 2021: 8846-8855 - [c94]Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li:
Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation. ICCV 2021: 13013-13022 - [c93]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting. ICCV 2021: 14020-14029 - [c92]Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li:
Learning N: M Fine-grained Structured Sparse Neural Networks From Scratch. ICLR 2021 - [c91]Xiaohan Xing, Yuenan Hou, Hang Li, Yixuan Yuan, Hongsheng Li, Max Q.-H. Meng:
Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification. MICCAI (5) 2021: 163-173 - [c90]Jiahui Li, Wen Chen, Xiaodi Huang, Shuang Yang, Zhiqiang Hu, Qi Duan, Dimitris N. Metaxas, Hongsheng Li, Shaoting Zhang:
Hybrid Supervision Learning for Pathology Whole Slide Image Classification. MICCAI (8) 2021: 309-318 - [c89]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Networks. NeurIPS 2021: 19160-19171 - [c88]Wei Sun, Aojun Zhou, Sander Stuijk, Rob G. J. Wijnhoven, Andrew Nelson, Hongsheng Li, Henk Corporaal:
DominoSearch: Find layer-wise fine-grained N: M sparse schemes from dense neural networks. NeurIPS 2021: 20721-20732 - [c87]Zhuoran Shen, Mingyuan Zhang, Haiyu Zhao, Shuai Yi, Hongsheng Li:
Efficient Attention: Attention with Linear Complexities. WACV 2021: 3530-3538 - [i118]Chen Zhao, Yixiao Ge, Jiaqi Yang, Feng Zhu, Rui Zhao, Hongsheng Li:
Consensus-Guided Correspondence Denoising. CoRR abs/2101.00591 (2021) - [i117]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2101.07448 (2021) - [i116]Shaoshuai Shi, Li Jiang, Jiajun Deng, Zhe Wang, Chaoxu Guo, Jianping Shi, Xiaogang Wang, Hongsheng Li:
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection. CoRR abs/2102.00463 (2021) - [i115]Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li:
Learning N: M Fine-grained Structured Sparse Neural Networks From Scratch. CoRR abs/2102.04010 (2021) - [i114]Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi:
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection. CoRR abs/2103.05346 (2021) - [i113]Rui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li:
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network. CoRR abs/2103.07893 (2021) - [i112]Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. CoRR abs/2103.14026 (2021) - [i111]Jiangfan Han, Mengya Gao, Yujie Wang, Quanquan Li, Hongsheng Li, Xiaogang Wang:
Fixing the Teacher-Student Knowledge Discrepancy in Distillation. CoRR abs/2103.16844 (2021) - [i110]Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, Yuanyuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen:
FocusNetv2: Imbalanced Large and Small Organ Segmentation with Adversarial Shape Constraint for Head and Neck CT Images. CoRR abs/2104.01771 (2021) - [i109]Zhaoyang Huang, Xiaokun Pan, Runsen Xu, Yan Xu, Ka Chun Cheung, Guofeng Zhang, Hongsheng Li:
LIFE: Lighting Invariant Flow Estimation. CoRR abs/2104.03097 (2021) - [i108]Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li:
Semantic Scene Completion via Integrating Instances and Scene in-the-Loop. CoRR abs/2104.03640 (2021) - [i107]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Decoupled Spatial-Temporal Transformer for Video Inpainting. CoRR abs/2104.06637 (2021) - [i106]Yixiao Ge, Ching Lam Choi, Xiao Zhang, Peipei Zhao, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification. CoRR abs/2104.13298 (2021) - [i105]Jingtan Piao, Keqiang Sun, Kwan-Yee Lin, Quan Wang, Hongsheng Li:
Inverting Generative Adversarial Renderer for Face Reconstruction. CoRR abs/2105.02431 (2021) - [i104]Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li:
VS-Net: Voting with Segmentation for Visual Localization. CoRR abs/2105.10886 (2021) - [i103]Jihao Liu, Ming Zhang, Yangting Sun, Boxiao Liu, Guanglu Song, Yu Liu, Hongsheng Li:
FNAS: Uncertainty-Aware Fast Neural Architecture Search. CoRR abs/2105.11694 (2021) - [i102]Peng Gao, Jiasen Lu, Hongsheng Li, Roozbeh Mottaghi, Aniruddha Kembhavi:
Container: Context Aggregation Network. CoRR abs/2106.01401 (2021) - [i101]Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021) - [i100]Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification. CoRR abs/2106.06133 (2021) - [i99]Jiahui Li, Wen Chen, Xiaodi Huang, Zhiqiang Hu, Qi Duan, Hongsheng Li, Dimitris N. Metaxas, Shaoting Zhang:
Mixed Supervision Learning for Whole Slide Image Classification. CoRR abs/2107.00934 (2021) - [i98]Xiaohan Xing, Yuenan Hou, Hang Li, Yixuan Yuan, Hongsheng Li, Max Q.-H. Meng:
Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification. CoRR abs/2107.03225 (2021) - [i97]Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu:
Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency. CoRR abs/2107.11355 (2021) - [i96]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2108.02404 (2021) - [i95]Linjiang Huang, Liang Wang, Hongsheng Li:
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization. CoRR abs/2108.06524 (2021) - [i94]Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi:
ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D Object Detection. CoRR abs/2108.06682 (2021) - [i93]Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao:
LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation. CoRR abs/2108.07511 (2021) - [i92]Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector. CoRR abs/2108.08258 (2021) - [i91]Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li:
Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation. CoRR abs/2109.02303 (2021) - [i90]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting. CoRR abs/2109.02974 (2021) - [i89]Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Wei Li, Yuexin Ma, Hongsheng Li, Ruigang Yang, Dahua Lin:
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based Perception. CoRR abs/2109.05441 (2021) - [i88]Jihao Liu, Hongsheng Li, Guanglu Song, Xin Huang, Yu Liu:
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP. CoRR abs/2110.04035 (2021) - [i87]Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao:
CLIP-Adapter: Better Vision-Language Models with Feature Adapters. CoRR abs/2110.04544 (2021) - [i86]Yi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li:
Rethinking Noise Synthesis and Modeling in Raw Denoising. CoRR abs/2110.04756 (2021) - [i85]Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021) - [i84]Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li:
IDR: Self-Supervised Image Denoising via Iterative Data Refinement. CoRR abs/2111.14358 (2021) - [i83]Teli Ma, Shijie Geng, Mengmeng Wang, Jing Shao, Jiasen Lu, Hongsheng Li, Peng Gao, Yu Qiao:
A Simple Long-Tailed Recognition Baseline via Vision-Language Model. CoRR abs/2111.14745 (2021) - [i82]Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Xiaogang Wang, Hongsheng Li, Xiaohua Wang, Jifeng Dai:
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks. CoRR abs/2112.01522 (2021) - [i81]Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li:
PointCLIP: Point Cloud Understanding by CLIP. CoRR abs/2112.02413 (2021) - 2020
- [j26]Jun-Yan Zhu, Hongsheng Li, Eli Shechtman, Ming-Yu Liu, Jan Kautz, Antonio Torralba:
Guest Editorial: Generative Adversarial Networks for Computer Vision. Int. J. Comput. Vis. 128(10): 2363-2365 (2020) - [j25]Zixuan Huang, Junming Fan, Shenggan Cheng, Shuai Yi, Xiaogang Wang, Hongsheng Li:
HMS-Net: Hierarchical Multi-Scale Sparsity-Invariant Network for Sparse Depth Completion. IEEE Trans. Image Process. 29: 3429-3441 (2020) - [c86]Yingjie Cai, Buyu Li, Zeyu Jiao, Hongsheng Li, Xingyu Zeng, Xiaogang Wang:
Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation. AAAI 2020: 10478-10485 - [c85]Yushi Lan, Yuan Liu, Xinchi Zhou, Maoqing Tian, Xuesen Zhang, Shuai Yi, Hongsheng Li:
MagnifierNet: Towards Semantic Adversary and Fusion for Person Re-identification. BMVC 2020 - [c84]Yan Xu, Zhaoyang Huang, Kwan-Yee Lin, Xinge Zhu, Jianping Shi, Hujun Bao, Guofeng Zhang, Hongsheng Li:
SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks. CoRL 2020: 115-125 - [c83]Xiaokang Chen, Kwan-Yee Lin, Chen Qian, Gang Zeng, Hongsheng Li:
3D Sketch-Aware Semantic Scene Completion via Semi-Supervised Structure Prior. CVPR 2020: 4192-4201 - [c82]Shaoshuai Shi, Chaoxu Guo, Li Jiang, Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li:
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. CVPR 2020: 10526-10535 - [c81]Rui Liu, Chengxi Yang, Wenxiu Sun, Xiaogang Wang, Hongsheng Li:
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching. CVPR 2020: 12754-12763 - [c80]Xiaoyi Dong, Jiangfan Han, Dongdong Chen, Jiayang Liu, Huanyu Bian, Zehua Ma, Hongsheng Li, Xiaogang Wang, Weiming Zhang, Nenghai Yu:
Robust Superpixel-Guided Attentional Adversarial Attack. CVPR 2020: 12892-12901 - [c79]Jianbo Liu, Junjun He, Jiawei Zhang, Jimmy S. Ren, Hongsheng Li:
EfficientFCN: Holistically-Guided Decoding for Semantic Segmentation. ECCV (26) 2020: 1-17 - [c78]Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li:
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions. ECCV (11) 2020: 89-106 - [c77]Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li:
RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax. ECCV (26) 2020: 296-311 - [c76]Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-supervising Fine-Grained Region Similarities for Large-Scale Image Localization. ECCV (4) 2020: 369-386 - [c75]Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng:
Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation. ECCV (11) 2020: 561-577 - [c74]Jianbo Liu, Junjun He, Yu Qiao, Jimmy S. Ren, Hongsheng Li:
Learning to Predict Context-Adaptive Convolution for Semantic Segmentation. ECCV (25) 2020: 769-786 - [c73]Yixiao Ge, Dapeng Chen, Hongsheng Li:
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification. ICLR 2020 - [c72]Rui Huang, Yuanjie Zheng, Zhiqiang Hu, Shaoting Zhang, Hongsheng Li:
Multi-organ Segmentation via Co-training Weight-Averaged Models from Few-Organ Datasets. MICCAI (4) 2020: 146-155 - [c71]Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, Hongsheng Li:
Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. NeurIPS 2020 - [c70]Jiawei Ren, Cunjun Yu, Shunan Sheng, Xiao Ma, Haiyu Zhao, Shuai Yi, Hongsheng Li:
Balanced Meta-Softmax for Long-Tailed Visual Recognition. NeurIPS 2020 - [i80]Yixiao Ge, Dapeng Chen, Hongsheng Li:
Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification. CoRR abs/2001.01526 (2020) - [i79]Yingjie Cai, Buyu Li, Zeyu Jiao, Hongsheng Li, Xingyu Zeng, Xiaogang Wang:
Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation. CoRR abs/2002.01619 (2020) - [i78]Yushi Lan, Yuan Liu, Maoqing Tian, Xinchi Zhou, Xuesen Zhang, Shuai Yi, Hongsheng Li:
MagnifierNet: Towards Semantic Regularization and Fusion for Person Re-identification. CoRR abs/2002.10979 (2020) - [i77]Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li:
Structured Domain Adaptation for Unsupervised Person Re-identification. CoRR abs/2003.06650 (2020) - [i76]Xiaokang Chen, Kwan-Yee Lin, Chen Qian, Gang Zeng, Hongsheng Li:
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior. CoRR abs/2003.14052 (2020) - [i75]Jianbo Liu, Junjun He, Jimmy S. Ren, Yu Qiao, Hongsheng Li:
Learning to Predict Context-adaptive Convolution for Semantic Segmentation. CoRR abs/2004.08222 (2020) - [i74]Rui Liu, Chengxi Yang, Wenxiu Sun, Xiaogang Wang, Hongsheng Li:
StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching. CoRR abs/2005.01927 (2020) - [i73]Yixiao Ge, Dapeng Chen, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. CoRR abs/2006.02713 (2020) - [i72]Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li:
Self-supervising Fine-grained Region Similarities for Large-scale Image Localization. CoRR abs/2006.03926 (2020) - [i71]Junting Pan, Siyu Chen, Zheng Shou, Jing Shao, Hongsheng Li:
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization. CoRR abs/2006.07976 (2020) - [i70]Siyu Chen, Junting Pan, Guanglu Song, Manyuan Zhang, Hao Shao, Ziyi Lin, Jing Shao, Hongsheng Li, Yu Liu:
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020. CoRR abs/2006.09116 (2020) - [i69]Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng:
Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation. CoRR abs/2007.09183 (2020) - [i68]Haisheng Su, Jinyuan Feng, Hao Shao, Zhenyu Jiang, Manyuan Zhang, Wei Wu, Yu Liu, Hongsheng Li, Junjie Yan:
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020. CoRR abs/2007.09883 (2020) - [i67]Jiawei Ren, Cunjun Yu, Shunan Sheng, Xiao Ma, Haiyu Zhao, Shuai Yi, Hongsheng Li:
Balanced Meta-Softmax for Long-Tailed Visual Recognition. CoRR abs/2007.10740 (2020) - [i66]Hui Zhou, Xinge Zhu, Xiao Song, Yuexin Ma, Zhe Wang, Hongsheng Li, Dahua Lin:
Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation. CoRR abs/2008.01550 (2020) - [i65]Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li:
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions. CoRR abs/2008.01576 (2020) - [i64]Rui Huang, Yuanjie Zheng, Zhiqiang Hu, Shaoting Zhang, Hongsheng Li:
Multi-organ Segmentation via Co-training Weight-averaged Models from Few-organ Datasets. CoRR abs/2008.07149 (2020) - [i63]Jianbo Liu, Junjun He, Jiawei Zhang, Jimmy S. Ren, Hongsheng Li:
EfficientFCN: Holistically-guided Decoding for Semantic Segmentation. CoRR abs/2008.10487 (2020) - [i62]Shaoshuai Shi, Chaoxu Guo, Jihan Yang, Hongsheng Li:
PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges. CoRR abs/2008.12599 (2020) - [i61]Yan Xu, Zhaoyang Huang, Kwan-Yee Lin, Xinge Zhu, Jianping Shi, Hujun Bao, Guofeng Zhang, Hongsheng Li:
SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks. CoRR abs/2010.09343 (2020) - [i60]Minghang Zheng, Peng Gao, Xiaogang Wang, Hongsheng Li, Hao Dong:
End-to-End Object Detection with Adaptive Clustering Transformer. CoRR abs/2011.09315 (2020) - [i59]Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin:
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation. CoRR abs/2011.10033 (2020) - [i58]Fangzhou Hong, Hui Zhou, Xinge Zhu, Hongsheng Li, Ziwei Liu:
LiDAR-based Panoptic Segmentation via Dynamic Shifting Network. CoRR abs/2011.11964 (2020) - [i57]Jianbo Liu, Sijie Ren, Yuanjie Zheng, Xiaogang Wang, Hongsheng Li:
A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection. CoRR abs/2012.10162 (2020) - [i56]Daisheng Jin, Xiao Ma, Chongzhi Zhang, Yizhuo Zhou, Jiashu Tao, Mingyuan Zhang, Haiyu Zhao, Shuai Yi, Zhoujun Li, Xianglong Liu, Hongsheng Li:
Towards Overcoming False Positives in Visual Relationship Detection. CoRR abs/2012.12510 (2020)
2010 – 2019
- 2019
- [j24]Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas:
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks. IEEE Trans. Pattern Anal. Mach. Intell. 41(8): 1947-1962 (2019) - [j23]Hui Zhou, Wanli Ouyang, Jian Cheng, Xiaogang Wang, Hongsheng Li:
Deep Continuous Conditional Random Fields With Asymmetric Inter-Object Constraints for Online Multi-Object Tracking. IEEE Trans. Circuits Syst. Video Technol. 29(4): 1011-1022 (2019) - [c69]Kui Xu, Zhe Wang, Jianping Shi, Hongsheng Li, Qiangfeng Cliff Zhang:
A2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes. AAAI 2019: 1230-1237 - [c68]Mingyang Liang, Xiaoyang Guo, Hongsheng Li, Xiaogang Wang, You Song:
Unsupervised Cross-Spectral Stereo Matching by Learning to Synthesize. AAAI 2019: 8706-8713 - [c67]Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud. CVPR 2019: 770-779 - [c66]Xihui Liu, Zihao Wang, Jing Shao, Xiaogang Wang, Hongsheng Li:
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing. CVPR 2019: 1950-1959 - [c65]Xiaoyang Guo, Kai Yang, Wukui Yang, Xiaogang Wang, Hongsheng Li:
Group-Wise Correlation Stereo Network. CVPR 2019: 3273-3282 - [c64]Peng Gao, Zhengkai Jiang, Haoxuan You, Pan Lu, Steven C. H. Hoi, Xiaogang Wang, Hongsheng Li:
Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering. CVPR 2019: 6639-6648 - [c63]Rui Liu, Yu Liu, Xinyu Gong, Xiaogang Wang, Hongsheng Li:
Conditional Adversarial Generative Flow for Controllable Image Synthesis. CVPR 2019: 7992-8001 - [c62]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CVPR 2019: 9906-9914 - [c61]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CVPR 2019: 10823-10832 - [c60]Jiageng Mao, Xiaogang Wang, Hongsheng Li:
Interpolated Convolutional Networks for 3D Point Cloud Understanding. ICCV 2019: 1578-1587 - [c59]Yan Xu, Xinge Zhu, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li:
Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints. ICCV 2019: 2811-2820 - [c58]Zihao Wang, Xihui Liu, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao:
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval. ICCV 2019: 5763-5772 - [c57]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-Modality Latent Interaction Network for Visual Question Answering. ICCV 2019: 5824-5834 - [c56]Jingtan Piao, Chen Qian, Hongsheng Li:
Semi-Supervised Monocular 3D Face Reconstruction With End-to-End Shape-Preserved Domain Transfer. ICCV 2019: 9397-9406 - [c55]Luyang Wang, Yan Chen, Zhenhua Guo, Keyuan Qian, Mude Lin, Hongsheng Li, Jimmy S. J. Ren:
Generalizing Monocular 3D Human Pose Estimation in the Wild. ICCV Workshops 2019: 4024-4033 - [c54]Jiahui Li, Shuang Yang, Xiaodi Huang, Qian Da, Xiaoqun Yang, Zhiqiang Hu, Qi Duan, Chaofu Wang, Hongsheng Li:
Signet Ring Cell Detection with a Semi-supervised Learning Framework. IPMI 2019: 842-854 - [c53]Yunhe Gao, Rui Huang, Ming Chen, Zhe Wang, Jincheng Deng, Yuanyuan Chen, Yiwei Yang, Jie Zhang, Chanjuan Tao, Hongsheng Li:
FocusNet: Imbalanced Large and Small Organ Segmentation with an End-to-End Deep Neural Network for Head and Neck CT Images. MICCAI (3) 2019: 829-838 - [c52]Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li:
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis. NeurIPS 2019: 568-578 - [i55]Kui Xu, Zhe Wang, Jianping Shi, Hongsheng Li, Qiangfeng Cliff Zhang:
A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes. CoRR abs/1901.00785 (2019) - [i54]Xihui Liu, Zihao Wang, Jing Shao, Xiaogang Wang, Hongsheng Li:
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing. CoRR abs/1903.00839 (2019) - [i53]Mingyang Liang, Xiaoyang Guo, Hongsheng Li, Xiaogang Wang, You Song:
Unsupervised Cross-spectral Stereo Matching by Learning to Synthesize. CoRR abs/1903.01078 (2019) - [i52]Xiaoyang Guo, Kai Yang, Wukui Yang, Xiaogang Wang, Hongsheng Li:
Group-wise Correlation Stereo Network. CoRR abs/1903.04025 (2019) - [i51]Rui Liu, Yu Liu, Xinyu Gong, Xiaogang Wang, Hongsheng Li:
Conditional Adversarial Generative Flow for Controllable Image Synthesis. CoRR abs/1904.01782 (2019) - [i50]Luyang Wang, Yan Chen, Zhenhua Guo, Keyuan Qian, Mude Lin, Hongsheng Li, Jimmy S. J. Ren:
Generalizing Monocular 3D Human Pose Estimation in the Wild. CoRR abs/1904.05512 (2019) - [i49]Xiao Zhang, Rui Zhao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations. CoRR abs/1905.00292 (2019) - [i48]Xiao Zhang, Rui Zhao, Junjie Yan, Mengya Gao, Yu Qiao, Xiaogang Wang, Hongsheng Li:
P2SGrad: Refined Gradients for Optimizing Deep Face Models. CoRR abs/1905.02479 (2019) - [i47]Shaoshuai Shi, Zhe Wang, Xiaogang Wang, Hongsheng Li:
Part-A2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud. CoRR abs/1907.03670 (2019) - [i46]Jiahui Li, Shuang Yang, Xiaodi Huang, Qian Da, Xiaoqun Yang, Zhiqiang Hu, Qi Duan, Chaofu Wang, Hongsheng Li:
Signet Ring Cell Detection With a Semi-supervised Learning Framework. CoRR abs/1907.03954 (2019) - [i45]Yunhe Gao, Rui Huang, Ming Chen, Zhe Wang, Jincheng Deng, Yuanyuan Chen, Yiwei Yang, Jie Zhang, Chanjuan Tao, Hongsheng Li:
FocusNet: Imbalanced Large and Small Organ Segmentation with an End-to-End Deep Neural Network for Head and Neck CT Images. CoRR abs/1907.12056 (2019) - [i44]Peng Gao, Haoxuan You, Zhanpeng Zhang, Xiaogang Wang, Hongsheng Li:
Multi-modality Latent Interaction Network for Visual Question Answering. CoRR abs/1908.04289 (2019) - [i43]Jiageng Mao, Xiaogang Wang, Hongsheng Li:
Interpolated Convolutional Networks for 3D Point Cloud Understanding. CoRR abs/1908.04512 (2019) - [i42]Zihao Wang, Xihui Liu, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao:
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval. CoRR abs/1909.05506 (2019) - [i41]Yan Xu, Xinge Zhu, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li:
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints. CoRR abs/1910.06727 (2019) - [i40]Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li:
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis. CoRR abs/1910.06809 (2019) - [i39]Shaoshuai Shi, Chaoxu Guo, Li Jiang, Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li:
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection. CoRR abs/1912.13192 (2019) - 2018
- [j22]Chen Chen, Lei He, Hongsheng Li, Junzhou Huang:
Fast iteratively reweighted least squares algorithms for analysis-based sparse reconstruction. Medical Image Anal. 49: 141-152 (2018) - [j21]Wanli Ouyang, Hui Zhou, Hongsheng Li, Quanquan Li, Junjie Yan, Xiaogang Wang:
Jointly Learning Deep Features, Deformable Parts, Occlusion and Classification for Pedestrian Detection. IEEE Trans. Pattern Anal. Mach. Intell. 40(8): 1874-1887 (2018) - [j20]Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang:
Crafting GBD-Net for Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 40(9): 2109-2123 (2018) - [j19]Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, Wanli Ouyang:
T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos. IEEE Trans. Circuits Syst. Video Technol. 28(10): 2896-2907 (2018) - [c51]Xinge Zhu, Zhichao Yin, Jianping Shi, Hongsheng Li, Dahua Lin:
Generative Adversarial Frontal View to Bird View Synthesis. 3DV 2018: 454-463 - [c50]Pan Lu, Hongsheng Li, Wei Zhang, Jianyong Wang, Xiaogang Wang:
Co-Attending Free-Form Regions and Detections With Multi-Modal Multiplicative Feature Embedding for Visual Question Answering. AAAI 2018: 7218-7225 - [c49]Yue Luo, Jimmy S. J. Ren, Mude Lin, Jiahao Pang, Wenxiu Sun, Hongsheng Li, Liang Lin:
Single View Stereo Matching. CVPR 2018: 155-163 - [c48]Dapeng Chen, Hongsheng Li, Tong Xiao, Shuai Yi, Xiaogang Wang:
Video Person Re-Identification With Competitive Snippet-Similarity Aggregation and Co-Attentive Snippet Embedding. CVPR 2018: 1169-1178 - [c47]Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang:
Deep Group-Shuffling Random Walk for Person Re-Identification. CVPR 2018: 2265-2274 - [c46]Wei Yang, Wanli Ouyang, Xiaolong Wang, Jimmy S. J. Ren, Hongsheng Li, Xiaogang Wang:
3D Human Pose Estimation in the Wild by Adversarial Learning. CVPR 2018: 5255-5264 - [c45]Maoqing Tian, Shuai Yi, Hongsheng Li, Shihua Li, Xuesen Zhang, Jianping Shi, Junjie Yan, Xiaogang Wang:
Eliminating Background-Bias for Robust Person Re-Identification. CVPR 2018: 5794-5803 - [c44]Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang:
End-to-End Deep Kronecker-Product Matching for Person Re-Identification. CVPR 2018: 6886-6895 - [c43]Dapeng Chen, Dan Xu, Hongsheng Li, Nicu Sebe, Xiaogang Wang:
Group Consistent Similarity Learning via Deep CRF for Person Re-Identification. CVPR 2018: 8649-8658 - [c42]Dapeng Chen, Hongsheng Li, Xihui Liu, Yantao Shen, Jing Shao, Zejian Yuan, Xiaogang Wang:
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association. ECCV (16) 2018: 56-73 - [c41]Xihui Liu, Hongsheng Li, Jing Shao, Dapeng Chen, Xiaogang Wang:
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data. ECCV (15) 2018: 353-369 - [c40]Peng Gao, Hongsheng Li, Shuang Li, Pan Lu, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. ECCV (1) 2018: 485-501 - [c39]Xiaoyang Guo, Hongsheng Li, Shuai Yi, Jimmy S. J. Ren, Xiaogang Wang:
Learning Monocular Depth by Distilling Cross-Domain Stereo Networks. ECCV (11) 2018: 506-523 - [c38]Yantao Shen, Hongsheng Li, Shuai Yi, Dapeng Chen, Xiaogang Wang:
Person Re-identification with Deep Similarity-Guided Graph Neural Network. ECCV (15) 2018: 508-526 - [c37]Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li:
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification. NeurIPS 2018: 1230-1241 - [i38]Yue Luo, Jimmy S. J. Ren, Mude Lin, Jiahao Pang, Wenxiu Sun, Hongsheng Li, Liang Lin:
Single View Stereo Matching. CoRR abs/1803.02612 (2018) - [i37]Xihui Liu, Hongsheng Li, Jing Shao, Dapeng Chen, Xiaogang Wang:
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data. CoRR abs/1803.08314 (2018) - [i36]Wei Yang, Wanli Ouyang, Xiaolong Wang, Jimmy S. J. Ren, Hongsheng Li, Xiaogang Wang:
3D Human Pose Estimation in the Wild by Adversarial Learning. CoRR abs/1803.09722 (2018) - [i35]Zhe Wang, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learnable Histogram: Statistical Context Features for Deep Neural Networks. CoRR abs/1804.09398 (2018) - [i34]Hui Zhou, Wanli Ouyang, Jian Cheng, Xiaogang Wang, Hongsheng Li:
Deep Continuous Conditional Random Fields with Asymmetric Inter-object Constraints for Online Multi-object Tracking. CoRR abs/1806.01183 (2018) - [i33]Yantao Shen, Hongsheng Li, Shuai Yi, Dapeng Chen, Xiaogang Wang:
Person Re-identification with Deep Similarity-Guided Graph Neural Network. CoRR abs/1807.09975 (2018) - [i32]Yantao Shen, Hongsheng Li, Tong Xiao, Shuai Yi, Dapeng Chen, Xiaogang Wang:
Deep Group-shuffling Random Walk for Person Re-identification. CoRR abs/1807.11178 (2018) - [i31]Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang:
End-to-End Deep Kronecker-Product Matching for Person Re-identification. CoRR abs/1807.11182 (2018) - [i30]Xinge Zhu, Zhichao Yin, Jianping Shi, Hongsheng Li, Dahua Lin:
Generative Adversarial Frontal View to Bird View Synthesis. CoRR abs/1808.00327 (2018) - [i29]Dapeng Chen, Hongsheng Li, Xihui Liu, Yantao Shen, Zejian Yuan, Xiaogang Wang:
Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association. CoRR abs/1808.01571 (2018) - [i28]Peng Gao, Pan Lu, Hongsheng Li, Shuang Li, Yikang Li, Steven C. H. Hoi, Xiaogang Wang:
Question-Guided Hybrid Convolution for Visual Question Answering. CoRR abs/1808.02632 (2018) - [i27]Xiaoyang Guo, Hongsheng Li, Shuai Yi, Jimmy S. J. Ren, Xiaogang Wang:
Learning Monocular Depth by Distilling Cross-domain Stereo Networks. CoRR abs/1808.06586 (2018) - [i26]Zixuan Huang, Junming Fan, Shuai Yi, Xiaogang Wang, Hongsheng Li:
HMS-Net: Hierarchical Multi-scale Sparsity-invariant Network for Sparse Depth Completion. CoRR abs/1808.08685 (2018) - [i25]Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, Hongsheng Li:
FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification. CoRR abs/1810.02936 (2018) - [i24]Shaoshuai Shi, Xiaogang Wang, Hongsheng Li:
PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud. CoRR abs/1812.04244 (2018) - [i23]Peng Gao, Hongsheng Li, Haoxuan You, Zhengkai Jiang, Pan Lu, Steven C. H. Hoi, Xiaogang Wang:
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering. CoRR abs/1812.05252 (2018) - 2017
- [j18]Shuang Li, Zewei Yang, Hongsheng Li:
Statistical Evaluation of No-Reference Image Quality Assessment Metrics for Remote Sensing Images. ISPRS Int. J. Geo Inf. 6(5): 133 (2017) - [j17]Shuai Yi, Xiaogang Wang, Cewu Lu, Jiaya Jia, Hongsheng Li:
L0 Regularized Stationary-Time Estimation for Crowd Analysis. IEEE Trans. Pattern Anal. Mach. Intell. 39(5): 981-994 (2017) - [j16]Wanli Ouyang, Xingyu Zeng, Xiaogang Wang, Shi Qiu, Ping Luo, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Hongyang Li, Kun Wang, Junjie Yan, Chen Change Loy, Xiaoou Tang:
DeepID-Net: Object Detection with Deformable Part Based Convolutional Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(7): 1320-1334 (2017) - [c36]Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang:
Object Detection in Videos with Tubelet Proposal Networks. CVPR 2017: 889-897 - [c35]Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang:
Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification. CVPR 2017: 2027-2036 - [c34]Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang:
Person Search with Natural Language Description. CVPR 2017: 5187-5196 - [c33]Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, Xiaogang Wang:
Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification. ICCV 2017: 379-387 - [c32]Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Learning Feature Pyramids for Human Pose Estimation. ICCV 2017: 1290-1299 - [c31]Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang:
Identity-Aware Textual-Visual Matching with Latent Co-attention. ICCV 2017: 1908-1917 - [c30]Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang:
Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals. ICCV 2017: 1918-1927 - [c29]Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, Bin Liu, Nenghai Yu:
Online Multi-object Tracking Using CNN-Based Single Object Tracker with Spatial-Temporal Attention Mechanism. ICCV 2017: 4846-4855 - [c28]Han Zhang, Tao Xu, Hongsheng Li:
StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks. ICCV 2017: 5908-5916 - [c27]Zhe Wang, Yanxin Yin, Jianping Shi, Wei Fang, Hongsheng Li, Xiaogang Wang:
Zoom-in-Net: Deep Mining Lesions for Diabetic Retinopathy Detection. MICCAI (3) 2017: 267-275 - [i22]Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang:
Person Search with Natural Language Description. CoRR abs/1702.05729 (2017) - [i21]Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang:
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification. CoRR abs/1702.05891 (2017) - [i20]Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang:
Object Detection in Videos with Tubelet Proposal Networks. CoRR abs/1702.06355 (2017) - [i19]Zhe Wang, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision. CoRR abs/1706.02493 (2017) - [i18]Zhe Wang, Yanxin Yin, Jianping Shi, Wei Fang, Hongsheng Li, Xiaogang Wang:
Zoom-in-Net: Deep Mining Lesions for Diabetic Retinopathy Detection. CoRR abs/1706.04372 (2017) - [i17]Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Learning Feature Pyramids for Human Pose Estimation. CoRR abs/1708.01101 (2017) - [i16]Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang:
Identity-Aware Textual-Visual Matching with Latent Co-attention. CoRR abs/1708.01988 (2017) - [i15]Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, Bin Liu, Nenghai Yu:
Online Multi-Object Tracking Using CNN-based Single Object Tracker with Spatial-Temporal Attention Mechanism. CoRR abs/1708.02843 (2017) - [i14]Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang:
Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-temporal Path Proposals. CoRR abs/1708.03918 (2017) - [i13]Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas:
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks. CoRR abs/1710.10916 (2017) - [i12]Pan Lu, Hongsheng Li, Wei Zhang, Jianyong Wang, Xiaogang Wang:
Co-attending Free-form Regions and Detections with Multi-modal Multiplicative Feature Embedding for Visual Question Answering. CoRR abs/1711.06794 (2017) - 2016
- [j15]Zhe Wang, Hongsheng Li, Qinwei Zhang, Jing Yuan, Xiaogang Wang:
Magnetic Resonance Fingerprinting with compressed sensing and distance metric learning. Neurocomputing 174: 560-570 (2016) - [j14]Shuai Yi, Hongsheng Li, Xiaogang Wang:
Pedestrian Behavior Modeling From Stationary Crowds With Applications to Intelligent Surveillance. IEEE Trans. Image Process. 25(9): 4354-4368 (2016) - [j13]Cong Zhang, Kai Kang, Hongsheng Li, Xiaogang Wang, Rong Xie, Xiaokang Yang:
Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset. IEEE Trans. Multim. 18(6): 1048-1061 (2016) - [c26]Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Object Detection from Video Tubelets with Convolutional Neural Networks. CVPR 2016: 817-825 - [c25]Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification. CVPR 2016: 1249-1258 - [c24]Wei Yang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation. CVPR 2016: 3073-3082 - [c23]Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Structured Feature Learning for Pose Estimation. CVPR 2016: 4715-4723 - [c22]Zhe Wang, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learnable Histogram: Statistical Context Features for Deep Neural Networks. ECCV (1) 2016: 246-262 - [c21]Shuai Yi, Hongsheng Li, Xiaogang Wang:
Pedestrian Behavior Understanding and Prediction with Deep Neural Networks. ECCV (1) 2016: 263-279 - [c20]Zhuoyi Zhao, Hongsheng Li, Rui Zhao, Xiaogang Wang:
Crossing-Line Crowd Counting with Two-Phase Deep Neural Networks. ECCV (8) 2016: 712-726 - [c19]Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
CRF-CNN: Modeling Structured Information in Human Pose Estimation. NIPS 2016: 316-324 - [i11]Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Structured Feature Learning for Pose Estimation. CoRR abs/1603.09065 (2016) - [i10]Kai Kang, Hongsheng Li, Junjie Yan, Xingyu Zeng, Bin Yang, Tong Xiao, Cong Zhang, Zhe Wang, Ruohui Wang, Xiaogang Wang, Wanli Ouyang:
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos. CoRR abs/1604.02532 (2016) - [i9]Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Object Detection from Video Tubelets with Convolutional Neural Networks. CoRR abs/1604.04053 (2016) - [i8]Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification. CoRR abs/1604.07528 (2016) - [i7]Xingyu Zeng, Wanli Ouyang, Junjie Yan, Hongsheng Li, Tong Xiao, Kun Wang, Yu Liu, Yucong Zhou, Bin Yang, Zhe Wang, Hui Zhou, Xiaogang Wang:
Crafting GBD-Net for Object Detection. CoRR abs/1610.02579 (2016) - [i6]Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
CRF-CNN: Modeling Structured Information in Human Pose Estimation. CoRR abs/1611.00468 (2016) - [i5]Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaolei Huang, Xiaogang Wang, Dimitris N. Metaxas:
StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks. CoRR abs/1612.03242 (2016) - 2015
- [j12]Menglin Jiang, Shaoting Zhang, Hongsheng Li, Dimitris N. Metaxas:
Computer-Aided Diagnosis of Mammographic Masses Using Scalable Image Retrieval. IEEE Trans. Biomed. Eng. 62(2): 783-792 (2015) - [j11]Jian Cheng, Haijun Liu, Feng Wang, Hongsheng Li, Ce Zhu:
Silhouette Analysis for Human Action Recognition Based on Supervised Temporal t-SNE and Incremental Learning. IEEE Trans. Image Process. 24(10): 3203-3217 (2015) - [c18]Cong Zhang, Hongsheng Li, Xiaogang Wang, Xiaokang Yang:
Cross-scene crowd counting via deep convolutional neural networks. CVPR 2015: 833-841 - [c17]Rui Zhao, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Saliency detection by multi-context deep learning. CVPR 2015: 1265-1274 - [c16]Wanli Ouyang, Xiaogang Wang, Xingyu Zeng, Shi Qiu, Ping Luo, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Chen Change Loy, Xiaoou Tang:
DeepID-Net: Deformable deep convolutional neural networks for object detection. CVPR 2015: 2403-2412 - [c15]Shuai Yi, Hongsheng Li, Xiaogang Wang:
Understanding pedestrian behaviors from stationary crowd groups. CVPR 2015: 3488-3496 - [c14]Shuai Yi, Hongsheng Li, Xiaogang Wang:
Pedestrian Travel Time Estimation in Crowded Scenes. ICCV 2015: 3137-3145 - 2014
- [j10]Jian Cheng, Lan Li, Hongsheng Li, Feng Wang:
SAR target recognition based on improved joint sparse representation. EURASIP J. Adv. Signal Process. 2014: 87 (2014) - [j9]Yuanjie Zheng, Ebenezer Daniel, Allan A. Hunter III, Rui Xiao, Jianbin Gao, Hongsheng Li, Maureen G. Maguire, David H. Brainard, James C. Gee:
Landmark matching based retinal image alignment by enforcing sparsity in correspondence matrix. Medical Image Anal. 18(6): 903-913 (2014) - [j8]Jian Cheng, Haijun Liu, Hongsheng Li:
Silhouette analysis for human action recognition based on maximum spatio-temporal dissimilarity embedding. Mach. Vis. Appl. 25(4): 1007-1018 (2014) - [j7]Hongsheng Li, Xiaolei Huang, Junzhou Huang, Shaoting Zhang:
Feature Matching with Affine-Function Transformation Models. IEEE Trans. Pattern Anal. Mach. Intell. 36(12): 2407-2422 (2014) - [j6]Hongsheng Li, Yuanjie Zheng, Shaoting Zhang, Jian Cheng:
Solving a Special Type of Jigsaw Puzzles: Banknote Reconstruction From a Large Number of Fragments. IEEE Trans. Multim. 16(2): 571-578 (2014) - [c13]Chen Chen, Junzhou Huang, Lei He, Hongsheng Li:
Preconditioning for Accelerated Iteratively Reweighted Least Squares in Structured Sparsity Reconstruction. CVPR 2014: 2713-2720 - [i4]Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Yuanjun Xiong, Chen Qian, Zhenyao Zhu, Ruohui Wang, Chen Change Loy, Xiaogang Wang, Xiaoou Tang:
DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection. CoRR abs/1409.3505 (2014) - [i3]Chen Chen, Junzhou Huang, Lei He, Hongsheng Li:
Fast Iteratively Reweighted Least Squares Algorithms for Analysis-Based Sparsity Reconstruction. CoRR abs/1411.5057 (2014) - [i2]Hongsheng Li, Rui Zhao, Xiaogang Wang:
Highly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification. CoRR abs/1412.4526 (2014) - [i1]Wanli Ouyang, Xiaogang Wang, Xingyu Zeng, Shi Qiu, Ping Luo, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Chen Change Loy, Xiaoou Tang:
DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection. CoRR abs/1412.5661 (2014) - 2013
- [j5]Hongsheng Li, Xiaolei Huang, Lei He:
Object Matching Using a Locally Affine Invariant and Linear Programming Techniques. IEEE Trans. Pattern Anal. Mach. Intell. 35(2): 411-424 (2013) - 2012
- [j4]Shaoting Zhang, Junzhou Huang, Hongsheng Li, Dimitris N. Metaxas:
Automatic Image Annotation and Retrieval Using Group Sparsity. IEEE Trans. Syst. Man Cybern. Part B 42(3): 838-849 (2012) - [c12]Edward Kim, Hongsheng Li, Xiaolei Huang:
A hierarchical image clustering cosegmentation framework. CVPR 2012: 686-693 - 2011
- [j3]Junzhou Huang, Shaoting Zhang, Hongsheng Li, Dimitris N. Metaxas:
Composite splitting algorithms for convex optimization. Comput. Vis. Image Underst. 115(12): 1610-1622 (2011) - [j2]Hongsheng Li, Tian Shen, Xiaolei Huang:
Approximately Global Optimization for Robust Alignment of Generalized Shapes. IEEE Trans. Pattern Anal. Mach. Intell. 33(6): 1116-1131 (2011) - [j1]Tian Shen, Hongsheng Li, Xiaolei Huang:
Active Volume Models for Medical Image Segmentation. IEEE Trans. Medical Imaging 30(3): 774-791 (2011) - [c11]Hongsheng Li, Junzhou Huang, Shaoting Zhang, Xiaolei Huang:
Optimal object matching via convexification and composition. ICCV 2011: 33-40 - [c10]Tian Shen, Xiaolei Huang, Hongsheng Li, Edward Kim, Shaoting Zhang, Junzhou Huang:
A 3D Laplacian-driven parametric deformable model. ICCV 2011: 279-286 - [c9]Hongsheng Li, Tian Shen, Xiaolei Huang:
Actin Filament Segmentation Using Dynamic Programming. IPMI 2011: 411-423 - [c8]Ting Xu, Hongsheng Li, Tian Shen, Nikola Ojkic, Dimitrios Vavylonis, Xiaolei Huang:
Extraction and analysis of actin networks based on Open Active Contour models. ISBI 2011: 1334-1340 - 2010
- [c7]Hongsheng Li, Edward Kim, Xiaolei Huang, Lei He:
Object matching with a locally affine-invariant constraint. CVPR 2010: 1641-1648 - [c6]Shaoting Zhang, Junzhou Huang, Yuchi Huang, Yang Yu, Hongsheng Li, Dimitris N. Metaxas:
Automatic image annotation using group sparsity. CVPR 2010: 3312-3319 - [c5]Hongsheng Li, Tian Shen, Dimitrios Vavylonis, Xiaolei Huang:
Actin Filament Segmentation Using Spatiotemporal Active-Surface and Active-Contour Models. MICCAI (1) 2010: 86-94
2000 – 2009
- 2009
- [c4]Tian Shen, Hongsheng Li, Zhen Qian, Xiaolei Huang:
Active volume models for 3D medical image segmentation. CVPR 2009: 707-714 - [c3]Hongsheng Li, Tian Shen, Xiaolei Huang:
Global optimization for alignment of generalized shapes. CVPR 2009: 856-863 - [c2]Hongsheng Li, Tian Shen, Matthew B. Smith, Ikuko Fujiwara, Dimitrios Vavylonis, Xiaolei Huang:
Automated Actin Filament Segmentation, Tracking and TIP Elongation Measurements Based on Open Active Contour Models. ISBI 2009: 1302-1305 - [c1]Hongsheng Li, Tian Shen, Dimitrios Vavylonis, Xiaolei Huang:
Actin Filament Tracking Based on Particle Filters and Stretching Open Active Contour Models. MICCAI (1) 2009: 673-681
Coauthor Index
aka: Jimmy S. Ren
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-23 19:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint