default search action
Yiwu Zhong
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c11]Chenfan Qu, Yiwu Zhong, Chongyu Liu, Guitao Xu, Dezhi Peng, Fengjun Guo, Lianwen Jin:
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods. CVPR 2024: 10781-10790 - [c10]Duo Zheng, Shijia Huang, Lin Zhao, Yiwu Zhong, Liwei Wang:
Towards Learning a Generalist Model for Embodied Navigation. CVPR 2024: 13624-13634 - [c9]Zi-Yuan Hu, Yiwu Zhong, Shijia Huang, Michael R. Lyu, Liwei Wang:
Enhancing Temporal Modeling of Video LLMs via Time Gating. EMNLP (Findings) 2024: 2845-2856 - [c8]Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang:
Beyond Embeddings: The Promise of Visual Table in Visual Reasoning. EMNLP 2024: 6876-6911 - [i13]Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang:
Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models. CoRR abs/2403.18252 (2024) - [i12]Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin:
Generalized Tampered Scene Text Detection in the era of Generative AI. CoRR abs/2407.21422 (2024) - [i11]Zi-Yuan Hu, Yiwu Zhong, Shijia Huang, Michael R. Lyu, Liwei Wang:
Enhancing Temporal Modeling of Video LLMs via Time Gating. CoRR abs/2410.05714 (2024) - [i10]Mu Cai, Reuben Tan, Jianrui Zhang, Bocheng Zou, Kai Zhang, Feng Yao, Fangrui Zhu, Jing Gu, Yiwu Zhong, Yuzhang Shang, Yao Dou, Jaden Park, Jianfeng Gao, Yong Jae Lee, Jianwei Yang:
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models. CoRR abs/2410.10818 (2024) - 2023
- [c7]Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li:
Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations. CVPR 2023: 14825-14835 - [c6]An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Yang Wang, Jingbo Shang, Julian J. McAuley:
Learning Concise and Descriptive Attributes for Visual Recognition. ICCV 2023: 3067-3077 - [i9]Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li:
Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations. CoRR abs/2303.17839 (2023) - [i8]An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Wang, Jingbo Shang, Julian J. McAuley:
Learning Concise and Descriptive Attributes for Visual Recognition. CoRR abs/2308.03685 (2023) - [i7]An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian J. McAuley:
Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models. CoRR abs/2310.03182 (2023) - [i6]An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian J. McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang:
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation. CoRR abs/2311.07562 (2023) - [i5]Duo Zheng, Shijia Huang, Lin Zhao, Yiwu Zhong, Liwei Wang:
Towards Learning a Generalist Model for Embodied Navigation. CoRR abs/2312.02010 (2023) - 2022
- [c5]Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao:
Grounded Language-Image Pre-training. CVPR 2022: 10955-10965 - [c4]Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao:
RegionCLIP: Region-based Language-Image Pretraining. CVPR 2022: 16772-16782 - 2021
- [c3]Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li:
Learning to Generate Scene Graph from Natural Language Supervision. ICCV 2021: 1803-1814 - [c2]Jing Shi, Yiwu Zhong, Ning Xu, Yin Li, Chenliang Xu:
A Simple Baseline for Weakly-Supervised Scene Graph Generation. ICCV 2021: 16373-16382 - [i4]Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li:
Learning to Generate Scene Graph from Natural Language Supervision. CoRR abs/2109.02227 (2021) - [i3]Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao:
Grounded Language-Image Pre-training. CoRR abs/2112.03857 (2021) - [i2]Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao:
RegionCLIP: Region-based Language-Image Pretraining. CoRR abs/2112.09106 (2021) - 2020
- [c1]Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li:
Comprehensive Image Captioning via Scene Graph Decomposition. ECCV (14) 2020: 211-229 - [i1]Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li:
Comprehensive Image Captioning via Scene Graph Decomposition. CoRR abs/2007.11731 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-08 01:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint