default search action
Wanrong Zhu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c16]Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang:
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View. AAAI 2024: 18924-18933 - [c15]Yujie Lu, Pan Lu, Zhiyu Chen, Wanrong Zhu, Xin Wang, William Yang Wang:
Multimodal Procedural Planning via Dual Text-Image Prompting. EMNLP (Findings) 2024: 10931-10954 - [i27]Wanrong Zhu, Zhipeng Lou, Ziyang Wei, Wei Biao Wu:
High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization. CoRR abs/2401.09346 (2024) - [i26]Wanrong Zhu, Jennifer Healey, Ruiyi Zhang, William Yang Wang, Tong Sun:
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models. CoRR abs/2404.15271 (2024) - [i25]An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian J. McAuley, Jianfeng Gao, Lijuan Wang:
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs. CoRR abs/2404.16375 (2024) - [i24]Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, Jiachen Li, Yue Fan, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang:
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos. CoRR abs/2406.08407 (2024) - [i23]Zekun Li, Xianjun Yang, Kyuri Choi, Wanrong Zhu, Ryan Hsieh, HyeonJung Kim, Jin Hyuk Lim, Sungyoung Ji, Byungju Lee, Xifeng Yan, Linda Ruth Petzold, Stephen D. Wilson, Woosang Lim, William Yang Wang:
MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension. CoRR abs/2407.04903 (2024) - 2023
- [c14]Wanrong Zhu, An Yan, Yujie Lu, Wenda Xu, Xin Wang, Miguel P. Eckstein, William Yang Wang:
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation. EACL (Findings) 2023: 78-92 - [c13]Wanrong Zhu, Xin Wang, An Yan, Miguel P. Eckstein, William Yang Wang:
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation. EACL (Findings) 2023: 93-105 - [c12]Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Wang, Miguel P. Eckstein, William Wang:
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation. EMNLP 2023: 11113-11122 - [c11]Yujie Lu, Weixi Feng, Wanrong Zhu, Wenda Xu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Neuro-Symbolic Procedural Planning with Commonsense Prompting. ICLR 2023 - [c10]Yonatan Bitton, Hritik Bansal, Jack Hessel, Rulin Shao, Wanrong Zhu, Anas Awadalla, Josh Gardner, Rohan Taori, Ludwig Schmidt:
VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models. NeurIPS 2023 - [c9]Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang:
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models. NeurIPS 2023 - [c8]Xinyi Wang, Wanrong Zhu, Michael Saxon, Mark Steyvers, William Yang Wang:
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning. NeurIPS 2023 - [c7]Wanrong Zhu, Jack Hessel, Anas Awadalla, Samir Yitzhak Gadre, Jesse Dodge, Alex Fang, Youngjae Yu, Ludwig Schmidt, William Yang Wang, Yejin Choi:
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text. NeurIPS 2023 - [i22]Xinyi Wang, Wanrong Zhu, William Yang Wang:
Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning. CoRR abs/2301.11916 (2023) - [i21]Wanrong Zhu, Jack Hessel, Anas Awadalla, Samir Yitzhak Gadre, Jesse Dodge, Alex Fang, Youngjae Yu, Ludwig Schmidt, William Yang Wang, Yejin Choi:
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text. CoRR abs/2304.06939 (2023) - [i20]Yujie Lu, Pan Lu, Zhiyu Chen, Wanrong Zhu, Xin Eric Wang, William Yang Wang:
Multimodal Procedural Planning via Dual Text-Image Prompting. CoRR abs/2305.01795 (2023) - [i19]Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation. CoRR abs/2305.11317 (2023) - [i18]Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang:
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models. CoRR abs/2305.15393 (2023) - [i17]Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang:
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View. CoRR abs/2307.06082 (2023) - [i16]Ziyang Wei, Wanrong Zhu, Wei Biao Wu:
Weighted Averaged Stochastic Gradient Descent: Asymptotic Normality and Optimality. CoRR abs/2307.06915 (2023) - [i15]Anas Awadalla, Irena Gao, Josh Gardner, Jack Hessel, Yusuf Hanafy, Wanrong Zhu, Kalyani Marathe, Yonatan Bitton, Samir Yitzhak Gadre, Shiori Sagawa, Jenia Jitsev, Simon Kornblith, Pang Wei Koh, Gabriel Ilharco, Mitchell Wortsman, Ludwig Schmidt:
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models. CoRR abs/2308.01390 (2023) - [i14]Yonatan Bitton, Hritik Bansal, Jack Hessel, Rulin Shao, Wanrong Zhu, Anas Awadalla, Josh Gardner, Rohan Taori, Ludwig Schmidt:
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use. CoRR abs/2308.06595 (2023) - [i13]An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian J. McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang:
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation. CoRR abs/2311.07562 (2023) - 2022
- [j1]Wanrong Zhu, Zhipeng Lou, Wei Biao Wu:
Beyond Sub-Gaussian Noises: Sharp Concentration Analysis for Stochastic Gradient Descent. J. Mach. Learn. Res. 23: 46:1-46:22 (2022) - [c6]Wanrong Zhu, Bo Pang, Ashish V. Thapliyal, William Yang Wang, Radu Soricut:
End-to-end Dense Video Captioning as Sequence Generation. COLING 2022: 5651-5665 - [c5]Yujie Lu, Wanrong Zhu, Xin Wang, Miguel P. Eckstein, William Yang Wang:
Imagination-Augmented Natural Language Understanding. NAACL-HLT 2022: 4392-4402 - [c4]Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Wang, Qi Wu, Miguel P. Eckstein, William Yang Wang:
Diagnosing Vision-and-Language Navigation: What Really Matters. NAACL-HLT 2022: 5981-5993 - [i12]Wanrong Zhu, Bo Pang, Ashish V. Thapliyal, William Yang Wang, Radu Soricut:
End-to-end Dense Video Captioning as Sequence Generation. CoRR abs/2204.08121 (2022) - [i11]Yujie Lu, Wanrong Zhu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Imagination-Augmented Natural Language Understanding. CoRR abs/2204.08535 (2022) - [i10]Yujie Lu, Weixi Feng, Wanrong Zhu, Wenda Xu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Neuro-Symbolic Causal Language Planning with Commonsense Prompting. CoRR abs/2206.02928 (2022) - [i9]Wanrong Zhu, An Yan, Yujie Lu, Wenda Xu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang:
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation. CoRR abs/2210.03765 (2022) - [i8]An Yan, Jiacheng Li, Wanrong Zhu, Yujie Lu, William Yang Wang, Julian J. McAuley:
CLIP also Understands Text: Prompting CLIP for Phrase Understanding. CoRR abs/2210.05836 (2022) - 2021
- [c3]Wanrong Zhu, Xin Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation. EACL 2021: 1207-1221 - [i7]Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel P. Eckstein, William Yang Wang:
Diagnosing Vision-and-Language Navigation: What Really Matters. CoRR abs/2103.16561 (2021) - [i6]Wanrong Zhu, Xin Eric Wang, An Yan, Miguel P. Eckstein, William Yang Wang:
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation. CoRR abs/2106.05970 (2021) - 2020
- [c2]Wanrong Zhu, Xin Wang, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations. EMNLP (1) 2020: 8806-8811 - [i5]Wanrong Zhu, Xi Chen, Wei Biao Wu:
A Fully Online Approach for Covariance Matrices Estimation of Stochastic Gradient Descent Solutions. CoRR abs/2002.03979 (2020) - [i4]Wanrong Zhu, Xin Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation. CoRR abs/2007.00229 (2020) - [i3]Wanrong Zhu, Xin Eric Wang, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang:
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations. CoRR abs/2010.03644 (2020)
2010 – 2019
- 2019
- [c1]Zhiting Hu, Haoran Shi, Bowen Tan, Wentao Wang, Zichao Yang, Tiancheng Zhao, Junxian He, Lianhui Qin, Di Wang, Xuezhe Ma, Zhengzhong Liu, Xiaodan Liang, Wanrong Zhu, Devendra Singh Sachan, Eric P. Xing:
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation. ACL (3) 2019: 159-164 - [i2]Wanrong Zhu, Zhiting Hu, Eric P. Xing:
Text Infilling. CoRR abs/1901.00158 (2019) - 2018
- [i1]Zhiting Hu, Haoran Shi, Zichao Yang, Bowen Tan, Tiancheng Zhao, Junxian He, Wentao Wang, Xingjiang Yu, Lianhui Qin, Di Wang, Xuezhe Ma, Zhengzhong Liu, Xiaodan Liang, Wanrong Zhu, Devendra Singh Sachan, Eric P. Xing:
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation. CoRR abs/1809.00794 (2018)
Coauthor Index
aka: Xin Eric Wang
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-09 13:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint