default search action
Yao Liu 0009
Person information
- affiliation: Stanford University, CA, USA
- affiliation: Peking University, Beijing, China
Other persons with the same name
- Yao Liu — disambiguation page
- Yao Liu 0001 — Rutgers University, New Brunswick, USA (and 2 more)
- Yao Liu 0002 — National University of Singapore
- Yao Liu 0003 — Delft University of Technology, Biomedical Electronics Group, The Netherlands
- Yao Liu 0004 — Wuhan University, School of Remote Sensing Information Engineering, China
- Yao Liu 0005 — Central South University, School of Information Science and Engineering, Changsha, China (and 2 more)
- Yao Liu 0006 — City University of Hong Kong, Department of Electronic Engineering, Hong Kong
- Yao Liu 0007 — University of South Florida, Department of Computer Science and Engineering, Tampa, FL, USA (and 1 more)
- Yao Liu 0008 — University of California, San Diego, USA
- Yao Liu 0010 — Xidian University, Xi'an, China
- Yao Liu 0011 — Nanjing University, Nanjing, China
- Yao Liu 0012 — China Aero Geophysical Survey and Remote Sensing Center for Land and Resources, Beijing, China
- Yao Liu 0013 — University of Pavia, Italy
- Yao Liu 0014 — Alibaba Group, Hangzhou, China (and 1 more)
- Yao Liu 0015 — Texas A&M University, College Station, TX, USA
- Yao Liu 0016 — Southwest University, Chong Qing, China
- Yao Liu 0017 — East China Normal University, Faculty of Information, College of Computer Science and Software Engineering, Shanghai Key Laboratory of Multidimensional Information Processing, Shanghai, China (and 1 more)
- Yao Liu 0018 — University of Science and Technology of China, Hefei, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Sherry Ruan, Allen Nie, William Steenbergen, Jiayu He, J. Q. Zhang, Meng Guo, Yao Liu, Kyle Dang Nguyen, Catherine Y. Wang, Rui Ying, James A. Landay, Emma Brunskill:
Reinforcement learning tutor better supported lower performers in a math task. Mach. Learn. 113(5): 3023-3048 (2024) - [c12]Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. ICLR 2024 - [c11]Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor:
Learning the Target Network in Function Space. ICML 2024 - [i18]Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor:
Learning the Target Network in Function Space. CoRR abs/2406.01838 (2024) - [i17]Jesse Zhang, Minho Heo, Zuxin Liu, Erdem Biyik, Joseph J. Lim, Yao Liu, Rasool Fakoor:
EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data. CoRR abs/2406.17768 (2024) - [i16]Ke Yang, Yao Liu, Sapana Chaudhary, Rasool Fakoor, Pratik Chaudhari, George Karypis, Huzefa Rangwala:
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents. CoRR abs/2410.13825 (2024) - [i15]Zhepeng Cen, Yao Liu, Siliang Zeng, Pratik Chaudhari, Huzefa Rangwala, George Karypis, Rasool Fakoor:
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens. CoRR abs/2410.14655 (2024) - 2023
- [c10]Yao Liu, Pratik Chaudhari, Rasool Fakoor:
Budgeting Counterfactual for Offline RL. NeurIPS 2023 - [c9]Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. NeurIPS 2023 - [i14]Sherry Ruan, Allen Nie, William Steenbergen, Jiayu He, JQ Zhang, Meng Guo, Yao Liu, Kyle Dang Nguyen, Catherine Y. Wang, Rui Ying, James A. Landay, Emma Brunskill:
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task. CoRR abs/2304.04933 (2023) - [i13]Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. CoRR abs/2306.17750 (2023) - [i12]Yao Liu, Pratik Chaudhari, Rasool Fakoor:
Budgeting Counterfactual for Offline RL. CoRR abs/2307.06328 (2023) - [i11]Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. CoRR abs/2310.05905 (2023) - 2022
- [c8]Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline policy optimization with eligible actions. UAI 2022: 1253-1263 - [i10]Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline Policy Optimization with Eligible Actions. CoRR abs/2207.00632 (2022) - 2021
- [b1]Yao Liu:
Adaptive and efficient batch reinforcement learning algorithms. Stanford University, USA, 2021 - 2020
- [c7]Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo A. Celi, Emma Brunskill, Finale Doshi-Velez:
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions. ICML 2020: 3658-3667 - [c6]Yao Liu, Pierre-Luc Bacon, Emma Brunskill:
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling. ICML 2020: 6184-6193 - [c5]Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration. NeurIPS 2020 - [i9]Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo Anthony Celi, Emma Brunskill, Finale Doshi-Velez:
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions. CoRR abs/2002.03478 (2020) - [i8]Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Provably Good Batch Reinforcement Learning Without Great Exploration. CoRR abs/2007.08202 (2020)
2010 – 2019
- 2019
- [c4]Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez:
Combining parametric and nonparametric models for off-policy evaluation. ICML 2019: 2366-2375 - [c3]Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Off-Policy Policy Gradient with Stationary Distribution Correction. UAI 2019: 1180-1190 - [i7]Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Off-Policy Policy Gradient with State Distribution Correction. CoRR abs/1904.08473 (2019) - [i6]Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez:
Combining Parametric and Nonparametric Models for Off-Policy Evaluation. CoRR abs/1905.05787 (2019) - [i5]Yao Liu, Pierre-Luc Bacon, Emma Brunskill:
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling. CoRR abs/1910.06508 (2019) - [i4]Benjamin Petit, Loren Amdahl-Culleton, Yao Liu, Jimmy Smith, Pierre-Luc Bacon:
All-Action Policy Gradient Methods: A Numerical Integration Approach. CoRR abs/1910.09093 (2019) - 2018
- [c2]Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo A. Faisal, Finale Doshi-Velez, Emma Brunskill:
Representation Balancing MDPs for Off-policy Policy Evaluation. NeurIPS 2018: 2649-2658 - [i3]Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill:
Representation Balancing MDPs for Off-Policy Policy Evaluation. CoRR abs/1805.09044 (2018) - [i2]Yao Liu, Emma Brunskill:
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms. CoRR abs/1805.09045 (2018) - [i1]Aniruddh Raghu, Omer Gottesman, Yao Liu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill:
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters. CoRR abs/1807.01066 (2018) - 2016
- [c1]Yao Liu, Zhaohan Guo, Emma Brunskill:
PAC Continuous State Online Multitask Reinforcement Learning with Identification. AAMAS 2016: 438-446
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-13 01:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint