Yao Liu 0009

Person information

affiliation: Stanford University, CA, USA
affiliation: Peking University, Beijing, China

2020 – today

2024
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/ml/RuanNSHZGLNWYLB24
Sherry Ruan, Allen Nie, William Steenbergen, Jiayu He, J. Q. Zhang, Meng Guo, Yao Liu, Kyle Dang Nguyen, Catherine Y. Wang, Rui Ying, James A. Landay, Emma Brunskill:
Reinforcement learning tutor better supported lower performers in a math task. Mach. Learn. 113(5): 3023-3048 (2024)
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuZA0ZSF24
Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. ICLR 2024
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/icml/Asadi0SYF24
Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor:
Learning the Target Network in Function Space. ICML 2024
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01838
Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor:
Learning the Target Network in Function Space. CoRR abs/2406.01838 (2024)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17768
Jesse Zhang, Minho Heo, Zuxin Liu, Erdem Biyik, Joseph J. Lim, Yao Liu, Rasool Fakoor:
EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data. CoRR abs/2406.17768 (2024)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-13825
Ke Yang, Yao Liu, Sapana Chaudhary, Rasool Fakoor, Pratik Chaudhari, George Karypis, Huzefa Rangwala:
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents. CoRR abs/2410.13825 (2024)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14655
Zhepeng Cen, Yao Liu, Siliang Zeng, Pratik Chaudhari, Huzefa Rangwala, George Karypis, Rasool Fakoor:
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens. CoRR abs/2410.14655 (2024)
2023
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/nips/0009CF23
Yao Liu, Pratik Chaudhari, Rasool Fakoor:
Budgeting Counterfactual for Offline RL. NeurIPS 2023
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/nips/AsadiS0GF23
Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. NeurIPS 2023
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04933
Sherry Ruan, Allen Nie, William Steenbergen, Jiayu He, JQ Zhang, Meng Guo, Yao Liu, Kyle Dang Nguyen, Catherine Y. Wang, Rui Ying, James A. Landay, Emma Brunskill:
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task. CoRR abs/2304.04933 (2023)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-17750
Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. CoRR abs/2306.17750 (2023)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-06328
Yao Liu, Pratik Chaudhari, Rasool Fakoor:
Budgeting Counterfactual for Offline RL. CoRR abs/2307.06328 (2023)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05905
Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. CoRR abs/2310.05905 (2023)
2022
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/uai/0009FB22
Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline policy optimization with eligible actions. UAI 2022: 1253-1263
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00632
Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline Policy Optimization with Eligible Actions. CoRR abs/2207.00632 (2022)
2021
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/us/Liu21e
Yao Liu:
Adaptive and efficient batch reinforcement learning algorithms. Stanford University, USA, 2021
2020
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/icml/GottesmanF0PCBD20
Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo A. Celi, Emma Brunskill, Finale Doshi-Velez:
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions. ICML 2020: 3658-3667
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/icml/0009BB20
Yao Liu, Pierre-Luc Bacon, Emma Brunskill:
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling. ICML 2020: 6184-6193
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/nips/0009SAB20
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration. NeurIPS 2020
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-03478
Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo Anthony Celi, Emma Brunskill, Finale Doshi-Velez:
Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions. CoRR abs/2002.03478 (2020)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-08202
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Provably Good Batch Reinforcement Learning Without Great Exploration. CoRR abs/2007.08202 (2020)

2010 – 2019

2019
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/icml/GottesmanLSBD19
Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez:
Combining parametric and nonparametric models for off-policy evaluation. ICML 2019: 2366-2375
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/uai/LiuSAB19
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Off-Policy Policy Gradient with Stationary Distribution Correction. UAI 2019: 1180-1190
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-08473
Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill:
Off-Policy Policy Gradient with State Distribution Correction. CoRR abs/1904.08473 (2019)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-05787
Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez:
Combining Parametric and Nonparametric Models for Off-Policy Evaluation. CoRR abs/1905.05787 (2019)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-06508
Yao Liu, Pierre-Luc Bacon, Emma Brunskill:
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling. CoRR abs/1910.06508 (2019)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-09093
Benjamin Petit, Loren Amdahl-Culleton, Yao Liu, Jimmy Smith, Pierre-Luc Bacon:
All-Action Policy Gradient Methods: A Numerical Integration Approach. CoRR abs/1910.09093 (2019)
2018
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuGRKFDB18
Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo A. Faisal, Finale Doshi-Velez, Emma Brunskill:
Representation Balancing MDPs for Off-policy Policy Evaluation. NeurIPS 2018: 2649-2658
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-09044
Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill:
Representation Balancing MDPs for Off-Policy Policy Evaluation. CoRR abs/1805.09044 (2018)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-09045
Yao Liu, Emma Brunskill:
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms. CoRR abs/1805.09045 (2018)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-01066
Aniruddh Raghu, Omer Gottesman, Yao Liu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill:
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters. CoRR abs/1807.01066 (2018)
2016
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/atal/LiuGB16
Yao Liu, Zhaohan Guo, Emma Brunskill:
PAC Continuous State Online Multitask Reinforcement Learning with Identification. AAMAS 2016: 438-446
