Hengshuai Yao

2020 – today

2024
[j3]
Shahin Atakishiyev, Mohammad Salameh, Hengshuai Yao, Randy Goebel:
Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions. IEEE Access 12: 101603-101625 (2024)
2023
[c22]
Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Haiyin Piao, Zhixiao Sun, Zhiwei Yang, Randy Goebel, Bei Jiang, Yi Chang:
The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure. AAAI 2023: 7078-7086
[i29]
Hengshuai Yao:
A new Gradient TD Algorithm with only One Step-size: Convergence Rate Analysis using L-λ Smoothness. CoRR abs/2307.15892 (2023)
[i28]
Hengshuai Yao:
Baird Counterexample Is Solved: with an example of How to Debug a Two-time-scale Algorithm. CoRR abs/2308.09732 (2023)
[i27]
Xing Chen, Yijun Liu, Zhaogeng Liu, Hechang Chen, Hengshuai Yao, Yi Chang:
Careful at Estimation and Bold at Exploration. CoRR abs/2308.11348 (2023)
2022
[c21]
Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand, Martha White, Hengshuai Yao, Mohsen Rohani, Jun Luo:
Understanding and mitigating the limitations of prioritized experience replay. UAI 2022: 1561-1571
[i26]
Hengshuai Yao:
Learning to Accelerate by the Methods of Step-size Planning. CoRR abs/2204.01705 (2022)
[i25]
Xing Chen, Dongcui Diao, Hechang Chen, Hengshuai Yao, Jielong Yang, Haiyin Piao, Zhixiao Sun, Bei Jiang, Yi Chang:
Sigmoidally Preconditioned Off-policy Learning: a new exploration method for reinforcement learning. CoRR abs/2205.10047 (2022)
[i24]
Dongcui Diao, Hengshuai Yao, Bei Jiang:
Class Interference of Deep Neural Networks. CoRR abs/2211.01370 (2022)
[i23]
Hengshuai Yao:
The Vanishing Decision Boundary Complexity and the Strong First Component. CoRR abs/2211.16209 (2022)
2021
[j2]
Keith G. Mills, Mohammad Salameh, Di Niu, Fred X. Han, Seyed Saeed Changiz Rezaei, Hengshuai Yao, Wei Lu, Shuo Lian, Shangling Jui:
Exploring Neural Architecture Search Space via Deep Deterministic Sampling. IEEE Access 9: 110962-110974 (2021)
[j1]
Mi-Young Kim, Shahin Atakishiyev, Housam Khalifa Bashier Babiker, Nawshad Farruque, Randy Goebel, Osmar R. Zaïane, Mohammad H. Motallebi, Juliano Rabelo, Talat Iqbal Syed, Hengshuai Yao, Peter Chun:
A Multi-Component Framework for the Analysis and Design of Explainable Artificial Intelligence. Mach. Learn. Knowl. Extr. 3(4): 900-921 (2021)
[c20]
Shangtong Zhang, Hengshuai Yao, Shimon Whiteson:
Breaking the Deadly Triad with a Target Network. ICML 2021: 12621-12631
[i22]
Shangtong Zhang, Hengshuai Yao, Shimon Whiteson:
Breaking the Deadly Triad with a Target Network. CoRR abs/2101.08862 (2021)
[i21]
Ke Sun, Yi Liu, Yingnan Zhao, Hengshuai Yao, Shangling Jui, Linglong Kong:
Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations. CoRR abs/2109.08776 (2021)
[i20]
Shahin Atakishiyev, Mohammad Salameh, Hengshuai Yao, Randy Goebel:
Towards safe, explainable, and regulated autonomous driving. CoRR abs/2111.10518 (2021)
[i19]
Shahin Atakishiyev, Mohammad Salameh, Hengshuai Yao, Randy Goebel:
Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions. CoRR abs/2112.11561 (2021)
2020
[c19]
Shangtong Zhang, Bo Liu, Hengshuai Yao, Shimon Whiteson:
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation. ICML 2020: 11204-11213
[c18]
Jun Jin, Nhat M. Nguyen, Nazmus Sakib, Daniel Graves, Hengshuai Yao, Martin Jägersand:
Mapless Navigation among Dynamics with Social-safety-awareness: a reinforcement learning approach from 2D laser scans. ICRA 2020: 6979-6985
[c17]
Mennatullah Siam, Naren Doraiswamy, Boris N. Oreshkin, Hengshuai Yao, Martin Jägersand:
Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings. IJCAI 2020: 860-867
[i18]
Mennatullah Siam, Naren Doraiswamy, Boris N. Oreshkin, Hengshuai Yao, Martin Jägersand:
Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Inputs. CoRR abs/2001.09540 (2020)
[i17]
Vincent Liu, Adam White, Hengshuai Yao, Martha White:
Towards a practical measure of interference for reinforcement learning. CoRR abs/2007.03807 (2020)
[i16]
Jincheng Mei, Yangchen Pan, Martha White, Amir-massoud Farahmand, Hengshuai Yao:
Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities. CoRR abs/2007.09569 (2020)
[i15]
Daoming Lyu, Qi Qi, Mohammad Ghavamzadeh, Hengshuai Yao, Tianbao Yang, Bo Liu:
Variance-Reduced Off-Policy Memory-Efficient Policy Search. CoRR abs/2009.06548 (2020)

2010 – 2019

2019
[c16]
Shangtong Zhang, Hengshuai Yao:
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. AAAI 2019: 5789-5796
[c15]
Shangtong Zhang, Hengshuai Yao:
QUOTA: The Quantile Option Architecture for Reinforcement Learning. AAAI 2019: 5797-5804
[c14]
Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong:
Exploration in the Face of Parametric and Intrinsic Uncertainties. AAMAS 2019: 2117-2119
[c13]
Wei Tu, Peng Liu, Jingyu Zhao, Yi Liu, Linglong Kong, Guodong Li, Bei Jiang, Guangjian Tian, Hengshuai Yao:
M-estimation in Low-Rank Matrix Factorization: A General Framework. ICDM 2019: 568-577
[c12]
Borislav Mavrin, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yaoliang Yu:
Distributional Reinforcement Learning for Efficient Exploration. ICML 2019: 4424-4434
[c11]
Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White:
Hill Climbing on Value Estimates for Search-control in Dyna. IJCAI 2019: 3209-3215
[i14]
Borislav Mavrin, Hengshuai Yao, Linglong Kong:
Deep Reinforcement Learning with Decorrelation. CoRR abs/1903.07765 (2019)
[i13]
Nazmus Sakib, Hengshuai Yao, Hong Zhang:
Reinforcing Classical Planning for Adversary Driving Scenarios. CoRR abs/1903.08606 (2019)
[i12]
Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yaoliang Yu:
Distributional Reinforcement Learning for Efficient Exploration. CoRR abs/1905.06125 (2019)
[i11]
Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White:
Hill Climbing on Value Estimates for Search-control in Dyna. CoRR abs/1906.07791 (2019)
[i10]
Khurram Javed, Hengshuai Yao, Martha White:
Is Fast Adaptation All You Need? CoRR abs/1910.01705 (2019)
[i9]
Jun Jin, Nhat M. Nguyen, Nazmus Sakib, Daniel Graves, Hengshuai Yao, Martin Jägersand:
Mapless Navigation among Dynamics with Social-safety-awareness: a reinforcement learning approach from 2D laser scans. CoRR abs/1911.03074 (2019)
[i8]
Shangtong Zhang, Bo Liu, Hengshuai Yao, Shimon Whiteson:
Provably Convergent Off-Policy Actor-Critic with Function Approximation. CoRR abs/1911.04384 (2019)
[i7]
Mennatullah Siam, Naren Doraiswamy, Boris N. Oreshkin, Hengshuai Yao, Martin Jägersand:
One-Shot Weakly Supervised Video Object Segmentation. CoRR abs/1912.08936 (2019)
2018
[c10]
Donglai Zhu, Hao Chen, Hengshuai Yao, Masoud S. Nosrati, Peyman Yadmellat, Yunfei Zhang:
Practical Issues of Action-Conditioned Next Image Prediction. ITSC 2018: 3150-3155
[i6]
Donglai Zhu, Hao Chen, Hengshuai Yao, Masoud S. Nosrati, Peyman Yadmellat, Yunfei Zhang:
Practical Issues of Action-conditioned Next Image Prediction. CoRR abs/1802.02975 (2018)
[i5]
Donglai Zhu, Hengshuai Yao, Bei Jiang, Peng Yu:
Negative Log Likelihood Ratio Loss for Deep Neural Network Classification. CoRR abs/1804.10690 (2018)
[i4]
Shangtong Zhang, Borislav Mavrin, Linglong Kong, Bo Liu, Hengshuai Yao:
QUOTA: The Quantile Option Architecture for Reinforcement Learning. CoRR abs/1811.02073 (2018)
[i3]
Shangtong Zhang, Hao Chen, Hengshuai Yao:
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search. CoRR abs/1811.02696 (2018)
2014
[c9]
Hengshuai Yao, Csaba Szepesvári, Bernardo Ávila Pires, Xinhua Zhang:
Pseudo-MDPs and factored linear action models. ADPRL 2014: 1-9
[c8]
Hengshuai Yao, Csaba Szepesvári, Richard S. Sutton, Joseph Modayil, Shalabh Bhatnagar:
Universal Option Models. NIPS 2014: 990-998
[c7]
Chi-Hoon Lee, Hengshuai Yao, Xu He, Su Han Chan, JieYang Chang, Farzin Maghoul:
Learning to predict trending queries: classification-based. WWW (Companion Volume) 2014: 335-336
2013
[i2]
Hengshuai Yao, Dale Schuurmans:
Reinforcement Ranking. CoRR abs/1303.5988 (2013)
2012
[c6]
Hengshuai Yao, Csaba Szepesvári:
Approximate Policy Iteration with Linear Action Models. AAAI 2012: 1212-1218
[i1]
Hengshuai Yao:
Discovering and Leveraging the Most Valuable Links for Ranking. CoRR abs/1210.1626 (2012)

2000 – 2009

2009
[c5]
Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári:
LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. CDC 2009: 1181-1188
[c4]
Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári:
Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195
2008
[c3]
Hengshuai Yao, Zhi-Qiang Liu:
Preconditioned temporal difference learning. ICML 2008: 1208-1215
[c2]
Hengshuai Yao, Zhi-Qiang Liu:
Minimal Residual Approaches for Policy Evaluation in Large Sparse Markov Chains. ISAIM 2008
2006
[c1]
Hengshuai Yao, Diao Dongcui, Zengqi Sun:
Historical Temporal Difference Learning: Some Initial Results. IMSCCS (2) 2006: 678-685
