[B! ai][rl] goinger's bookmarks

goinger id:goinger

goinger's bookmarks tagged ai and rl (24)

autonweb:14686 [Auton Lab]
goinger 2009/09/26
ml

ai

rl
Link
Reinforcement Learning Simulator
Reinforcement Learning - Simulator Introduction The motivation behind this work is to simulate and animate Reinforcement Learning algorithms in order to better understand their behavior, which will enable enhancements to these algorithms. Visualization is a better way of presenting new concepts to others. Our perception about animating these algorithms is to enable the students to get an
goinger 2009/09/26
ai

ml

rl
Link
RLApplications.bib
goinger 2009/09/08
rl

ai
Link
RL-Glue 3.0 Technical Details
goinger 2009/09/08
rl

rl-glue

document

ai
Link
http://www.cs.ualberta.ca/~sutton/Talks/RL-Tutorial/sld001.htm
goinger 2009/09/08
rl

ai
Link
RL FAQ
Reinforcement Learning FAQ: Frequently Asked Questions about Reinforcement Learning Edited by Rich Sutton Initiated 8/13/01 Last updated 2/4/04 I get many questions about reinforcement learning -- how to think about it and how to do it successfully in practice. This FAQ is an attempt to pull together in one place some of the more common questions and answers. I have been free with my opinio
goinger 2009/09/08
rl

faq

ai
Link
Sign in - Google Accounts
goinger 2009/09/07
rl

library

ai
Link
Temporal difference learning - Wikipedia
Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods.[1] While Monte Carlo methods only adjust their estimates once the final ou
goinger 2009/09/07
rl

ai

wikipedia
Link
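As a rough illustration of the bootstrapping idea in the Temporal difference learning excerpt above, here is a minimal TD(0) sketch. The three-state deterministic chain, its reward of 1.0 on the final transition, and the names `td0_chain`, `alpha` and `gamma` are assumptions made for this example; none of them come from the bookmarked page.

```python
def td0_chain(n_states=3, episodes=200, alpha=0.5, gamma=0.9):
    """Estimate state values on a hypothetical deterministic chain with TD(0).

    States 0..n_states-1 are visited left to right; index n_states is a
    terminal state whose value stays at 0. Only the last step pays reward 1.0.
    """
    values = [0.0] * (n_states + 1)
    for _ in range(episodes):
        state = 0
        while state < n_states:
            next_state = state + 1
            reward = 1.0 if next_state == n_states else 0.0
            # Bootstrapped target: observed reward plus the discounted
            # *current estimate* of the next state's value.
            target = reward + gamma * values[next_state]
            # Update immediately, without waiting for the episode to end.
            values[state] += alpha * (target - values[state])
            state = next_state
    return values[:n_states]


if __name__ == "__main__":
    print(td0_chain())
```

With the default arguments the estimates should approach 0.81, 0.9 and 1.0 for states 0, 1 and 2, because each value is the discounted value of its successor.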
A Short Introduction To Some RL Algorithms - Hado van Hasselt
goinger 2009/09/07
rl

algorithm

ai
Link
Hado van Hasselt - Reinforcement Learning Implementations
goinger 2009/09/07
rl

ai
Link
Reinforcement learning - Wikipedia
Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions in a dynamic environment in order to maximize a reward signal. Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Q-learning at its simplest stores data in tabl
goinger 2009/09/07
rl

ai
Link
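The Reinforcement learning excerpt above mentions that Q-learning, at its simplest, stores data in a table. A minimal sketch of that tabular form follows. The four-state corridor, the `step` dynamics, the reward of 1.0 at the goal and the hyperparameters are all hypothetical choices for illustration. The behaviour policy is uniformly random, which Q-learning can tolerate because its update is off-policy.

```python
import random

N_STATES = 4          # states 0..3; state 3 is the terminal goal (assumed layout)
ACTIONS = (-1, +1)    # index 0 = step left, index 1 = step right


def step(state, action_index):
    """Hypothetical corridor dynamics: reward 1.0 only on reaching the goal."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward


def q_learning(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    rng = random.Random(seed)
    # The table: one row per state, one column per action.
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        while state != N_STATES - 1:
            action = rng.randrange(2)  # uniformly random behaviour policy
            next_state, reward = step(state, action)
            # Off-policy target: best estimated value in the next state,
            # regardless of which action will actually be taken there.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


if __name__ == "__main__":
    for state, row in enumerate(q_learning()):
        print(state, row)
```

Reading the greedy action from each row (the column with the larger value) gives the learned policy; in this corridor that should be "step right" in every non-terminal state.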
Google Code Archive - Long-term storage for Google Code Project Hosting.
goinger 2009/09/07
rl

library

ai
Link
RL-Library
The mission of the Reinforcement-Learning Library (RL-Library) is to create a centralized place for the reinforcement-learning community to share their RL-Glue compatible software projects. The RL-Library serves two distinct needs. First, to provide standardized, trusted implementations of agents and environments from the reinforcement-learning literature. Second, as a repository for other RL-Glue
goinger 2009/09/07
rl

ai
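Link
Researcher Academic Information Database
Intelligent informatics, soft computing, life/health/medical informatics (Keywords: intelligent information processing, learning and discovery, search algorithms, neural networks, genetic algorithms, probabilistic information processing, bioinformatics, computer simulation, biological information, brain-inspired information processing)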
goinger 2009/09/06
ai

rl
Link
Amazon.co.jp: 強化学習 (Reinforcement Learning): Richard S. Sutton (author), Andrew G. Barto (author), 三上貞芳 (translator): Books
goinger 2009/09/06
rl

ai
Link
Murata Laboratory page
Sorry, the page you are looking for could not be found. It may have been moved or deleted. Welcome to the Murata Laboratory website! Bldg. 63, 6F-18, 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Noboru Murata Laboratory, Department of Electrical Engineering and Bioscience, Graduate School of Advanced Science and Engineering, Waseda University. Email: noboru.murata[at]eb.waseda.ac.jp
goinger 2009/09/06
ai

rl
Link
System Management
goinger 2009/09/05
ai

rl
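Link
Hatena Blog | Create a free blog
Things an engineer dad and his 4-year-old daughter made in 2024. Now that my daughter is 4, we increasingly make things together rather than me making things for her. We did a lot of small making projects again this year, so, since it is the end of the year, I thought I would introduce them all at once. This article is the 12/07 entry of the 子育てエンジニア Advent Calendar 2024…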
goinger 2009/09/05
nn

ai

rl
Link
http://nao.s164.xrea.com/RL-FAQ-j.html
goinger 2009/09/05
rl

ai
Link
A. Perez-Uribe Introduction to Reinforcement learning
Herein, we present a brief introduction to reinforcement learning techniques. As an example we describe the SARSA algorithm, and its use in a maze learning problem. Finally, the corresponding C code is available for downloading. Contents : Reinforcement learning SARSA algorithm Maze learning Temporal credit assignment C code Reinforcement learning is a synonym of learning by interaction. During le
goinger 2009/09/04
rl

ai
Link
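The Perez-Uribe excerpt above describes SARSA applied to a maze learning problem, with C code on the bookmarked page. The sketch below is not that code; it is an independent, minimal Python illustration of SARSA on a made-up 4x4 maze. The layout in `MAZE`, the reward of 1.0 at the goal, the epsilon-greedy rule with random tie-breaking and all hyperparameters are assumptions.

```python
import random

MAZE = ["S.#.",
        ".#..",
        "...#",
        "#..G"]  # hypothetical layout: S start, G goal, # wall, . free cell
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def find(symbol):
    """Return the (row, col) of the first cell containing `symbol`."""
    for r, row in enumerate(MAZE):
        if symbol in row:
            return (r, row.index(symbol))


def step(state, action):
    """Move unless blocked by a wall or the border; reward 1.0 at the goal."""
    r, c = state[0] + MOVES[action][0], state[1] + MOVES[action][1]
    if not (0 <= r < len(MAZE) and 0 <= c < len(MAZE[0])) or MAZE[r][c] == "#":
        r, c = state  # blocked: stay in place
    return (r, c), (1.0 if MAZE[r][c] == "G" else 0.0)


def choose(q, state, epsilon, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.randrange(len(MOVES))
    best = max(q[state])
    return rng.choice([a for a in range(len(MOVES)) if q[state][a] == best])


def sarsa(episodes=300, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(MOVES)
         for r in range(len(MAZE)) for c in range(len(MAZE[0]))}
    start, goal = find("S"), find("G")
    for _ in range(episodes):
        state = start
        action = choose(q, state, epsilon, rng)
        while state != goal:
            next_state, reward = step(state, action)
            next_action = choose(q, next_state, epsilon, rng)
            # On-policy target: uses the action actually selected next
            # (state, action, reward, next state, next action).
            target = reward + gamma * q[next_state][next_action]
            q[state][action] += alpha * (target - q[state][action])
            state, action = next_state, next_action
    return q


if __name__ == "__main__":
    q = sarsa()
    print(q[find("S")])
```

The only difference from a Q-learning update is the target: SARSA bootstraps from the value of the action the agent really takes next, so exploratory moves influence the learned values.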