Twitter

[https://twitter.com/jeremyphoward/status/955179789282828288] - 2020-11-06 01:27:39 - public:stevetao

AI, Artificial-Intelligence, Reinforcement-Learning - 3 | id:436747 -

Intuitive RL (Reinforcement Learning): An Introduction to Advantage-Actor-Critic (A2C)

[https://weekly-geekly.github.io/articles/442522/index.html] - 2019-07-31 12:08:18 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:265905 -

Beat Atari with Deep Reinforcement Learning! (Part 1: DQN)

[https://becominghuman.ai/lets-build-an-atari-ai-part-1-dqn-df57e8ff3b26] - 2019-03-28 16:09:20 - public:stevetao

AI, Artificial-Intelligence, Deep-Learning, DQN, Machine-Learning, Reinforcement-Learning - 6 | id:243986 -

GitHub - jaromiru/AI-blog: Accompanying repository for Let's make a DQN / A3C series.

[https://github.com/jaromiru/AI-blog] - 2019-02-21 03:28:37 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:243681 -

An intro to Advantage Actor Critic methods: let’s play Sonic the Hedgehog!

[https://medium.freecodecamp.org/an-intro-to-advantage-actor-critic-methods-lets-play-sonic-the-hedgehog-86d6240171d] - 2019-02-20 17:25:48 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:243677 -

Reinforcement Learning | IntechOpen

[https://www.intechopen.com/books/reinforcement_learning] - 2018-11-28 18:10:28 - public:stevetao

AI, Artificial-Intelligence, Book, Machine-Learning, Reinforcement-Learning - 5 | id:226416 -

Dissecting Reinforcement Learning-Part.1

[https://mpatacchiola.github.io/blog/2016/12/09/dissecting-reinforcement-learning.html] - 2018-10-18 20:24:04 - public:stevetao

AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 4 | id:186912 -

How to fix reinforcement learning

[https://thegradient.pub/how-to-fix-rl/] - 2018-10-18 15:33:51 - public:stevetao

AI, Artificial-Intelligence, Flaw, Machine-Learning, Reinforcement-Learning - 5 | id:186829 -

Reinforcement learning’s foundational flaw

[https://thegradient.pub/why-rl-is-flawed/] - 2018-10-18 15:13:34 - public:stevetao

AI, Artificial-Intelligence, Flaw, Machine-Learning, Reinforcement-Learning - 5 | id:186828 -

Beyond DQN/A3C: A Survey in Advanced Reinforcement Learning

[https://towardsdatascience.com/advanced-reinforcement-learning-6d769f529eb3] - 2018-10-16 14:07:43 - public:stevetao

A3C, AI, Artificial-Intelligence, DQN, Machine-Learning, Reinforcement-Learning, Survey - 7 | id:186785 -

GitHub - ShangtongZhang/reinforcement-learning-an-introduction: Python Implementation of Reinforcement Learning: An Introduction

[https://github.com/ShangtongZhang/reinforcement-learning-an-introduction] - 2018-10-12 00:27:23 - public:stevetao

AI, Artificial-Intelligence, Machine-Learning, Python, Reinforcement-Learning - 5 | id:186716 -

Design - Reinforcement Learning Coach Documentation

[https://coach.nervanasys.com/design/index.html#network-design] - 2018-09-26 18:48:17 - public:stevetao

AI, Artificial-Intelligence, Coach, Machine-Learning, Reinforcement-Learning - 5 | id:184489 -

OpenAI Baselines: ACKTR & A2C

[https://blog.openai.com/baselines-acktr-a2c/] - 2018-09-26 18:47:29 - public:stevetao

A2C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:184488 -

Actor-Critic Methods: A3C and A2C

[https://danieltakeshi.github.io/2018/06/28/a2c-a3c/] - 2018-09-26 18:47:02 - public:stevetao

A2C, A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 6 | id:184487 -

Deep Reinforcement Learning with Online Generalized Advantage Estimation – Tom Breloff

[http://www.breloff.com/DeepRL-OnlineGAE/] - 2018-09-26 17:29:23 - public:stevetao

AI, Artificial-Intelligence, Deep-Learning, Machine-Learning, Reinforcement-Learning - 5 | id:184485 -

CS 294 Deep Reinforcement Learning, Fall 2017

[http://rail.eecs.berkeley.edu/deeprlcourse-fa17/] - 2018-09-26 13:46:09 - public:stevetao

AI, Artificial-Intelligence, Course, Deep-Learning, Machine-Learning, Reinforcement-Learning - 6 | id:184484 -

Reinforcement Learning w/ Keras+OpenAI: The Basics – Yash Patel – Medium

[https://medium.com/@yashpatel_86510/reinforcement-learning-w-keras-openai-698add10b4eb] - 2018-09-26 03:35:28 - public:stevetao

AI, Artificial-Intelligence, Keras, Machine-Learning, OpenAI, Reinforcement-Learning - 6 | id:184479 -

A Beginner's Guide to Deep Reinforcement Learning | Skymind

[https://skymind.ai/wiki/deep-reinforcement-learning] - 2018-09-24 20:55:26 - public:stevetao

AI, Artificial-Intelligence, Deep-Learning, Machine-Learning, Reinforcement-Learning - 5 | id:182896 -

GitHub - MorvanZhou/Reinforcement-learning-with-tensorflow: Simple Reinforcement learning tutorials

[https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow] - 2018-09-24 20:54:03 - public:stevetao

AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning, TensorFlow - 5 | id:182895 -

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

[https://hackernoon.com/intuitive-rl-intro-to-advantage-actor-critic-a2c-4ff545978752] - 2018-09-24 20:53:33 - public:stevetao

A2C, A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 6 | id:182894 -

Let’s make an A3C: Implementation | ヤロミル

[https://jaromiru.com/2017/03/26/lets-make-an-a3c-implementation/] - 2018-09-24 20:52:13 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:182893 -

Asynchronous Advantage Actor Critic (A3C) — Ray 0.5.2 documentation

[https://ray.readthedocs.io/en/latest/example-a3c.html] - 2018-09-24 20:50:16 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:182892 -

Deep Reinforcement Learning: Playing CartPole through Asynchronous Advantage Actor Critic (A3C)…

[https://medium.com/tensorflow/deep-reinforcement-learning-playing-cartpole-through-asynchronous-advantage-actor-critic-a3c-7eab2eea5296] - 2018-09-24 20:48:40 - public:stevetao

A3C, AI, Artificial-Intelligence, Deep-Learning, Machine-Learning, Reinforcement-Learning - 6 | id:182891 -

Reinforcement learning with the A3C algorithm

[https://cgnicholls.github.io/reinforcement-learning/2017/03/27/a3c.html] - 2018-09-24 20:48:10 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:182890 -

Using Asynchronous Method For Deep Reinforcement Learning

[https://www.analyticsindiamag.com/using-asynchronous-method-for-deep-reinforcement-learning/] - 2018-09-24 20:15:48 - public:stevetao

A3C, AI, Artificial-Intelligence, Deep-Learning, Machine-Learning, Reinforcement-Learning - 6 | id:182889 -

Reinforcement Learning Course by David Silver

[http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html] - 2018-09-20 18:51:31 - public:stevetao

AI, Artificial-Intelligence, Course, Machine-Learning, Reinforcement-Learning - 5 | id:182810 -

Reinforcement Learning: An Introduction

[http://incompleteideas.net/book/ebook/the-book.html] - 2018-08-21 20:03:31 - public:stevetao

AI, Artificial-Intelligence, Book, Free, Machine-Learning, Reinforcement-Learning - 6 | id:178023 -

Reinforcement Learning: An Introduction

[http://incompleteideas.net/book/ebook/] - 2018-08-21 19:56:54 - public:stevetao

AI, Artificial-Intelligence, Book, Machine-Learning, Reinforcement-Learning, Textbook - 6 | id:178283 -

Simple Reinforcement Learning with Tensorflow Part 0: Q-Learning with Tables and Neural Networks

[https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0] - 2018-08-21 19:54:49 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Neural-Network, Reinforcement-Learning, TensorFlow - 7 | id:178290 -

GitHub - MorvanZhou/pytorch-A3C: Simple A3C implementation with pytorch + multiprocessing

[https://github.com/MorvanZhou/pytorch-A3C] - 2018-08-21 19:20:56 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Python, Reinforcement-Learning - 6 | id:178289 -

Simple Reinforcement Learning with Tensorflow Part 8: Asynchronous Actor-Critic Agents (A3C)

[https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-8-asynchronous-actor-critic-agents-a3c-c88f72a5e9f2] - 2018-08-21 13:19:22 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning, TensorFlow - 6 | id:178288 -

asynchronous advantage actor critic (a3c) - Google Search

[https://www.google.com/search?q=asynchronous+advantage+actor+critic+(a3c)&rlz=1C1GGRV_enUS801US801&oq=asynchronous+advantage+actorcritic&aqs=chrome.2.69i57j0l5.3095j0j8&sourceid=chrome&ie=UTF-8] - 2018-08-17 20:18:04 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:178287 -

Asynchronous Agent Actor Critic (A3C) – hdmetor's blog

[https://hdmetor.github.io/a3c-explained/] - 2018-08-17 20:17:54 - public:stevetao

A3C, AI, Artificial-Intelligence, Machine-Learning, Reinforcement-Learning - 5 | id:178286 -

yabs.io

Yet Another Bookmarks Service

Viewing stevetao's Bookmarks

Jeremy Howard on Twitter: “What's the best way to learn about advantage-actor-critic (A2C) reinforcement learning? A comic of course! Talented @fastdotai student @rgilman33 has you covered: https://t.co/N4uPs6OeqQ https://t.co/ZWhe0PxArV“ / Twitter

Intuitive RL (Reinforcement Learning): An Introduction to Advantage-Actor-Critic (A2C)

Beat Atari with Deep Reinforcement Learning! (Part 1: DQN)

GitHub - jaromiru/AI-blog: Accompanying repository for Let's make a DQN / A3C series.

An intro to Advantage Actor Critic methods: let’s play Sonic the Hedgehog!

Reinforcement Learning | IntechOpen

Dissecting Reinforcement Learning-Part.1

How to fix reinforcement learning

Reinforcement learning’s foundational flaw

Beyond DQN/A3C: A Survey in Advanced Reinforcement Learning

GitHub - ShangtongZhang/reinforcement-learning-an-introduction: Python Implementation of Reinforcement Learning: An Introduction

Design - Reinforcement Learning Coach Documentation

OpenAI Baselines: ACKTR & A2C

Actor-Critic Methods: A3C and A2C

Deep Reinforcement Learning with Online Generalized Advantage Estimation – Tom Breloff

CS 294 Deep Reinforcement Learning, Fall 2017

Reinforcement Learning w/ Keras+OpenAI: The Basics – Yash Patel – Medium

A Beginner's Guide to Deep Reinforcement Learning | Skymind

GitHub - MorvanZhou/Reinforcement-learning-with-tensorflow: Simple Reinforcement learning tutorials

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

Let’s make an A3C: Implementation | ヤロミル

Asynchronous Advantage Actor Critic (A3C) — Ray 0.5.2 documentation

Deep Reinforcement Learning: Playing CartPole through Asynchronous Advantage Actor Critic (A3C)…

Reinforcement learning with the A3C algorithm

Using Asynchronous Method For Deep Reinforcement Learning

Reinforcement Learning Course by David Silver

Reinforcement Learning: An Introduction

Reinforcement Learning: An Introduction

Simple Reinforcement Learning with Tensorflow Part 0: Q-Learning with Tables and Neural Networks

GitHub - MorvanZhou/pytorch-A3C: Simple A3C implementation with pytorch + multiprocessing

Simple Reinforcement Learning with Tensorflow Part 8: Asynchronous Actor-Critic Agents (A3C)

asynchronous advantage actor critic (a3c) - Google Search

Asynchronous Agent Actor Critic (A3C) – hdmetor's blog

Viewing 1 - 33, 33 links out of 33 links, page: 1

Follow Tags

Export: