November Deep Learning Class, Lesson 9: Reinforcement Learning and DQN



Reinforcement Learning and DQN

Achievements of Reinforcement Learning

• Learned to play backgammon at the level of the world's best players (Tesauro 1995)
• Learned acrobatic helicopter autopilots (Ng, Abbeel, Coates et al. 2006+)
• Widely used in the placement and selection of advertisements on the web (e.g. A/B tests)
• Used to make strategic decisions in Jeopardy! (IBM's Watson 2011)
• Achieved human-level performance on Atari games from pixel-level visual input, in conjunction with deep learning (Google DeepMind 2015)
• In all these cases, performance was better than could be obtained by any other method, and was obtained without human instruction



Original article: https://www.cnblogs.com/koocn/p/7757710.html
