So, the process is similar to a one-to-many RNN?

learns much more efficiently than model-free methods
iteratively gets better
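The "iteratively gets better" note suggests the generic model-based loop: collect trials, refit the predictive model on all data so far, repeat. A minimal sketch of that loop, assuming hypothetical `collect_trials` and `fit_model` callables (not the authors' exact pipeline):

```python
def improve_iteratively(n_rounds, collect_trials, fit_model):
    """Generic model-based improvement loop (sketch, assumed interface):
    each round gathers new experience (random at first, model-guided
    once a model exists) and refits the model on the growing dataset."""
    dataset, model = [], None
    for _ in range(n_rounds):
        dataset += collect_trials(model)  # gather new trials
        model = fit_model(dataset)        # retrain on everything so far
    return model

# Toy stand-ins: "trials" are numbers, the "model" is their mean.
trials = iter(range(10))
collect = lambda model: [next(trials), next(trials)]
fit = lambda data: sum(data) / len(data)
final_model = improve_iteratively(3, collect, fit)
```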






fewer than 300 trials (~25 min of robot time per task)
visual prediction from the observation




During training of the model, there is no reward; only some random motions are programmed. At task time, there is a reward function: basically, try to move a pixel to the goal position.
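That task-time reward can be sketched as a cost on predicted pixel positions: for each candidate action, the learned visual model predicts where the tracked pixel would end up, and the planner picks the action whose prediction lands closest to the goal. A minimal sketch, where `predict_pixel` stands in for the learned model (hypothetical name, not the authors' API):

```python
import numpy as np

def pixel_goal_cost(predicted_pixel, goal_pixel):
    """Cost = Euclidean distance between the predicted pixel
    location and the goal location (both (row, col) coordinates)."""
    return float(np.linalg.norm(np.asarray(predicted_pixel, dtype=float)
                                - np.asarray(goal_pixel, dtype=float)))

def plan_best_action(candidate_actions, predict_pixel, goal_pixel):
    """Score each sampled action by the cost of the pixel position
    the model predicts for it; return the cheapest action."""
    costs = [pixel_goal_cost(predict_pixel(a), goal_pixel)
             for a in candidate_actions]
    return candidate_actions[int(np.argmin(costs))]

# Toy stand-in for the learned predictor: the action vector simply
# displaces the tracked pixel from its current position.
start = np.array([10.0, 10.0])
predict = lambda action: start + action

actions = [np.array([1.0, 0.0]), np.array([0.0, 5.0]), np.array([4.0, 4.0])]
best = plan_best_action(actions, predict, goal_pixel=np.array([14.0, 14.0]))
```

In the real system the candidate actions would be whole action sequences scored by a sampling-based planner, but the pixel-to-goal cost is the same idea.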