1. 强化学习——从Q-Learning到DQN到底发生了什么?
https://blog.csdn.net/qq_41352018/article/details/80274282?utm_medium=distribute.pc_relevant.none-task-blog-BlogCommendFromBaidu-2.not_use_machine_learn_pai&depth_1-utm_source=distribute.pc_relevant.none-task-blog-BlogCommendFromBaidu-2.not_use_machine_learn_pai
2. 深度强化学习-Policy Gradient基本实现
https://www.jianshu.com/p/2ccbab48414b
3. 强化学习 Reinforcement Learning (莫烦 Python 教程)
https://www.bilibili.com/video/BV13W411Y75P?p=1