https://github.com/sweetice/Deep-reinforcement-learning-with-pytorch/blob/master/Char4%20A2C/A2C.py
另外这个里面有a2c,a3c的区别的示意图
https://github.com/MG2033/A2C
http://www.dataguru.cn/article-14078-1.html