漫谈RNN之梯度消失及梯度爆炸:http://bbs.imefuture.com/article/4405
漫谈RNN之长短期记忆模型LSTM:http://bbs.imefuture.com/article/4406
漫谈RNN之长短期记忆模型LSTM(续):http://bbs.imefuture.com/article/4407
attention:https://zhuanlan.zhihu.com/p/47282410
Transformer : https://jalammar.github.io/illustrated-transformer/