Continuously updated (last update: 2020-05-28)
Transformer Explained in Detail
Paper Reading | Lite Transformer with Long-Short Range Attention
References:
ICLR 2020 Trend Analysis: Better & Faster Transformers in NLP