GitHub
- https://github.com/facebookresearch/fairseq
- https://github.com/facebookresearch/pytext
- https://github.com/facebookresearch/XLM.git
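The fairseq repo ships pretrained XLM-R checkpoints. A minimal sketch of loading one via torch.hub and extracting sentence features; the entry-point names (`pytorch/fairseq`, `xlmr.large`) follow the fairseq XLM-R README and may differ across fairseq versions:

```python
# Sketch: load the released XLM-R (large) checkpoint through fairseq's
# torch.hub interface and extract contextual features for one sentence.
import torch

xlmr = torch.hub.load('pytorch/fairseq', 'xlmr.large')
xlmr.eval()  # disable dropout; keep train mode if fine-tuning

tokens = xlmr.encode('Hello world!')      # SentencePiece token ids, with <s>/</s>
features = xlmr.extract_features(tokens)  # last-layer hidden states
print(features.shape)                     # e.g. torch.Size([1, seq_len, 1024])
```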
Abstract
Task: pretrain large-scale cross-lingual representations that transfer well to a wide range of natural-language tasks.
Method: train a Transformer-based masked language model on more than 2 TB of filtered CommonCrawl data covering 100 languages (see the masking sketch at the end of this section).
Results:
- Outperforms mBERT on a variety of cross-lingual benchmarks:
    - +14.6% average accuracy on XNLI
    - +13% average F1 score on MLQA
    - +2.4% F1 score on NER
- Performs particularly well on languages with fewer (training) resources:
    - +15.7% XNLI accuracy for Swahili over previous XLM models
    - +11.4% XNLI accuracy for Urdu over previous XLM models
- Provides a detailed empirical analysis of the key factors behind these gains, including:
    - the trade-off between positive transfer and capacity dilution
    - the performance of high- vs. low-resource languages at scale
- Shows that a single multilingual model (XLM-R) can cover many languages while remaining competitive with strong monolingual models.
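To make the "Transformer-based masked language model" objective above concrete, here is a minimal sketch (not the authors' code) of BERT/XLM-style dynamic masking: 15% of tokens are selected as prediction targets, of which 80% are replaced by `<mask>`, 10% by a random token, and 10% left unchanged. The hyperparameters, special-token ids, and function name are assumptions for illustration.

```python
# Sketch of BERT/XLM-style MLM masking (assumed 15% selection, 80/10/10 split);
# not the authors' implementation.
import torch

def mask_tokens(token_ids, mask_id, vocab_size, special_ids=(0, 1, 2, 3),
                mask_prob=0.15):
    """Return (masked_inputs, labels) for a batch of token-id tensors."""
    inputs = token_ids.clone()
    labels = token_ids.clone()

    # Never mask special tokens (<s>, <pad>, </s>, <unk>; ids here are assumptions).
    special = torch.zeros_like(inputs, dtype=torch.bool)
    for sid in special_ids:
        special |= inputs == sid

    # Select ~15% of the remaining positions as prediction targets.
    probs = torch.full(inputs.shape, mask_prob)
    probs.masked_fill_(special, 0.0)
    selected = torch.bernoulli(probs).bool()
    labels[~selected] = -100  # ignored by the cross-entropy loss

    # 80% of selected positions -> <mask>
    to_mask = torch.bernoulli(torch.full(inputs.shape, 0.8)).bool() & selected
    inputs[to_mask] = mask_id

    # 10% -> a random vocabulary token (half of the remaining selected positions)
    to_random = (torch.bernoulli(torch.full(inputs.shape, 0.5)).bool()
                 & selected & ~to_mask)
    inputs[to_random] = torch.randint(vocab_size, inputs.shape)[to_random]

    # The remaining 10% keep the original token.
    return inputs, labels
```

During pretraining, the model is trained with cross-entropy only on the selected positions (-100 is the default ignore index of torch.nn.CrossEntropyLoss).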