Noise Contrastive Estimation - 走看看

zoukankan html css js c++ java

Noise Contrastive Estimation

Notes from Notes on Noise Contrastive Estimation and Negative Sampling
one sample:

[x_i o [y_i^0,cdots,y_{i}^{k}] ]
where (y_i^0) are true labeled words , and (y_i^1,cdots,y_i^{k}) are noise samples word index, which is generated by unigram distribution (q(w)) of the dataset.
the probability of true data:

[p(y_i^0=1|x_i, heta)=frac{exp(y_i^0,h_ heta)}{exp(y_i^0 h_ heta) + k*q(y_i^0)} ]
the noise sample probability:

[p(y_i^t=0|x_i, heta)=frac{k*q(y_i^t)}{exp(y_i^t h_ heta) + k*q(y_i^t)},t=1,cdots,k ]
the cost function of this sample:

[l_{nce}=log p(y_i^0|x_i, heta)+sum_{t=1}^k{log p(y_i^t|x_i, heta)} ]
the overall cost function of the dataset:

[mathcal{L}_{nce}=frac{1}{N}sum_i^N{left{log p(y_i^0|x_i, heta)+sum_{t=1}^k{log p(y_i^t|x_i, heta)} ight}} ]
Related Paper

[Noise-Contrastive Estimation of Unnormalized Statistical Models with Applications to Natural Image Statistics]

[Word2vec Parameter Learning Explained]

[Efficient Estimation of Word Representation in Vector Space]

[Distributed Representations of Words and Phrases and their Compositionality]

[Notes on Noise Contrastive Estimation and Negative Sampling]

查看全文

相关阅读:
CF575A Fibonotci [线段树+矩阵快速幂]
P3768 简单的数学题 [杜教筛，莫比乌斯反演]
2-SAT 学习笔记
 CF776D The Door Problem [2sat]
KD-Tree 学习笔记
 Mybatis入门笔记(2)——基于代理Dao实现CRUD
Mybatis入门笔记(1)——基于原始dao实现CRUD
mybatis入门看这一篇就够了
 使用JDBC程序的问题总结
 关于递归你知道多少？

原文地址：https://www.cnblogs.com/ZJUT-jiangnan/p/5934647.html

Copyright © 2011-2022 走看看