A Regularized Competition Model for Question Diffi culty Estimation in Community Question Answering Services-20160520

zoukankan html css js c++ java

A Regularized Competition Model for Question Diffi culty Estimation in Community Question Answering Services-20160520
1、Information

publication：EMNLP 2014

author:Jing Liu(在前一篇sigir基础上，拓展模型的论文)

2、What

衡量CQA中问题的困难程度,提出从两个方向建模

1)利用Competition的比较：Competition Model
q = {ua ≺q , q ≺ub , ua ≺ub , uo1 ≺ub , · · · , uoM ≺ub } ,

2) question Text Similarities for QDE，相似程度的问题具有相似的描述。（冷启动问题）

3、Dataset

Stack Overflow:

是一个与程序相关的IT技术问答网站。

数据下载地址：

http://www.ics.uci.edu/~duboisc/stackoverflow/
- qid: Unique question id
- i: User id of questioner
- qs: Score of the question
- qt: Time of the question (in epoch time)
- tags: a comma-separated list of the tags associated with the question. Examples of tags are ``html'', ``R'', ``mysql'', ``python'', and so on; often between two and six tags are used on each question.
- qvc: Number of views of this question (at the time of the datadump)
- qac: Number of answers for this question (at the time of the datadump)
- aid: Unique answer id
- j: User id of answerer
- as: Score of the answer
- at: Time of the answer
4、How

input: question user Competition,question-question的Competition，similarity.

output: pair compare result.

method：RCM

5、Evaluation:accuracy:ACC =# correctly judged question pairs/# all question pairs

baseline:pagerank,TS,CM

6、additional analysis

1)不同方式计算text similarity

2）estimate difficult sorce of cold start problem:KNN

3) 不同difficult level的text words 举例

7、conclusion
查看全文

相关阅读:
linux 端口被占用
 vue项目刷新当前页面
 SQL关于删除的三个语句：DROP、TRUNCATE、 DELETE 的区别
 mybatis模糊查询去除特殊符号%(百分号)和_(下划线)
SpringMVC 五种注解参数绑定
 导出数据到Excel--多sheet
POI 导出工具实例
 Java 数组转换成字符串添加逗号类似 js array的join
SpringBoot常用注解总结
 Java类的主动使用和被动使用-面试题

原文地址：https://www.cnblogs.com/baiting/p/5511738.html