
Views: 402  |  Replies: 1

起沃尔特与

Woodworm (somewhat well known)

[Answer] Reply to a help request

yuxintian: coins +50, translation EPI +1, 2013-03-21 14:37:36
At present, most supervised labeling methods perform well when a large-scale annotated corpus is available, but in real-world applications
annotated corpora are not only difficult to obtain but also hard to reuse across tasks. In this article, we present a prototype-model extension algorithm based on the A-method:
First, the original small-scale training data is used to build an ensemble annotator with a certain accuracy.
Second, the A-algorithm is used to expand the training data automatically: candidate examples among the unlabeled data are predicted, and
those whose predicted scores exceed a certain threshold are added to the training set.
Finally, noisy examples are pruned according to the constraints that hold in the training data, and the classifier is retrained
iteratively on the expanded training data until the iteration becomes stable.
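
For anyone who wants to see the three steps as a loop, here is a minimal sketch in Python. It is only an illustration under assumptions, not the algorithm from the article: the abstract does not say what the A-method is, so a toy nearest-centroid classifier on one-dimensional values stands in for the ensemble annotator, the confidence score is an invented distance ratio, and the `keep` hook is a hypothetical placeholder for the noise-pruning constraints.

```python
from statistics import mean


def train_centroids(labeled):
    """Fit a toy nearest-centroid classifier: one mean value per label."""
    by_label = {}
    for x, y in labeled:
        by_label.setdefault(y, []).append(x)
    return {y: mean(xs) for y, xs in by_label.items()}


def predict(centroids, x):
    """Return (label, confidence) for x.

    Confidence is an assumed score: 1 minus the share of the total
    distance that belongs to the nearest centroid.
    """
    dists = {y: abs(x - c) for y, c in centroids.items()}
    label = min(dists, key=dists.get)
    total = sum(dists.values())
    confidence = 1.0 if total == 0 else 1.0 - dists[label] / total
    return label, confidence


def self_train(labeled, unlabeled, threshold=0.8, max_rounds=10,
               keep=lambda x, y: True):
    """Expand the training set with confident predictions until stable.

    `keep` is a hypothetical constraint check used to drop noisy
    candidates; by default it accepts everything.
    """
    labeled = list(labeled)
    pool = list(unlabeled)
    for _ in range(max_rounds):
        centroids = train_centroids(labeled)      # step 1: (re)train
        accepted, remaining = [], []
        for x in pool:                            # step 2: predict candidates
            label, conf = predict(centroids, x)
            if conf > threshold and keep(x, label):   # step 3: threshold + pruning
                accepted.append((x, label))
            else:
                remaining.append(x)
        if not accepted:                          # nothing new: iteration is stable
            break
        labeled.extend(accepted)
        pool = remaining
    return train_centroids(labeled), labeled, pool


seed = [(0.0, "neg"), (1.0, "neg"), (9.0, "pos"), (10.0, "pos")]
model, expanded, leftover = self_train(seed, [0.5, 2.0, 8.0, 5.0])
print(len(expanded), leftover)
```

In this toy run the ambiguous value 5.0 never clears the threshold, so it stays in the unlabeled pool while the other three values join the training set.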
Foresight beats experience.
#2, 2013-03-21 08:52:35