| 查看: 2837 | 回复: 30 | ||
| 【悬赏金币】回答本帖问题,作者空空如也0将赠送您 50 个金币 | ||
| 当前只显示满足指定条件的回帖,点击这里查看本话题的所有回帖 | ||
[求助]
给了一个月的修订时间,我10天就改好了。我要不要等到一个月后再投?已有6人参与
|
||
|
审稿意见: Reviewer #1: I understand that AL-based approach needs both "initial training set" and "the newly added samples" in order to select most representative samples. However, there are two questions. First, what do you mean representative samples? Does it mean a set of samples which covers both valid and invalid links? Second, it seems that in the Algorithm 1, active learning relies on the termination condition to ensure the representativeness of the samples. If I am correct, the termination condition used in the paper (i.e., the number of labeled samples reaches a preset value) does not make any sense. A better termination condition could be the labeled samples shall contains at least 1 valid link and 1 invalid link. Reviewer #2: Tracking the relation between artifacts in software project is important. Generally, it is human intensive task to construct the traceability links. Traditional information retrieve techniques has been employed to automatic analyze and recover traceability links. Even machine learning approaches as adopted to train an effective predictive model for traceability link recovery. It requires humans to label traceability links. This paper presents a TLR approach based on active learning. Evaluation experiments were conducted on seven commonly used traceability datasets. It was compared with an IR-based approach and a current machine learning approach. The experiment shows that AL-based approach outperforms the other two approaches in terms of F-score. Concerns 1、 Page1, "(hereafter called AL-based approach)" is repeated in the abstract and introduction. 2、 Page 1, section 1, left column, last line, "traceability" means "traceability relationships" or "traceability links"? 3、 Page 2, left column, "TSL-based approach is that how to select traceability links for labeling to generate traceability information."=》that 4、 Page 3, Section 3, Step1: (1)" randomly selecting a small number of samples for labeling to initialize Dt", =>"randomly labeling a small number of samples to initialize Dt " 5、 Page 3, Section 3, Step1: (3) "selecting an unlabeled sample from the unlabeled sample set based on sample selection strategy and requesting experts to label the sample"=> The authors need to define the D and Dl here. Regarding to the context, the Dt and Dl seem equivalent, why use different symbol? 6、 Page 3, Section 3, Step 4, This paper chose Random Forest as the classification algorithm. However, the authors only claimed that "The reason for choosing Random Forest is because it has been shown to be accurate and robust". It would be better to explain why random forest is more suitable for the task. 7、 Page 4, Algorithm1 needs to be reconstructed. It would be much better to define input and output, as well as all the variables used in the algorithm. 8、 Section 4. It would be much better to move Experimental metric at the beginning of section 4. Authors use the F-score before its definition. 9、 The format of refences should be standardized, especially, the names and abbreviations of journals and conferences. |
» 猜你喜欢
投稿精细化工
已经有6人回复
博士读完未来一定会好吗
已经有36人回复
之前让一硕士生水了7个发明专利,现在这7个获批发明专利的维护费可从哪儿支出哈?
已经有10人回复
博士申请都是内定的吗?
已经有9人回复
心脉受损
已经有8人回复
读博
已经有5人回复
yunsu
银虫 (正式写手)
- 应助: 6 (幼儿园)
- 金币: 1159.8
- 散金: 380
- 红花: 7
- 帖子: 578
- 在线: 94.5小时
- 虫号: 1045776
- 注册: 2010-06-22
- 性别: GG
- 专业: 土壤学
26楼2019-07-31 15:59:34
2楼2019-07-30 17:29:30
国际科学编辑
铁杆木虫 (知名作家)
- 应助: 1532 (讲师)
- 金币: 3627.6
- 散金: 612
- 红花: 275
- 沙发: 3
- 帖子: 7972
- 在线: 146.9小时
- 虫号: 4407167
- 注册: 2016-02-16
- 性别: GG
- 专业: 病原微生物变异与耐药
3楼2019-07-30 17:31:43
jokage
新虫 (著名写手)
- 应助: 1 (幼儿园)
- 金币: 1167.2
- 散金: 4647
- 红花: 22
- 帖子: 2043
- 在线: 437.1小时
- 虫号: 2325646
- 注册: 2013-03-07
- 性别: GG
- 专业: 天文技术和方法
4楼2019-07-30 17:34:01













回复此楼