24小时热门版块排行榜    

Znn3bq.jpeg
北京石油化工学院2026年研究生招生接收调剂公告
查看: 1266  |  回复: 3
当前主题已经存档。

alwens

铁杆木虫 (正式写手)

老木虫

[交流] 文章修改意见(补做Y-randomization)

最近投稿到J. Comput.-Aided Mol. Des.的一篇文章,审稿人对拟和模型的验证提出一个问题,就是希望增加一个Y-randomization test,以增加说服力。以前发的文章一般是对模型做一个内部数据集的交叉验证,再做个对外部测试集验证就可以了,Y-randomization还真没做过。查了一下常用的统计学软件如origin和SPSS,好像也没找到相关的模块或功能。因为原理很简单,就动手写了个R的脚本,现在把编辑问的问题,我的回答,以及这个脚本拿来和大家分享,希望对各位同行有参考价值。

问题:I advise the authors to perform additional validation for the models developed. For example, conduct the Y-randomization test (scramble stability test). Eighteen compounds in the test set is a low number for a training set of 108 molecules.

回答:Thanks for the reviewer’s advice. We have included the Y-randomization test results in Section 3.4, Results and Discussion.
Except for the concern of the generalizability, the high internal validation performance of our xxxx models might be a result of chance correlation. To address this problem, these three models were validated by applying the Y-randomization of response test (in this work, the experimental activity values). It consists of repeating the calculation procedure several times after shuffling the Y vector randomly. If all models obtained by the Y-randomization test have relatively high values for both q2 and r2 statistics, this is due to a chance correlation and implies that the current modeling method cannot lead to an acceptable model using the available data set. This was not the case for the data set and methodology used in this work. Ten random shuffles of the Y vector were performed and the results are shown in Table 4. The low q2 and r2 values show that the good results in our original models are not due to a chance correlation or structural dependency of the training set.
Table 4.        Y-Randomization results of the three models.
Iteration        Model1        Model2        Model3
        r2        q2        r2        q2        r2        q2
1        0.06087        0.02613        0.03359        0.00700        0.00004        0.27470
2        0.00050        0.06217        0.00011        0.05587        0.00675        0.01233
3        0.00287        0.07728        0.01106        0.00452        0.00673        0.01450
4        0.00309        0.06925        0.01152        0.00495        0.03866        0.00979
5        0.02495        0.00113        0.00021        0.37390        0.00426        0.04193
6        0.00080        0.27630        0.00003        0.43000        0.00200        0.15350
7        0.00424        0.04414        0.00728        0.01734        0.03321        0.00651
8        0.02441        0.00040        0.01375        0.00040        0.02045        0.00008
9        0.00199        0.09985        0.01244        0.00248        0.03296        0.00430
10        0.00795        0.01594        0.00122        0.2014        0.01232        0.00541
脚本:
d<-read.table('数据文件1',header=TRUE)
vp<-1:nrow(d)
TIMES=100
for ( j in 1:TIMES){
x <- d$模型1
y <- sample(d$试验值)
print(summary(lm(y~x)))
for ( i in 1:nrow(d)){
x1 <- x
x0 <- x[-i]
y1 <- y
y0 <- y[-i]
yp<-predict(lm(y0~x0),data.frame(x0 = x), se.fit = TRUE)
vp=yp$fit
}
print(summary(lm(y~vp)))
}
注:“数据文件1”为tab分隔的,行标为试验值,列标为模型预测值的文本文件。

[ Last edited by alwens on 2006-9-6 at 15:42 ]
回复此楼

» 收录本帖的淘帖专辑推荐

学习工作等

» 猜你喜欢

万里夕阳锦背高 翻身犹恨东洋小 太公怎钓?
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

goldjay

至尊木虫 (知名作家)

龙虫

0.5

写得不错!
路漫漫其修远兮,吾将上下而求索
2楼2006-09-06 16:17:30
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

1

3楼2006-09-06 18:02:57
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

1

4楼2006-09-07 10:51:06
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
相关版块跳转 我要订阅楼主 alwens 的主题更新
普通表情 高级回复 (可上传附件)
最具人气热帖推荐 [查看全部] 作者 回/看 最后发表
[考研] 338求调剂 +6 wxygxsaaaaa 2026-04-06 6/300 2026-04-08 01:14 by BruceLiu320
[考研] 求考研材料调剂 +3 材化李可 2026-04-07 3/150 2026-04-08 00:21 by JourneyLucky
[考研] 调剂 +18 不逢春 2026-04-05 19/950 2026-04-07 22:04 by lijunpoly
[考研] 071000生物学,一志愿深圳大学296分,求调剂 +12 TIckLw 2026-04-06 13/650 2026-04-07 20:34 by lijunpoly
[考研] 22408 318分求调剂 +4 勤奋的小笼包 2026-04-06 6/300 2026-04-07 15:05 by 纸鹤555
[考研] 305分求调剂 +3 哈_哈_哈_哈_哈 2026-04-04 5/250 2026-04-07 14:49 by 哈_哈_哈_哈_哈
[考研] 080500求调剂 +12 黄宇博 2026-04-06 12/600 2026-04-07 12:41 by upczlm1989
[考研] 一志愿上海海洋大学083200食品学硕,求调剂,接受其他专业 +9 what张 2026-04-01 11/550 2026-04-07 09:45 by momo皓
[考研] 085100建筑学 寻求跨专业调剂 一志愿南大294分 校级省级国家级奖项若干 踏实肯干 +3 1021075758 2026-04-06 4/200 2026-04-07 09:23 by 蓝云思雨
[考研] 346分的生物与医药08600求调剂 +6 常雨阳上岸 2026-04-05 7/350 2026-04-06 12:36 by lys0704
[考研] 315求调剂 +14 欣喜777 2026-04-04 15/750 2026-04-06 06:25 by houyaoxu
[考研] 358求调剂 +7 秋gk 2026-04-04 7/350 2026-04-05 13:29 by huangmoli
[考研] 一志愿江南大学085501机械工程专硕326分,本科佳木斯大学 +5 顾若浮生 2026-04-03 9/450 2026-04-05 09:57 by 1753564080
[考研] +5 化工专硕323分 2026-04-04 5/250 2026-04-05 08:02 by 544594351
[考研] 调剂 +11 JLLLLLLLLLL 2026-04-03 11/550 2026-04-04 22:21 by hemengdong
[考研] 278求调剂 +14 范婷娜 2026-04-04 15/750 2026-04-04 22:15 by lqwchd
[考研] 怎么删帖子啊 +3 缝曦1000 2026-04-04 3/150 2026-04-04 14:20 by 土木硕士招生
[考研] 0710生物学336分求调剂 +6 kiyy 2026-04-01 8/400 2026-04-04 10:10 by kiyy
[考研] 330求调剂 +3 白神呜呼呼 2026-04-02 3/150 2026-04-03 10:15 by 蓝云思雨
[考研] 325分化学调剂 +5 15771691647 2026-04-02 5/250 2026-04-03 09:58 by ChemPharm
信息提示
请填处理意见