24小时热门版块排行榜    

查看: 3609  |  回复: 19
当前只显示满足指定条件的回帖,点击这里查看本话题的所有回帖

kele1982

金虫 (正式写手)

[交流] 【求助】QSAR模型中交叉验证系数(q2)怎么获得? 已有2人参与

请问我用逐步回归分析方法得到一个QSAR模型,但是结果里面没有交叉验证系数q2(英文叫: leave-one-out),请问怎么计算得到啊?谢谢
回复此楼
踏上科研不归路!
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

niliu

铁杆木虫 (著名写手)

★ ★ ★ ★ ★
yuhuobuku(金币+5,VIP+0):欢迎参加讨论 4-8 09:36
Consensus QSAR models: Do the benefits outweigh the complexity?
  
Author(s): Hewitt M (Hewitt, Mark), Cronin MTD (Cronin, Mark T. D.), Madden JC (Madden, Judith C.), Rowe PH (Rowe, Philip H.), Johnson C (Johnson, Clara), Obi A (Obi, Anrdrea), Enoch SJ (Enoch, Steven J.)

Source: JOURNAL OF CHEMICAL INFORMATION AND MODELING    Volume: 47    Issue: 4    Pages: 1460-1468    DOI: 10.1021/ci700016d    Published: JUL-AUG 2007   

Abstract: This study has assessed the use of consensus regression, as compared to single multiple linear regression, models for the development of quantitative structure-activity relationships (QSARs). To provide a comparison, four data sets of varying size and complexity were analyzed: silastic membrane flux, toxicity of phenols to Tetrahymena pyriformis, acute toxicity to the fathead minnow and flash point. For each data set, a genetic algorithm was used to develop a model population and the performance of consensus models was compared to that of the best single model. Two consensus models were developed, one using the top 10 models, and the other using a subset of models chosen to provide maximal coverage of model space. The results highlight the ability of the genetic algorithm to develop predictive models from a large descriptor pool. However, the consensus models were shown to offer no significant improvements over single regression models, which are as statistically robust as the equivalent consensus models. Consensus models developed from a selection of the best QSARs were shown not to be superior to a selection of diverse in "model space" QSARs. For the data sets analyzed in this study, and in light of the Organization for Economic Cooperation and Development principles for the validation of QSARs, the increase in model complexity when using consensus models does not seem warranted given the minimal improvement in model statistics.
18楼2009-04-01 08:00:21
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
查看全部 20 个回答

snoopyzhao

至尊木虫 (职业作家)


yyx19840628(金币+1,VIP+0):谢谢 2-11 10:44
根据 leave-one-out 的算法自己编程序算吧,如果你现在的统计程序不提供的话
2楼2009-02-11 08:43:59
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

yalefield

金虫 (文坛精英)

老汉一枚

★ ★ ★ ★ ★
yyx19840628(金币+2,VIP+0):谢谢 2-11 10:44
kele1982(金币+3,VIP+0):谢谢! 2-15 10:29
请给出一些细节.
如,用的什么软件?
还是自己编写程序?

训练集和测试集是怎么划分的?

Leave-one-out(LOO), 叫做留一法(当然,还有留N法)
训练集和测试集都要用到留一法。
3楼2009-02-11 10:14:07
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

snoopyzhao

至尊木虫 (职业作家)

引用回帖:
Originally posted by yalefield at 2009-2-11 10:14:
训练集和测试集是怎么划分的?

训练集和测试集都要用到留一法。

跟贴请教老汉两个问题:

1)通常训练集与测试集应该如何划分?

2)测试集如何用到留一法?我只知道对训练集使用留一法。

谢谢指教!
4楼2009-02-11 11:08:07
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
普通表情 高级回复 (可上传附件)
信息提示
请填处理意见