cxn253800

[Discussion] [Help] Error in a parallel calculation

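I ran into the problems shown below while running VASP in parallel. Could the experts here tell me what roughly causes this and how to fix it? Also, is it possible to continue the calculation from where it stopped? Many thanks!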

[xlxy50123k@cn54 p+0.2_11_4]$ cat 2617.cn42/log.out
running on   20 nodes
distr:  one band on    1 nodes,   20 groups
vasp.4.6.25  17Sep03 complex
POSCAR found :  2 types and   72 ions
WARNING: for PREC=h ENMAX is automatically increase by 25 %
        this was not the case for versions prior to vasp.4.4
WARNING: for PREC=h ENMAX is automatically increase by 25 %
        this was not the case for versions prior to vasp.4.4
LDA part: xc-table for Ceperly-Alder, standard interpolation
POSCAR, INCAR and KPOINTS ok, starting setup
FFT: planning ...           1
reading WAVECAR
entering main loop
       N       E                     dE             d eps       ncg     rms          rms(c)
DAV:   1     0.915581049949E+03    0.91558E+03   -0.57441E+04102400   0.575E+02
DAV:   2    -0.316854107449E+03   -0.12324E+04   -0.11908E+04129640   0.204E+02
DAV:   3    -0.488596428903E+03   -0.17174E+03   -0.16638E+03168780   0.636E+01
DAV:   4    -0.497571559058E+03   -0.89751E+01   -0.87995E+01153340   0.162E+01
DAV:   5    -0.498020261711E+03   -0.44870E+00   -0.44446E+00162740   0.358E+00    0.211E+01
DAV:   6    -0.476717736694E+03    0.21303E+02   -0.77787E+01154360   0.144E+01    0.803E+00
DAV:   7    -0.473883612020E+03    0.28341E+01   -0.61735E+00138100   0.461E+00    0.363E+00
DAV:   8    -0.473340642132E+03    0.54297E+00   -0.27394E+00150440   0.300E+00    0.150E+00
DAV:   9    -0.473265671114E+03    0.74971E-01   -0.81023E-01154800   0.180E+00    0.789E-01
DAV:  10    -0.473253888489E+03    0.11783E-01   -0.20425E-01156460   0.848E-01    0.557E-01
p11_32455:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_11_32462: (122433.809734) net_send: could not write to fd=5, errno = 32
p12_30964:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_12_30971: (122433.762920) net_send: could not write to fd=5, errno = 32
p13_10157:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_13_10166: (122433.723040) net_send: could not write to fd=5, errno = 32
p15_6626: (122433.526654) net_recv failed for fd = 24
p15_6626:  p4_error: net_recv read, errno = : 104
rm_l_15_6633: (122433.580176) net_send: could not write to fd=5, errno = 32
p16_26554:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_16_26561: (122433.402299) net_send: could not write to fd=5, errno = 32
p17_32596: (122432.820773) net_recv failed for fd = 4
p17_32596:  p4_error: net_recv read, errno = : 104
rm_l_17_32603: (122432.848173) net_send: could not write to fd=5, errno = 32
p18_7839:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_18_7858: (122432.691534) net_send: could not write to fd=5, errno = 32
p14_31298:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_14_31305: (122433.730857) net_send: could not write to fd=5, errno = 32
p0_21878:  p4_error: net_recv read:  probable EOF on socket: 1
p10_31272: (122433.841958) net_recv failed for fd = 6
p10_31272:  p4_error: net_recv read, errno = : 104
rm_l_10_31279: (122433.842219) net_send: could not write to fd=5, errno = 32
p7_9173:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_7_9180: (122433.988595) net_send: could not write to fd=5, errno = 32
p2_31183:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_2_31190: (122434.240183) net_send: could not write to fd=5, errno = 32
p9_32164: (122433.879182) net_recv failed for fd = 6
p9_32164:  p4_error: net_recv read, errno = : 104
rm_l_9_32171: (122433.886948) net_send: could not write to fd=5, errno = 32
p6_31185:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_6_31192: (122434.033577) net_send: could not write to fd=5, errno = 32
p5_8406:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_5_8413: (122434.035358) net_send: could not write to fd=5, errno = 32
p19_13370: (122432.247734) net_recv failed for fd = 22
p19_13370:  p4_error: net_recv read, errno = : 104
rm_l_19_13377: (122432.270883) net_send: could not write to fd=5, errno = 32
p1_21882:  p4_error: net_recv read:  probable EOF on socket: 1
rm_l_1_21889: (122434.560077) net_send: could not write to fd=5, errno = 32
forrtl: error (69): process interrupted (SIGINT)
p10_31272: (122455.951022) net_send: could not write to fd=5, errno = 32
p13_10157: (122465.874662) net_send: could not write to fd=5, errno = 32
p18_7839: (122464.843650) net_send: could not write to fd=5, errno = 32
p11_32455: (122465.962520) net_send: could not write to fd=5, errno = 32
p16_26554: (122465.555746) net_send: could not write to fd=5, errno = 32
p5_8406: (122466.189766) net_send: could not write to fd=5, errno = 32
p12_30964: (122465.917570) net_send: could not write to fd=5, errno = 32
p17_32596: (122464.974103) net_send: could not write to fd=5, errno = 32
p9_32164: (122466.037930) net_send: could not write to fd=5, errno = 32
p19_13370: (122464.400454) net_send: could not write to fd=5, errno = 32
p14_31298: (122465.852539) net_send: could not write to fd=5, errno = 32
p15_6626: (122465.688378) net_send: could not write to fd=5, errno = 32
p7_9173: (122466.141305) net_send: could not write to fd=5, errno = 32
p6_31185: (122466.195071) net_send: could not write to fd=5, errno = 32
p2_31183: (122468.379319) net_send: could not write to fd=5, errno = 32
p1_21882: (122470.560027) net_send: could not write to fd=5, errno = 32
p0_21878: (122470.630148) net_send: could not write to fd=4, errno = 32
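
For anyone triaging a log like the one above, here is a rough Python sketch (not from the original post) that tallies which p4 ranks reported errors, which errno values show up, and the last DAV step reached. The function name `summarize_log` and the errno labels are my own assumptions: 32 and 104 correspond to EPIPE and ECONNRESET on Linux, and I am assuming the cluster runs Linux since the thread does not say.

```python
import re
import sys
from collections import Counter

# Assumption: Linux errno numbering (the thread does not state the OS).
LINUX_ERRNO = {
    32: "EPIPE (broken pipe)",
    104: "ECONNRESET (connection reset by peer)",
}

NODES_RE = re.compile(r"running on\s+(\d+)\s+nodes")
DAV_RE = re.compile(r"^DAV:\s+(\d+)\s+(-?\d+\.\d+E[+-]\d+)")
RANK_RE = re.compile(r"^(?:rm_l_|p)(\d+)_\d+:")
ERRNO_RE = re.compile(r"errno = :?\s*(\d+)")


def summarize_log(text):
    """Collect reporting ranks, errno counts and the last DAV step from a log."""
    total = None        # process count from the "running on N nodes" line
    ranks = set()       # ranks that printed at least one p4 message
    errnos = Counter()  # errno value -> number of occurrences
    last_dav = None     # (step, energy) of the last electronic iteration
    for raw in text.splitlines():
        line = raw.strip()
        m = NODES_RE.search(line)
        if m:
            total = int(m.group(1))
            continue
        m = DAV_RE.match(line)
        if m:
            last_dav = (int(m.group(1)), float(m.group(2)))
            continue
        m = RANK_RE.match(line)
        if m:
            ranks.add(int(m.group(1)))
            e = ERRNO_RE.search(line)
            if e:
                errnos[int(e.group(1))] += 1
    # Ranks that never printed anything; only computable if the count is known.
    silent = sorted(set(range(total)) - ranks) if total is not None else []
    return {
        "reporting_ranks": sorted(ranks),
        "silent_ranks": silent,
        "errno_counts": dict(errnos),
        "last_dav": last_dav,
    }


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1]) as fh:
        info = summarize_log(fh.read())
    for code, count in sorted(info["errno_counts"].items()):
        print(code, LINUX_ERRNO.get(code, "unknown"), count)
    print("silent ranks:", info["silent_ranks"])
    print("last DAV step:", info["last_dav"])
```

Run it as a script with the log path as its only argument, e.g. `2617.cn42/log.out`. If I have counted correctly, ranks 3, 4 and 8 never print anything in the log above. That might point at the node hosting them as the one that dropped out, but this is only a guess and would need checking against the machine file.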

[ Last edited by aylayl08 on 2009-10-25 at 11:36 ]
We cannot give others what we do not have ourselves.

quantumfang

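"EOF on socket" means the network data stream has ended (End Of File).

Either the network or the network file system failed.

You will definitely have to restart.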
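
On the question of continuing rather than starting over, here is a minimal Python sketch of how one might prepare the run directory before resubmitting. It relies on the usual VASP file conventions as I understand them: CONTCAR holds the most recent ionic positions, and an existing WAVECAR is read on startup, as the "reading WAVECAR" line in the log suggests. The helper name `prepare_restart` and the backup name `POSCAR.orig` are hypothetical, and a non-empty file is not proof that it was written out completely before the crash.

```python
import shutil
from pathlib import Path


def prepare_restart(run_dir):
    """Copy a non-empty CONTCAR over POSCAR and report whether WAVECAR exists."""
    run = Path(run_dir)
    contcar = run / "CONTCAR"
    poscar = run / "POSCAR"
    wavecar = run / "WAVECAR"
    backup = run / "POSCAR.orig"  # hypothetical backup name

    result = {"poscar_updated": False, "wavecar_present": False}

    # Only use CONTCAR if it has content; it can be empty when the job
    # died before finishing its first ionic step.
    if contcar.is_file() and contcar.stat().st_size > 0:
        # Keep the very first POSCAR; do not overwrite an earlier backup.
        if poscar.is_file() and not backup.exists():
            shutil.copy2(poscar, backup)
        shutil.copy2(contcar, poscar)
        result["poscar_updated"] = True

    # Size check only: this cannot tell whether the file is intact.
    result["wavecar_present"] = wavecar.is_file() and wavecar.stat().st_size > 0
    return result
```

If the log above is complete, the job died during its first set of electronic iterations, so CONTCAR may well be empty. In that case the helper leaves POSCAR alone, and resubmitting would only reuse whatever WAVECAR is already on disk.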
Post #2, 2009-10-25 09:20:32

cxn253800

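Quoted reply:
Originally posted by quantumfang at 2009-10-25 09:20:
"EOF on socket" means the network data stream has ended (End Of File).

Either the network or the network file system failed.

You will definitely have to restart.

This kind of problem comes up often. Sometimes the error appears right after the job is submitted; this time the calculation got most of the way through before failing. It looks like I will have to start from scratch again.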
Post #3, 2009-10-25 11:38:26