| 查看: 2378 | 回复: 9 | ||
| 本帖产生 1 个 1ST强帖 ,点击这里进行查看 | ||
[求助]
同个任务同样的节点数VASP4.6能算,而VASP5.2不能
|
||
|
对24个原子的体系的结构优化任务,256个节点,VASP4.6可以计算,但VASP5.2(自旋轨道耦合版本)计算得到段错误提示: forrtl: severe (174): SIGSEGV, segmentation fault occurred。 用ulimit -a查了下stack size:无限制。 以前小体系的时候测试过,发现两个版本计算效率挺接近的,这次体系增大,同时增大节点数,发现很大差异,基本上VASP5.2很容易出现段错误提示。 看到wuli8版主的段错误一个贴,最终解决是在FLAG的选项中加入: -heap-arrays 64,而我的VASP5.2版本原来编译时候已经加入了这项: FFLAGS = -FR -lowercase -assume byterecl -heap-arrays 64 -I/opt/intel/mkl/10.2.1.017/include/fftw 其它编译参数: FC=mpiifort CPP = $(CPP_) -DMPI -DHOST=\"LinuxIFC\" -DIFC \ -Dkind8 -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc \ -DMPI_BLOCK=8000 -DRPROMU_DGEMV -DRACCMU_DGEMV SCA=-L/opt/intel/mkl/10.2.1.017/lib/em64t -lmkl_scalapack_lp64 -lmkl_blacs_intelmpi_lp64 请教各位,编译有没问题,以前曾用这个VASP5.2计算过10个原子的自选轨道耦合,一直挺顺利的。最近计算20原子的体系,总是出现问题。请大家帮忙看下是什么问题? [ Last edited by zhang668 on 2011-6-15 at 22:22 ] |
» 猜你喜欢
博士申请都是内定的吗?
已经有7人回复
读博
已经有5人回复
之前让一硕士生水了7个发明专利,现在这7个获批发明专利的维护费可从哪儿支出哈?
已经有5人回复
博士读完未来一定会好吗
已经有29人回复
投稿精细化工
已经有4人回复
高职单位投计算机相关的北核或SCI四区期刊推荐,求支招!
已经有4人回复
导师想让我从独立一作变成了共一第一
已经有9人回复
心脉受损
已经有5人回复
Springer期刊投稿求助
已经有4人回复
小论文投稿
已经有3人回复
» 本主题相关价值贴推荐,对您同样有帮助:
请问DMA能不能测橡胶的弹性模量,该怎么测
已经有4人回复
不同数据,同样的分析方法算剽窃吗?
已经有13人回复
vasp4.6收敛的结构在vasp5.2 计算不收敛
已经有7人回复
(原创)MS4.4(LINUX版)详细安装步骤(同样适用于高版本)
已经有5人回复
急问 现在哪里的计算中心有vasp5.2.12?
已经有6人回复
紧急求助vasp5.2 计算氧化铈表面吸附遇到的若干问题
已经有4人回复
【求助】关于vasp5.2计算色散力
已经有5人回复
【求助】讨论一下vasp4.6 和vasp5.2的计算速度
已经有8人回复
【求助】用vasp4.6计算GaAs的能带结构
已经有18人回复
【求助】如何用vasp5.2中的DFPT方法算Born effective charge?
已经有11人回复
【答案】应助回帖
★ ★ ★ ★ ★
youzhizhe(金币+5): 谢谢交流。 2011-06-16 09:50:48
youzhizhe(1ST强帖+1): 谢谢交流。 2011-06-16 09:51:16
zhang668(金币+3): 非常感谢详细指导 2011-06-16 15:00:22
youzhizhe(金币+5): 谢谢交流。 2011-06-16 09:50:48
youzhizhe(1ST强帖+1): 谢谢交流。 2011-06-16 09:51:16
zhang668(金币+3): 非常感谢详细指导 2011-06-16 15:00:22
|
1.查一查下OUTCAR里最后几行的提示,是否进入了主循环计算? 2.-heap-arrays 64 只是解决栈溢出问题,你只有那么点原子,考虑自旋也也不会产生此问题; 3.检查一下你的编译器的版本,INTEL 10.xxx编译器本身有bug,会遇到一下意想不到的问题; 4.FC=mpiifort,这个是自己该的还是自动生成的?一般MPI编译后,会建立个mpif90到ifort连接,即便并行编译,FC=mpif90 或 ifort都可以; 5.你在做测试吗?这么小的体系还用那么多个节点,并行效率很差,时间都浪费在CPU之间的通讯上了。你可以减少节点测试一下,或者串行试一下看看是否还有这个问题。 |

2楼2011-06-16 06:24:08
3楼2011-06-16 15:25:45
4楼2011-06-16 15:29:18

5楼2011-06-16 17:45:47
6楼2011-06-16 17:50:42
|
k点不多。是这样的,我的4.6版本是直接拷过来的程序,就是在集群下编译的。 5.2旋轨耦合版本是我自己编译的。 这是5.2 makefile: FC=ifort # fortran linker FCL=$(FC) #----------------------------------------------------------------------- # whereis CPP ?? (I need CPP, can't use gcc with proper options) # that's the location of gcc for SUSE 5.3 # # CPP_ = /usr/lib/gcc-lib/i486-linux/2.7.2/cpp -P -C # # that's probably the right line for some Red Hat distribution: # # CPP_ = /usr/lib/gcc-lib/i386-redhat-linux/2.7.2.3/cpp -P -C # # SUSE X.X, maybe some Red Hat distributions: CPP_ = ./preprocess <$*.F | /usr/bin/cpp -P -C -traditional >$*$(SUFFIX) #----------------------------------------------------------------------- # possible options for CPP: # NGXhalf charge density reduced in X direction # wNGXhalf gamma point only reduced in X direction # avoidalloc avoid ALLOCATE if possible # PGF90 work around some for some PGF90 / IFC bugs # CACHE_SIZE 1000 for PII,PIII, 5000 for Athlon, 8000-12000 P4, PD # RPROMU_DGEMV use DGEMV instead of DGEMM in RPRO (depends on used BLAS) # RACCMU_DGEMV use DGEMV instead of DGEMM in RACC (depends on used BLAS) #----------------------------------------------------------------------- CPP = $(CPP_) -DHOST=\"LinuxIFC\" \ -Dkind8 -DCACHE_SIZE=12000 -DPGF90 -Davoidalloc \ # -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # general fortran flags (there must a trailing blank on this line) # byterecl is strictly required for ifc, since otherwise # the WAVECAR file becomes huge #----------------------------------------------------------------------- FFLAGS = -FR -lowercase -assume byterecl -heap-arrays 64 -I/opt/intel/mkl/10.2.1.017/include/fftw #----------------------------------------------------------------------- # optimization # we have tested whether higher optimisation improves performance # -axK SSE1 optimization, but also generate code executable on all mach. # xK improves performance somewhat on XP, and a is required in order # to run the code on older Athlons as well # -xW SSE2 optimization # -axW SSE2 optimization, but also generate code executable on all mach. # -tpp6 P3 optimization # -tpp7 P4 optimization #----------------------------------------------------------------------- # ifc.9.1, ifc.10.1 recommended OFLAG=-O3 OFLAG_HIGH = $(OFLAG) OBJ_HIGH = OBJ_NOOPT = DEBUG = -FR -O0 INLINE = $(OFLAG) #----------------------------------------------------------------------- # the following lines specify the position of BLAS and LAPACK # VASP works fastest with the libgoto library # so that's what we recommend #----------------------------------------------------------------------- # mkl.10.0 # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines BLAS=-L/opt/intel/mkl/10.2.1.017/lib/em64t -lmkl_intel_lp64 -lmkl_sequential -lpthread -lmkl_core # even faster for VASP Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html # parallel goto version requires sometimes -libverbs #BLAS= /opt/libs/libgoto/libgoto.so # LAPACK, simplest use vasp.5.lib/lapack_double #LAPACK= ../vasp.5.lib/lapack_double.o # use the mkl Intel lapack LAPACK=-L/opt/intel/mkl/10.2.1.017/lib/em64t -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread #----------------------------------------------------------------------- #LIB = -L../vasp.5.lib -ldmy \ ../vasp.5.lib/linpack_double.o $(LAPACK) \ $(BLAS) # options for linking, nothing is required (usually) LINK = #----------------------------------------------------------------------- # fft libraries: # VASP.5.2 can use fftw.3.1.X (http://www.fftw.org) # since this version is faster on P4 machines, we recommend to use it #----------------------------------------------------------------------- #FFT3D = fft3dfurth.o fft3dlib.o # alternatively: fftw.3.1.X is slighly faster and should be used if available #FFT3D = fftw3d.o fft3dlib.o /opt/libs/fftw-3.1.2/lib/libfftw3.a #======================================================================= # MPI section, uncomment the following lines until # general rules and compile lines # presently we recommend OPENMPI, since it seems to offer better # performance than lam or mpich # # !!! Please do not send me any queries on how to install MPI, I will # certainly not answer them !!!! #======================================================================= #----------------------------------------------------------------------- # fortran linker for mpi #----------------------------------------------------------------------- FC=mpiifort FCL=$(FC) #----------------------------------------------------------------------- # additional options for CPP in parallel version (see also above): # NGZhalf charge density reduced in Z direction # wNGZhalf gamma point only reduced in Z direction # scaLAPACK use scaLAPACK (usually slower on 100 Mbit Net) #----------------------------------------------------------------------- CPP = $(CPP_) -DMPI -DHOST=\"LinuxIFC\" -DIFC \ -Dkind8 -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc \ -DMPI_BLOCK=8000 -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # location of SCALAPACK # if you do not use SCALAPACK simply leave that section commented out #----------------------------------------------------------------------- #BLACS=$(HOME)/archives/SCALAPACK/BLACS/ #SCA_=$(HOME)/archives/SCALAPACK/SCALAPACK #SCA= $(SCA_)/libscalapack.a \ # $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a $(BLACS)/LIB/blacs_MPI-LINUX-0.a $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a SCA=-L/opt/intel/mkl/10.2.1.017/lib/em64t -lmkl_scalapack_lp64 -lmkl_blacs_intelmpi_lp64 #----------------------------------------------------------------------- # libraries for mpi #----------------------------------------------------------------------- LIB = -L../vasp.5.lib -ldmy \ ../vasp.5.lib/linpack_double.o $(LAPACK) \ $(SCA) $(BLAS) # FFT: fftmpi.o with fft3dlib of Juergen Furthmueller #FFT3D = fftmpi.o fftmpi_map.o fft3dfurth.o fft3dlib.o # alternatively: fftw.3.1.X is slighly faster and should be used if available FFT3D = fftmpiw.o fftmpi_map.o fftw3d.o fft3dlib.o /gpfssan1/apps/fftw32/lib/libfftw3f.a |
7楼2011-06-18 13:27:22
8楼2011-11-01 17:18:11
9楼2011-11-02 23:29:17
hnuzhoulin
金虫 (小有名气)
- 1ST强帖: 1
- 应助: 7 (幼儿园)
- 金币: 1202.8
- 红花: 6
- 帖子: 265
- 在线: 49小时
- 虫号: 778409
- 注册: 2009-05-24
- 性别: GG
- 专业: 计算机应用技术
10楼2011-11-03 18:15:41













回复此楼