| 查看: 3444 | 回复: 11 | ||
mengfc金虫 (正式写手)
|
[求助]
编译vasp出错
|
|
用ifort+mkl+mpich编译vasp 提示error #10236: File not found: '/opt/libs/libgoto/libgoto_p4_512-r0.6.so' makefile如下,请大家帮忙看看 .SUFFIXES: .inc .f .f90 .F #----------------------------------------------------------------------- # Makefile for Intel Fortran compiler for P4 systems # # The makefile was tested only under Linux on Intel platforms # (Suse 5.3- Suse 9.0) # the followin compiler versions have been tested # 5.0, 6.0, 7.0 and 7.1 (some 8.0 versions seem to fail compiling the code) # presently we recommend version 7.1 or 7.0, since these # releases have been used to compile the present code versions # # it might be required to change some of library pathes, since # LINUX installation vary a lot # Hence check ***ALL**** options in this makefile very carefully #----------------------------------------------------------------------- # # BLAS must be installed on the machine # there are several options: # 1) very slow but works: # retrieve the lapackage from ftp.netlib.org # and compile the blas routines (BLAS/SRC directory) # please use g77 or f77 for the compilation. When I tried to # use pgf77 or pgf90 for BLAS, VASP hang up when calling # ZHEEV (however this was with lapack 1.1 now I use lapack 2.0) # 2) most desirable: get an optimized BLAS # # the two most reliable packages around are presently: # 3a) Intels own optimised BLAS (PIII, P4, Itanium) # http://developer.intel.com/software/products/mkl/ # this is really excellent when you use Intel CPU's # # 3b) or obtain the atlas based BLAS routines # http://math-atlas.sourceforge.net/ # you certainly need atlas on the Athlon, since the mkl # routines are not optimal on the Athlon. # If you want to use atlas based BLAS, check the lines around LIB= # # 3c) mindblowing fast SSE2 (4 GFlops on P4, 2.53 GHz) # Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html # #----------------------------------------------------------------------- # all CPP processed fortran files have the extension .f90 SUFFIX=.f90 #----------------------------------------------------------------------- # fortran compiler and linker #----------------------------------------------------------------------- FC=ifort # fortran linker FCL=$(FC) #----------------------------------------------------------------------- # whereis CPP ?? (I need CPP, can't use gcc with proper options) # that's the location of gcc for SUSE 5.3 # # CPP_ = /usr/lib/gcc-lib/i486-linux/2.7.2/cpp -P -C # # that's probably the right line for some Red Hat distribution: # # CPP_ = /usr/lib/gcc-lib/i386-redhat-linux/2.7.2.3/cpp -P -C # # SUSE X.X, maybe some Red Hat distributions: CPP_ = ./preprocess <$*.F | /usr/bin/cpp -P -C -traditional >$*$(SUFFIX) #----------------------------------------------------------------------- # possible options for CPP: # NGXhalf charge density reduced in X direction # wNGXhalf gamma point only reduced in X direction # avoidalloc avoid ALLOCATE if possible # IFC work around some IFC bugs # CACHE_SIZE 1000 for PII,PIII, 5000 for Athlon, 8000-12000 P4 # RPROMU_DGEMV use DGEMV instead of DGEMM in RPRO (depends on used BLAS) # RACCMU_DGEMV use DGEMV instead of DGEMM in RACC (depends on used BLAS) #----------------------------------------------------------------------- CPP = $(CPP_) -DHOST=\"LinuxIFC\" \ -Dkind8 -DNGXhalf -DCACHE_SIZE=12000 -DPGF90 -Davoidalloc \ # -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # general fortran flags (there must a trailing blank on this line) #----------------------------------------------------------------------- FFLAGS = -FR -lowercase -assume byterecl #----------------------------------------------------------------------- # optimization # we have tested whether higher optimisation improves performance # -axK SSE1 optimization, but also generate code executable on all mach. # xK improves performance somewhat on XP, and a is required in order # to run the code on older Athlons as well # -xW SSE2 optimization # -axW SSE2 optimization, but also generate code executable on all mach. # -tpp6 P3 optimization # -tpp7 P4 optimization #----------------------------------------------------------------------- OFLAG=-O3 -xW OFLAG_HIGH = $(OFLAG) OBJ_HIGH = OBJ_NOOPT = DEBUG = -FR -O0 INLINE = $(OFLAG) #----------------------------------------------------------------------- # the following lines specify the position of BLAS and LAPACK # on P4, VASP works fastest with the libgoto library # so that's what I recommend #----------------------------------------------------------------------- # Atlas based libraries #ATLASHOME= $(HOME)/archives/BLAS_OPT/ATLAS/lib/Linux_P4SSE2/ #BLAS= -L$(ATLASHOME) -lf77blas -latlas # use specific libraries (default library path might point to other libraries) #BLAS= $(ATLASHOME)/libf77blas.a $(ATLASHOME)/libatlas.a # use the mkl Intel libraries for p4 (www.intel.com) # mkl.5.1 # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines #BLAS=-L/opt/intel/mkl/lib/32 -lmkl_p4 -lpthread # mkl.5.2 requires also to -lguide library # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines #BLAS=-L/opt/intel/mkl/lib/32 -lmkl_p4 -lguide -lpthread # even faster Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html BLAS= /opt/libs/libgoto/libgoto_p4_512-r0.6.so # LAPACK, simplest use vasp.4.lib/lapack_double LAPACK= ../vasp.4.lib/lapack_double.o # use atlas optimized part of lapack #LAPACK= ../vasp.4.lib/lapack_atlas.o -llapack -lcblas # use the mkl Intel lapack #LAPACK= -lmkl_lapack #----------------------------------------------------------------------- LIB = -L../vasp.4.lib -ldmy \ ../vasp.4.lib/linpack_double.o $(LAPACK) \ $(BLAS) # options for linking (for compiler version 6.X, 7.1) nothing is required LINK = # compiler version 7.0 generates some vector statments which are located # in the svml library, add the LIBPATH and the library (just in case) #LINK = -L/opt/intel/compiler70/ia32/lib/ -lsvml #----------------------------------------------------------------------- # fft libraries: # VASP.4.6 can use fftw.3.0.X (http://www.fftw.org) # since this version is faster on P4 machines, we recommend to use it #----------------------------------------------------------------------- FFT3D = fft3dfurth.o fft3dlib.o #FFT3D = fftw3d.o fft3dlib.o /opt/libs/fftw-3.0.1/lib/libfftw3.a #======================================================================= # MPI section, uncomment the following lines # # one comment for users of mpich or lam: # You must *not* compile mpi with g77/f77, because f77/g77 # appends *two* underscores to symbols that contain already an # underscore (i.e. MPI_SEND becomes mpi_send__). The pgf90/ifc # compilers however append only one underscore. # Precompiled mpi version will also not work !!! # # We found that mpich.1.2.1 and lam-6.5.X to lam-7.0.4 are stable # mpich.1.2.1 was configured with # ./configure -prefix=/usr/local/mpich_nodvdbg -fc="pgf77 -Mx,119,0x200000" \ # -f90="pgf90 " \ # --without-romio --without-mpe -opt=-O \ # # lam was configured with the line # ./configure -prefix /opt/libs/lam-7.0.4 --with-cflags=-O -with-fc=ifc \ # --with-f77flags=-O --without-romio # # please note that you might be able to use a lam or mpich version # compiled with f77/g77, but then you need to add the following # options: -Msecond_underscore (compilation) and -g77libs (linking) # # !!! Please do not send me any queries on how to install MPI, I will # certainly not answer them !!!! #======================================================================= #----------------------------------------------------------------------- # fortran linker for mpi: if you use LAM and compiled it with the options # suggested above, you can use the following line #----------------------------------------------------------------------- FC=mpif90 FCL=$(FC) #----------------------------------------------------------------------- # additional options for CPP in parallel version (see also above): # NGZhalf charge density reduced in Z direction # wNGZhalf gamma point only reduced in Z direction # scaLAPACK use scaLAPACK (usually slower on 100 Mbit Net) #----------------------------------------------------------------------- CPP = $(CPP_) -DMPI -DHOST=\"LinuxIFC\" -DIFC \ -Dkind8 -DNGZhalf -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc \ -DMPI_BLOCK=500 \ ## -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # location of SCALAPACK # if you do not use SCALAPACK simply uncomment the line SCA #----------------------------------------------------------------------- BLACS=$(HOME)/archives/SCALAPACK/BLACS/ SCA_=$(HOME)/archives/SCALAPACK/SCALAPACK SCA= $(SCA_)/libscalapack.a \ $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a $(BLACS)/LIB/blacs_MPI-LINUX-0.a $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a SCA= #----------------------------------------------------------------------- # libraries for mpi #----------------------------------------------------------------------- LIB = -L../vasp.4.lib -ldmy \ ../vasp.4.lib/linpack_double.o $(LAPACK) \ $(SCA) $(BLAS) # FFT: fftmpi.o with fft3dlib of Juergen Furthmueller FFT3D = fftmpi.o fftmpi_map.o fft3dlib.o # fftw.3.0.1 is slighly faster and should be used if available #FFT3D = fftmpiw.o fftmpi_map.o fft3dlib.o /opt/libs/fftw-3.0.1/lib/libfftw3.a #----------------------------------------------------------------------- # general rules and compile lines #----------------------------------------------------------------------- BASIC= symmetry.o symlib.o lattlib.o random.o SOURCE= base.o mpi.o smart_allocate.o xml.o \ constant.o jacobi.o main_mpi.o scala.o \ asa.o lattice.o poscar.o ini.o setex.o radial.o \ pseudo.o mgrid.o mkpoints.o wave.o wave_mpi.o $(BASIC) \ nonl.o nonlr.o dfast.o choleski2.o \ mix.o charge.o xcgrad.o xcspin.o potex1.o potex2.o \ metagga.o constrmag.o pot.o cl_shift.o force.o dos.o elf.o \ tet.o hamil.o steep.o \ chain.o dyna.o relativistic.o LDApU.o sphpro.o paw.o us.o \ ebs.o wavpre.o wavpre_noio.o broyden.o \ dynbr.o rmm-diis.o reader.o writer.o tutor.o xml_writer.o \ brent.o stufak.o fileio.o opergrid.o stepver.o \ dipol.o xclib.o chgloc.o subrot.o optreal.o davidson.o \ edtest.o electron.o shm.o pardens.o paircorrection.o \ optics.o constr_cell_relax.o stm.o finite_diff.o \ elpol.o setlocalpp.o INC= vasp: $(SOURCE) $(FFT3D) $(INC) main.o rm -f vasp $(FCL) -o vasp $(LINK) main.o $(SOURCE) $(FFT3D) $(LIB) makeparam: $(SOURCE) $(FFT3D) makeparam.o main.F $(INC) $(FCL) -o makeparam $(LINK) makeparam.o $(SOURCE) $(FFT3D) $(LIB) zgemmtest: zgemmtest.o base.o random.o $(INC) $(FCL) -o zgemmtest $(LINK) zgemmtest.o random.o base.o $(LIB) dgemmtest: dgemmtest.o base.o random.o $(INC) $(FCL) -o dgemmtest $(LINK) dgemmtest.o random.o base.o $(LIB) ffttest: base.o smart_allocate.o mpi.o mgrid.o random.o ffttest.o $(FFT3D) $(INC) $(FCL) -o ffttest $(LINK) ffttest.o mpi.o mgrid.o random.o smart_allocate.o base.o $(FFT3D) $(LIB) kpoints: $(SOURCE) $(FFT3D) makekpoints.o main.F $(INC) $(FCL) -o kpoints $(LINK) makekpoints.o $(SOURCE) $(FFT3D) $(LIB) clean: -rm -f *.g *.f *.o *.L *.mod ; touch *.F main.o: main$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c main$(SUFFIX) xcgrad.o: xcgrad$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcgrad$(SUFFIX) xcspin.o: xcspin$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcspin$(SUFFIX) makeparam.o: makeparam$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c makeparam$(SUFFIX) makeparam$(SUFFIX): makeparam.F main.F # # MIND: I do not have a full dependency list for the include # and MODULES: here are only the minimal basic dependencies # if one strucuture is changed then touch_dep must be called # with the corresponding name of the structure # base.o: base.inc base.F mgrid.o: mgrid.inc mgrid.F constant.o: constant.inc constant.F lattice.o: lattice.inc lattice.F setex.o: setexm.inc setex.F pseudo.o: pseudo.inc pseudo.F poscar.o: poscar.inc poscar.F mkpoints.o: mkpoints.inc mkpoints.F wave.o: wave.inc wave.F nonl.o: nonl.inc nonl.F nonlr.o: nonlr.inc nonlr.F $(OBJ_HIGH): $(CPP) $(FC) $(FFLAGS) $(OFLAG_HIGH) $(INCS) -c $*$(SUFFIX) $(OBJ_NOOPT): $(CPP) $(FC) $(FFLAGS) $(INCS) -c $*$(SUFFIX) fft3dlib_f77.o: fft3dlib_f77.F $(CPP) $(F77) $(FFLAGS_F77) -c $*$(SUFFIX) .F.o: $(CPP) $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) .F$(SUFFIX): $(CPP) $(SUFFIX).o: $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) # special rules #----------------------------------------------------------------------- # these special rules are cummulative (that is once failed # in one compiler version, stays in the list forever) # -tpp5|6|7 P, PII-PIII, PIV # -xW use SIMD (does not pay of on PII, since fft3d uses double prec) # all other options do no affect the code performance since -O1 is used #----------------------------------------------------------------------- fft3dlib.o : fft3dlib.F $(CPP) $(FC) -FR -lowercase -O1 -tpp7 -xW -prefetch- -unroll0 -vec_report3 -c $*$(SUFFIX) fft3dfurth.o : fft3dfurth.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) radial.o : radial.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symlib.o : symlib.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symmetry.o : symmetry.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) dynbr.o : dynbr.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) broyden.o : broyden.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) us.o : us.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) wave.o : wave.F $(CPP) $(FC) -FR -lowercase -O0 -c $*$(SUFFIX) LDApU.o : LDApU.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) |
» 猜你喜欢
之前让一硕士生水了7个发明专利,现在这7个获批发明专利的维护费可从哪儿支出哈?
已经有5人回复
博士读完未来一定会好吗
已经有29人回复
博士申请都是内定的吗?
已经有5人回复
到新单位后,换了新的研究方向,没有团队,持续积累2区以上论文,能申请到面上吗
已经有12人回复
投稿精细化工
已经有4人回复
高职单位投计算机相关的北核或SCI四区期刊推荐,求支招!
已经有4人回复
导师想让我从独立一作变成了共一第一
已经有9人回复
读博
已经有4人回复
JMPT 期刊投稿流程
已经有4人回复
心脉受损
已经有5人回复
» 本主题相关价值贴推荐,对您同样有帮助:
vasp跨节点运行出错,mpiexec_node-1 (handle_stdin_input 1089)
已经有5人回复
vasp5.2 openmpi+intel+mkl编译,能量正值问题
已经有7人回复
vasp并行测试时出错。
已经有7人回复
vasp 并行运算出错
已经有7人回复
vasp5.2 并行编译如何修改makefile
已经有5人回复
【求助】vasp计算过渡态结果出错
已经有17人回复
【求助】VASP编译出错
已经有5人回复
【求助】VASP考虑自旋轨道耦合的话,如何编译
已经有17人回复
【求助】vasp编译时,报错:cannot find -lmkl_em64t
已经有11人回复
【求助】VASP 编译出错
已经有24人回复
saikun
木虫 (正式写手)
- 应助: 4 (幼儿园)
- 金币: 2483.2
- 散金: 891
- 红花: 5
- 帖子: 424
- 在线: 89.1小时
- 虫号: 197920
- 注册: 2006-02-25
- 性别: GG
- 专业: 古陶瓷与传统陶瓷
【答案】应助回帖
★
youzhizhe(金币+1): 谢谢交流。 2011-05-02 09:14:12
mengfc(金币+10): 2011-05-02 09:33:49
mengfc(金币+5): 谢谢回复,请看十楼的makefile有没有问题? 2011-05-02 16:02:46
youzhizhe(金币+1): 谢谢交流。 2011-05-02 09:14:12
mengfc(金币+10): 2011-05-02 09:33:49
mengfc(金币+5): 谢谢回复,请看十楼的makefile有没有问题? 2011-05-02 16:02:46
|
把goto-blas那句改成用mkl或先编译goto-blas就可以了。 但我觉得mkl比较快(xeon上) 本来intel上有一篇指南的,但好象没了。只能在google cache里找到,你看看。 ======================= Using Intel® MKL in VASP October 5, 2010 12:50 AM PDT Step 1 - Overview This guide is intended to help current VASP* (Vienna Ab-Initio Package Simulation*) users get better benchmark performance by utilizing Intel® Math Kernel Library (Intel® MKL). VASP is a package for performing ab-initio quantum-mechanical molecular dynamics (MD) using pseudo potentials and a plane wave basis set. The approach implemented in VAMP/VASP is based on a finite-temperature local-density approximation (with the free energy as variational quantity) and an exact evaluation of the instantaneous electronic ground state at each MD-step using efficient matrix diagonalization schemes and an efficient Pulay mixing. These techniques avoid all problems occurring in the original Car-Parrinello method which is based on the simultaneous integration of electronic and ionic equations of motion. The interaction between ions and electrons is described using ultrasoft Vanderbilt pseudopotentials (US-PP) or the projector augmented wave method (PAW). Both techniques allow a considerable reduction of the necessary number of plane-waves per atom for transition metals and first row elements. Forces and stress can be easily calculated with VAMP/VASP and used to relax atoms into their instantaneous groundstate. [Ref: VASP*] Version Information This application note was created to help users who benchmark clusters using VASP to also incorporate the latest version of Intel® MKL on Linux* platforms on Intel® Xeon® *processor-based systems. Specifically, we'll address Intel® MKL version 10.2. More information on VASP can be found from http://cms.mpi.univie.ac.at/vasp/* Step 2 - Downloading VASP Source Code VASP is not public-domain or shareware, and will be distributed only after a license contract has been signed. Enquiries must be sent to Juerrgen Hafner. The enquiry should contain a short description of the short term research aims (less than half a page). Prerequisites: Intel® MKL contains highly optimized FFT and also the wrappers for FFTW, which can be obtained using the following options: Download a FREE evaluation version of the Intel® MKL product. Download the FREE non-commercial* version of the Intel® MKL product. Purchase a copy of Intel® MKL. All of these can be obtained at: Intel® Math Kernel Library Product Page. Obtain Intel® C++ and Fortran compilers. Obtain Intel® MPI. Obtain Open Source MPI (MPICH2) Step 3 - Configuration Use the following commands to extract the VASP files: $tar -xvzf vasp.tgz This will create a directory called vasp. Make sure that MPI, C++ and Fortran compilers are installed and in the PATH. Also set LD_LIBRARY_PATH to your compiler (C++ and Fortran), MPI, and Intel® MKL libraries. Note: This application note is written specifically for use with the Intel® compilers and MPI. Step 4 - Building VASP Build FFTW wrapper library for Intel® MKL Change directory to Assuming default installation-- $cd /opt/intel/mkl/10.2.0.013/interfaces/fftw3xf Build the fftw 3.x Fortran wrapper library for Intel® 64 (previously known as em64t) platform using the Intel compilers. $make libem64t compiler=intel This will create the fftw wrapper library "libfftw3xf_intel.a" in lib/em64t directory of your Intel® MKL installation. Build libdmy.a Change directory to vasp.x.x.lib. Run the following command from vasp.x.x.lib, using the Makefile for Linux* using the Intel compiler. #make -f makefile.linux_ifc_P4 After a successful compilation, libdmy.a will be built in the same directory. Build VASP Change directory to vasp.x.x. Edit the makefile.linux_ifc_P4 to link with Intel® MKL libraries. Go to the section where BLAS, LAPACK and FFT libraries are specified and change as below, assuming the default installation /opt/intel/mkl/10.2.0.013 for MKL 10.2. BLAS=-L/opt/intel/mkl/10.2.0.013/lib/em64t -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread LAPACK=-L/opt/intel/mkl/10.2.0.013/lib/em64t -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread FFT3D= fftmpiw.o fftmpi_map.o fft3dlib.o /opt/intel/mkl/10.2.0.013/lib/em64t/libfftw3xf_intel.a Modify Fortran compiler flags line to include fftw3.f from Intel® MKL include directory as below: FFLAGS= -I/opt/intel/mkl/10.2.0.013/include/fftw -FR -lower_case This will eliminate compilation errors pertaining to "cannot open include file `fftw3.f' when Intel® MKL FFT is used rather than VASP's own FFTs. To use Intel® MKL ScaLAPACK, modify additional options for CPP for parallel version adding -DscaLAPACK to the CPP line and specify Intel® MKL ScaLAPACK as below: SCA=/opt/intel/mkl/10.2.0.013/lib/em64t/libmkl_scalapack_lp64.a /opt/intel/mkl/10.2.0.13/lib/em64t/libmkl_blacs_intelmpi_lp64.a To use Intel® MPI specify Fortran compiler as FC=mpiifort. Run the following command to build vasp: $make -f makefile.linux_ifc_P4 This will create the VASP executable in the current directory. Step 5 - Running VASP Run vasp by executing the following command with your workloads: $mpirun -n 4 ./vasp |
2楼2011-05-01 23:34:04
hakuna
木虫 (知名作家)
- 1ST强帖: 3
- 应助: 82 (初中生)
- 金币: 8735.9
- 散金: 158
- 红花: 126
- 帖子: 5599
- 在线: 1367.2小时
- 虫号: 167622
- 注册: 2006-01-15
- 专业: 考古理论
3楼2011-05-01 23:36:23
mengfc
金虫 (正式写手)
- 1ST强帖: 1
- 应助: 20 (小学生)
- 金币: 1301.2
- 散金: 1807
- 红花: 5
- 帖子: 927
- 在线: 524.4小时
- 虫号: 818484
- 注册: 2009-07-30
- 性别: GG
- 专业: 凝聚态物性 II :电子结构
|
你好,谢谢你的回帖。我按着这上面又改了一下maekfile,还是不行。把错误信息贴在后面,请指点一下。 问题一:在vasp.4.lib文件夹中make时,提示 gcc -E -P -C preclib.F >preclib.f ifort -O0 -FI -FR -c preclib.f cc -O -c timing_.c cc -O -c derrf_.c cc -O -c dclock_.c gcc -E -P -C diolib.F >diolib.f ifort -O0 -FI -FR -c diolib.f gcc -E -P -C dlexlib.F >dlexlib.f ifort -O0 -FI -FR -c dlexlib.f gcc -E -P -C drdatab.F >drdatab.f ifort -O0 -FI -FR -c drdatab.f ifort -O0 -FI -c lapack_double.f ifort -O0 -FI -c linpack_double.f ifort -O0 -FI -c lapack_atlas.f rm libdmy.a rm: cannot remove `libdmy.a': No such file or directory make: [libdmy.a] Error 1 (ignored) ar vq libdmy.a preclib.o timing_.o derrf_.o dclock_.o diolib.o dlexlib.o drdatab.o ar: creating libdmy.a a - preclib.o a - timing_.o a - derrf_.o a - dclock_.o a - diolib.o a - dlexlib.o a - drdatab.o 这是否正确编译了库文件? 问题二:在vasp.4.6 文件夹中make时,出错: elpol.f90(693): warning #7866: The statement following this DEC loop optimization directive must be an iterative do-stmt, a vector assignment, an OMP pdo-directive, or an OMP parallel-do-directive. !DIR$ IVDEP ------^ ifort: warning #10147: no action performed for specified file(s) ./preprocess mpif90 /opt/intel/composerxe-2011.3.174/mkl/include/fftw -FR -lowercase -O3 -xW -c setlocalpp.f90 ifort: command line remark #10010: option '-lowercase' is deprecated and will be removed in a future release. See '-help deprecated' ifort: command line remark #10279: option '-xW' is deprecated and will be removed in a future release. See '-help deprecated' ifort: warning #10147: no action performed for specified file(s) make: *** No rule to make target `FFT3D', needed by `vasp'. Stop. makefile 如下,不知应如何修改? .SUFFIXES: .inc .f .f90 .F #----------------------------------------------------------------------- # Makefile for Intel Fortran compiler for P4 systems # # The makefile was tested only under Linux on Intel platforms # (Suse 5.3- Suse 9.0) # the followin compiler versions have been tested # 5.0, 6.0, 7.0 and 7.1 (some 8.0 versions seem to fail compiling the code) # presently we recommend version 7.1 or 7.0, since these # releases have been used to compile the present code versions # # it might be required to change some of library pathes, since # LINUX installation vary a lot # Hence check ***ALL**** options in this makefile very carefully #----------------------------------------------------------------------- # # BLAS must be installed on the machine # there are several options: # 1) very slow but works: # retrieve the lapackage from ftp.netlib.org # and compile the blas routines (BLAS/SRC directory) # please use g77 or f77 for the compilation. When I tried to # use pgf77 or pgf90 for BLAS, VASP hang up when calling # ZHEEV (however this was with lapack 1.1 now I use lapack 2.0) # 2) most desirable: get an optimized BLAS # # the two most reliable packages around are presently: # 3a) Intels own optimised BLAS (PIII, P4, Itanium) # http://developer.intel.com/software/products/mkl/ # this is really excellent when you use Intel CPU's # # 3b) or obtain the atlas based BLAS routines # http://math-atlas.sourceforge.net/ # you certainly need atlas on the Athlon, since the mkl # routines are not optimal on the Athlon. # If you want to use atlas based BLAS, check the lines around LIB= # # 3c) mindblowing fast SSE2 (4 GFlops on P4, 2.53 GHz) # Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html # #----------------------------------------------------------------------- # all CPP processed fortran files have the extension .f90 SUFFIX=.f90 #----------------------------------------------------------------------- # fortran compiler and linker #----------------------------------------------------------------------- FC=ifort # fortran linker FCL=$(FC) #----------------------------------------------------------------------- # whereis CPP ?? (I need CPP, can't use gcc with proper options) # that's the location of gcc for SUSE 5.3 # # CPP_ = /usr/lib/gcc-lib/i486-linux/2.7.2/cpp -P -C # # that's probably the right line for some Red Hat distribution: # # CPP_ = /usr/lib/gcc-lib/i386-redhat-linux/2.7.2.3/cpp -P -C # # SUSE X.X, maybe some Red Hat distributions: CPP_ = ./preprocess <$*.F | /usr/bin/cpp -P -C -traditional >$*$(SUFFIX) #----------------------------------------------------------------------- # possible options for CPP: # NGXhalf charge density reduced in X direction # wNGXhalf gamma point only reduced in X direction # avoidalloc avoid ALLOCATE if possible # IFC work around some IFC bugs # CACHE_SIZE 1000 for PII,PIII, 5000 for Athlon, 8000-12000 P4 # RPROMU_DGEMV use DGEMV instead of DGEMM in RPRO (depends on used BLAS) # RACCMU_DGEMV use DGEMV instead of DGEMM in RACC (depends on used BLAS) #----------------------------------------------------------------------- CPP = $(CPP_) -DHOST=\"LinuxIFC\" \ -Dkind8 -DNGXhalf -DCACHE_SIZE=12000 -DPGF90 -Davoidalloc \ # -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # general fortran flags (there must a trailing blank on this line) #----------------------------------------------------------------------- FFLAGS = /opt/intel/composerxe-2011.3.174/mkl/include/fftw -FR -lowercase #----------------------------------------------------------------------- # optimization # we have tested whether higher optimisation improves performance # -axK SSE1 optimization, but also generate code executable on all mach. # xK improves performance somewhat on XP, and a is required in order # to run the code on older Athlons as well # -xW SSE2 optimization # -axW SSE2 optimization, but also generate code executable on all mach. # -tpp6 P3 optimization # -tpp7 P4 optimization #----------------------------------------------------------------------- OFLAG=-O3 -xW OFLAG_HIGH = $(OFLAG) OBJ_HIGH = OBJ_NOOPT = DEBUG = -FR -O0 INLINE = $(OFLAG) #----------------------------------------------------------------------- # the following lines specify the position of BLAS and LAPACK # on P4, VASP works fastest with the libgoto library # so that's what I recommend #----------------------------------------------------------------------- # Atlas based libraries #ATLASHOME= $(HOME)/archives/BLAS_OPT/ATLAS/lib/Linux_P4SSE2/ #BLAS= -L$(ATLASHOME) -lf77blas -latlas # use specific libraries (default library path might point to other libraries) #BLAS= $(ATLASHOME)/libf77blas.a $(ATLASHOME)/libatlas.a # use the mkl Intel libraries for p4 (www.intel.com) # mkl.5.1 # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines #BLAS=-L/opt/intel/mkl/lib/32 -lmkl_p4 -lpthread # mkl.5.2 requires also to -lguide library # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines BLAS=-L/opt/intel/composerxe-2011.3.174/mkl/lib/ia32 -lmkl_intel-lmkl_sequential -lmkl_core -lpthread # even faster Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html #BLAS= /opt/libs/libgoto/libgoto_p4_512-r0.6.so # LAPACK, simplest use vasp.4.lib/lapack_double #LAPACK= ../vasp.4.lib/lapack_double.o # use atlas optimized part of lapack #LAPACK= ../vasp.4.lib/lapack_atlas.o -llapack -lcblas # use the mkl Intel lapack LAPACK=-L/opt/intel/composerxe-2011.3.174/mkl/lib/ia32 -lmkl_intel-lmkl_sequential -lmkl_core -lpthread #----------------------------------------------------------------------- LIB = -L../vasp.4.lib -ldmy \ ../vasp.4.lib/linpack_double.o $(LAPACK) \ $(BLAS) # options for linking (for compiler version 6.X, 7.1) nothing is required LINK = # compiler version 7.0 generates some vector statments which are located # in the svml library, add the LIBPATH and the library (just in case) #LINK = -L/opt/intel/compiler70/ia32/lib/ -lsvml #----------------------------------------------------------------------- # fft libraries: # VASP.4.6 can use fftw.3.0.X (http://www.fftw.org) # since this version is faster on P4 machines, we recommend to use it #----------------------------------------------------------------------- #FFT3D = fft3dfurth.o fft3dlib.o FFT3D = fftmpiw.o fftmpi_map.o fft3dlib.o /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libfftw3xf_intel.a #======================================================================= # MPI section, uncomment the following lines # # one comment for users of mpich or lam: # You must *not* compile mpi with g77/f77, because f77/g77 # appends *two* underscores to symbols that contain already an # underscore (i.e. MPI_SEND becomes mpi_send__). The pgf90/ifc # compilers however append only one underscore. # Precompiled mpi version will also not work !!! # # We found that mpich.1.2.1 and lam-6.5.X to lam-7.0.4 are stable # mpich.1.2.1 was configured with # ./configure -prefix=/usr/local/mpich_nodvdbg -fc="pgf77 -Mx,119,0x200000" \ # -f90="pgf90 " \ # --without-romio --without-mpe -opt=-O \ # # lam was configured with the line # ./configure -prefix /opt/libs/lam-7.0.4 --with-cflags=-O -with-fc=ifc \ # --with-f77flags=-O --without-romio # # please note that you might be able to use a lam or mpich version # compiled with f77/g77, but then you need to add the following # options: -Msecond_underscore (compilation) and -g77libs (linking) # # !!! Please do not send me any queries on how to install MPI, I will # certainly not answer them !!!! #======================================================================= #----------------------------------------------------------------------- # fortran linker for mpi: if you use LAM and compiled it with the options # suggested above, you can use the following line #----------------------------------------------------------------------- FC=mpif90 FCL=$(FC) #----------------------------------------------------------------------- # additional options for CPP in parallel version (see also above): # NGZhalf charge density reduced in Z direction # wNGZhalf gamma point only reduced in Z direction # scaLAPACK use scaLAPACK (usually slower on 100 Mbit Net) #----------------------------------------------------------------------- CPP = $(CPP_) -DMPI -DHOST=\"LinuxIFC\" -DIFC \ -Dkind8 -DNGZhalf -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc \ -DMPI_BLOCK=500 \ ## -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # location of SCALAPACK # if you do not use SCALAPACK simply uncomment the line SCA #----------------------------------------------------------------------- BLACS=$(HOME)/archives/SCALAPACK/BLACS/ SCA_=$(HOME)/archives/SCALAPACK/SCALAPACK SCA= $(SCA_)/libscalapack.a \ $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a $(BLACS)/LIB/blacs_MPI-LINUX-0.a $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a SCA=/opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_scalapack_core.a /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_blacs_intelmpi.a #----------------------------------------------------------------------- # libraries for mpi #----------------------------------------------------------------------- #LIB = -L../vasp.4.lib -ldmy \ # ../vasp.4.lib/linpack_double.o $(LAPACK) \ # $(SCA) $(BLAS) # FFT: fftmpi.o with fft3dlib of Juergen Furthmueller FFT3D = FFT3D = fftmpiw.o fftmpi_map.o fft3dlib.o /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libfftw3xf_intel.a # fftw.3.0.1 is slighly faster and should be used if available #FFT3D = fftmpiw.o fftmpi_map.o fft3dlib.o /opt/libs/fftw-3.0.1/lib/libfftw3.a #----------------------------------------------------------------------- # general rules and compile lines #----------------------------------------------------------------------- BASIC= symmetry.o symlib.o lattlib.o random.o SOURCE= base.o mpi.o smart_allocate.o xml.o \ constant.o jacobi.o main_mpi.o scala.o \ asa.o lattice.o poscar.o ini.o setex.o radial.o \ pseudo.o mgrid.o mkpoints.o wave.o wave_mpi.o $(BASIC) \ nonl.o nonlr.o dfast.o choleski2.o \ mix.o charge.o xcgrad.o xcspin.o potex1.o potex2.o \ metagga.o constrmag.o pot.o cl_shift.o force.o dos.o elf.o \ tet.o hamil.o steep.o \ chain.o dyna.o relativistic.o LDApU.o sphpro.o paw.o us.o \ ebs.o wavpre.o wavpre_noio.o broyden.o \ dynbr.o rmm-diis.o reader.o writer.o tutor.o xml_writer.o \ brent.o stufak.o fileio.o opergrid.o stepver.o \ dipol.o xclib.o chgloc.o subrot.o optreal.o davidson.o \ edtest.o electron.o shm.o pardens.o paircorrection.o \ optics.o constr_cell_relax.o stm.o finite_diff.o \ elpol.o setlocalpp.o INC= vasp: $(SOURCE) $(FFT3D) $(INC) main.o rm -f vasp $(FCL) -o vasp $(LINK) main.o $(SOURCE) $(FFT3D) $(LIB) makeparam: $(SOURCE) $(FFT3D) makeparam.o main.F $(INC) $(FCL) -o makeparam $(LINK) makeparam.o $(SOURCE) $(FFT3D) $(LIB) zgemmtest: zgemmtest.o base.o random.o $(INC) $(FCL) -o zgemmtest $(LINK) zgemmtest.o random.o base.o $(LIB) dgemmtest: dgemmtest.o base.o random.o $(INC) $(FCL) -o dgemmtest $(LINK) dgemmtest.o random.o base.o $(LIB) ffttest: base.o smart_allocate.o mpi.o mgrid.o random.o ffttest.o $(FFT3D) $(INC) $(FCL) -o ffttest $(LINK) ffttest.o mpi.o mgrid.o random.o smart_allocate.o base.o $(FFT3D) $(LIB) kpoints: $(SOURCE) $(FFT3D) makekpoints.o main.F $(INC) $(FCL) -o kpoints $(LINK) makekpoints.o $(SOURCE) $(FFT3D) $(LIB) clean: -rm -f *.g *.f *.o *.L *.mod ; touch *.F main.o: main$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c main$(SUFFIX) xcgrad.o: xcgrad$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcgrad$(SUFFIX) xcspin.o: xcspin$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcspin$(SUFFIX) makeparam.o: makeparam$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c makeparam$(SUFFIX) makeparam$(SUFFIX): makeparam.F main.F # # MIND: I do not have a full dependency list for the include # and MODULES: here are only the minimal basic dependencies # if one strucuture is changed then touch_dep must be called # with the corresponding name of the structure # base.o: base.inc base.F mgrid.o: mgrid.inc mgrid.F constant.o: constant.inc constant.F lattice.o: lattice.inc lattice.F setex.o: setexm.inc setex.F pseudo.o: pseudo.inc pseudo.F poscar.o: poscar.inc poscar.F mkpoints.o: mkpoints.inc mkpoints.F wave.o: wave.inc wave.F nonl.o: nonl.inc nonl.F nonlr.o: nonlr.inc nonlr.F $(OBJ_HIGH): $(CPP) $(FC) $(FFLAGS) $(OFLAG_HIGH) $(INCS) -c $*$(SUFFIX) $(OBJ_NOOPT): $(CPP) $(FC) $(FFLAGS) $(INCS) -c $*$(SUFFIX) fft3dlib_f77.o: fft3dlib_f77.F $(CPP) $(F77) $(FFLAGS_F77) -c $*$(SUFFIX) .F.o: $(CPP) $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) .F$(SUFFIX): $(CPP) $(SUFFIX).o: $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) # special rules #----------------------------------------------------------------------- # these special rules are cummulative (that is once failed # in one compiler version, stays in the list forever) # -tpp5|6|7 P, PII-PIII, PIV # -xW use SIMD (does not pay of on PII, since fft3d uses double prec) # all other options do no affect the code performance since -O1 is used #----------------------------------------------------------------------- fft3dlib.o : fft3dlib.F $(CPP) $(FC) -FR -lowercase -O1 -tpp7 -xW -prefetch- -unroll0 -vec_report3 -c $*$(SUFFIX) fft3dfurth.o : fft3dfurth.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) radial.o : radial.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symlib.o : symlib.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symmetry.o : symmetry.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) dynbr.o : dynbr.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) broyden.o : broyden.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) us.o : us.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) wave.o : wave.F $(CPP) $(FC) -FR -lowercase -O0 -c $*$(SUFFIX) LDApU.o : LDApU.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) |
4楼2011-05-02 09:31:56
enola
捐助贵宾 (正式写手)
- 1ST强帖: 2
- 应助: 13 (小学生)
- 金币: 1485.9
- 散金: 1549
- 红花: 14
- 帖子: 847
- 在线: 102.5小时
- 虫号: 947564
- 注册: 2010-01-24
- 专业: 半导体光电子器件

5楼2011-05-02 11:30:37
enola
捐助贵宾 (正式写手)
- 1ST强帖: 2
- 应助: 13 (小学生)
- 金币: 1485.9
- 散金: 1549
- 红花: 14
- 帖子: 847
- 在线: 102.5小时
- 虫号: 947564
- 注册: 2010-01-24
- 专业: 半导体光电子器件

6楼2011-05-02 11:31:55
【答案】应助回帖
mengfc(金币+5): 2011-05-02 15:19:56
|
# even faster Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html #BLAS= /opt/libs/libgoto/libgoto_p4_512-r0.6.so # LAPACK, simplest use vasp.4.lib/lapack_double #LAPACK= ../vasp.4.lib/lapack_double.o # use atlas optimized part of lapack #LAPACK= ../vasp.4.lib/lapack_atlas.o -llapack -lcblas # use the mkl Intel lapack LAPACK=-L/opt/intel/composerxe-2011.3.174/mkl/lib/ia32 -lmkl_intel-lmkl_sequential -lmkl_core -lpthread 还是用自带的吧: # even faster Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html #BLAS= /opt/libs/libgoto/libgoto_p4_512-r0.6.so # LAPACK, simplest use vasp.4.lib/lapack_double LAPACK= ../vasp.4.lib/lapack_double.o # use atlas optimized part of lapack #LAPACK= ../vasp.4.lib/lapack_atlas.o -llapack -lcblas # use the mkl Intel lapack #LAPACK=-L/opt/intel/composerxe-2011.3.174/mkl/lib/ia32 -lmkl_intel-lmkl_sequential #-lmkl_core -lpthread |

7楼2011-05-02 12:03:42
小木虫于
金虫 (正式写手)
- 应助: 5 (幼儿园)
- 金币: 1052.9
- 散金: 56
- 红花: 1
- 帖子: 419
- 在线: 293.1小时
- 虫号: 1027530
- 注册: 2010-05-25
- 性别: GG
- 专业: 凝聚态物性 II :电子结构
8楼2011-05-02 13:32:46
mengfc
金虫 (正式写手)
- 1ST强帖: 1
- 应助: 20 (小学生)
- 金币: 1301.2
- 散金: 1807
- 红花: 5
- 帖子: 927
- 在线: 524.4小时
- 虫号: 818484
- 注册: 2009-07-30
- 性别: GG
- 专业: 凝聚态物性 II :电子结构
|
谢谢你,越装越没信心了。。。。 /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_intel.so: undefined reference to `mkl_lapack_dgtcon' /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_intel.so: undefined reference to `mkl_lapack_zcposv' /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_intel.so: undefined reference to `mkl_lapack_ssyevd' /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_intel.so: undefined reference to `mkl_spblas_mkl_cspblas_ccoosymv' /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_intel.so: undefined reference to `mkl_pds_dss_factor_complex_d' make: *** [vasp] Error 1 这个错误是什么原因呢? |
9楼2011-05-02 15:20:32
mengfc
金虫 (正式写手)
- 1ST强帖: 1
- 应助: 20 (小学生)
- 金币: 1301.2
- 散金: 1807
- 红花: 5
- 帖子: 927
- 在线: 524.4小时
- 虫号: 818484
- 注册: 2009-07-30
- 性别: GG
- 专业: 凝聚态物性 II :电子结构
|
[mengfc@localhost vasp.4.6]$ ./vasp Error reading item 'IMAGES' from file INCAR. 这样算编译成功了吗? 另附我的makefile,请帮忙看看可以吗? .SUFFIXES: .inc .f .f90 .F #----------------------------------------------------------------------- # Makefile for Intel Fortran compiler for P4 systems # # The makefile was tested only under Linux on Intel platforms # (Suse 5.3- Suse 9.0) # the followin compiler versions have been tested # 5.0, 6.0, 7.0 and 7.1 (some 8.0 versions seem to fail compiling the code) # presently we recommend version 7.1 or 7.0, since these # releases have been used to compile the present code versions # # it might be required to change some of library pathes, since # LINUX installation vary a lot # Hence check ***ALL**** options in this makefile very carefully #----------------------------------------------------------------------- # # BLAS must be installed on the machine # there are several options: # 1) very slow but works: # retrieve the lapackage from ftp.netlib.org # and compile the blas routines (BLAS/SRC directory) # please use g77 or f77 for the compilation. When I tried to # use pgf77 or pgf90 for BLAS, VASP hang up when calling # ZHEEV (however this was with lapack 1.1 now I use lapack 2.0) # 2) most desirable: get an optimized BLAS # # the two most reliable packages around are presently: # 3a) Intels own optimised BLAS (PIII, P4, Itanium) # http://developer.intel.com/software/products/mkl/ # this is really excellent when you use Intel CPU's # # 3b) or obtain the atlas based BLAS routines # http://math-atlas.sourceforge.net/ # you certainly need atlas on the Athlon, since the mkl # routines are not optimal on the Athlon. # If you want to use atlas based BLAS, check the lines around LIB= # # 3c) mindblowing fast SSE2 (4 GFlops on P4, 2.53 GHz) # Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html # #----------------------------------------------------------------------- # all CPP processed fortran files have the extension .f90 SUFFIX=.f90 #----------------------------------------------------------------------- # fortran compiler and linker #----------------------------------------------------------------------- FC=ifort # fortran linker FCL=$(FC) #----------------------------------------------------------------------- # whereis CPP ?? (I need CPP, can't use gcc with proper options) # that's the location of gcc for SUSE 5.3 # # CPP_ = /usr/lib/gcc-lib/i486-linux/2.7.2/cpp -P -C # # that's probably the right line for some Red Hat distribution: # # CPP_ = /usr/lib/gcc-lib/i386-redhat-linux/2.7.2.3/cpp -P -C # # SUSE X.X, maybe some Red Hat distributions: CPP_ = ./preprocess <$*.F | /usr/bin/cpp -P -C -traditional >$*$(SUFFIX) #----------------------------------------------------------------------- # possible options for CPP: # NGXhalf charge density reduced in X direction # wNGXhalf gamma point only reduced in X direction # avoidalloc avoid ALLOCATE if possible # IFC work around some IFC bugs # CACHE_SIZE 1000 for PII,PIII, 5000 for Athlon, 8000-12000 P4 # RPROMU_DGEMV use DGEMV instead of DGEMM in RPRO (depends on used BLAS) # RACCMU_DGEMV use DGEMV instead of DGEMM in RACC (depends on used BLAS) #----------------------------------------------------------------------- CPP = $(CPP_) -DHOST=\"LinuxIFC\" \ -Dkind8 -DNGXhalf -DCACHE_SIZE=12000 -DPGF90 -Davoidalloc \ -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # general fortran flags (there must a trailing blank on this line) #----------------------------------------------------------------------- FFLAGS = -FR -lowercase -heap-arrays 64 -assume byterecl #----------------------------------------------------------------------- # optimization # we have tested whether higher optimisation improves performance # -axK SSE1 optimization, but also generate code executable on all mach. # xK improves performance somewhat on XP, and a is required in order # to run the code on older Athlons as well # -xW SSE2 optimization # -axW SSE2 optimization, but also generate code executable on all mach. # -tpp6 P3 optimization # -tpp7 P4 optimization #----------------------------------------------------------------------- OFLAG=-O3 -xW OFLAG_HIGH = $(OFLAG) OBJ_HIGH = OBJ_NOOPT = DEBUG = -FR -O0 INLINE = $(OFLAG) #----------------------------------------------------------------------- # the following lines specify the position of BLAS and LAPACK # on P4, VASP works fastest with the libgoto library # so that's what I recommend #----------------------------------------------------------------------- # Atlas based libraries #ATLASHOME= $(HOME)/archives/BLAS_OPT/ATLAS/lib/Linux_P4SSE2/ #BLAS= -L$(ATLASHOME) -lf77blas -latlas # use specific libraries (default library path might point to other libraries) #BLAS= $(ATLASHOME)/libf77blas.a $(ATLASHOME)/libatlas.a # use the mkl Intel libraries for p4 (www.intel.com) # mkl.5.1 # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines #BLAS=-L/opt/intel/mkl/lib/32 -lmkl_p4 -lpthread # mkl.5.2 requires also to -lguide library # set -DRPROMU_DGEMV -DRACCMU_DGEMV in the CPP lines BLAS=-L/opt/intel/composerxe-2011.3.174/mkl/lib/ia32 -lmkl_intel -lmkl_sequential -lmkl_core -lpthread # even faster Kazushige Goto's BLAS # http://www.cs.utexas.edu/users/kgoto/signup_first.html #BLAS= /opt/libs/libgoto/libgoto_p4_512-r0.6.so # LAPACK, simplest use vasp.4.lib/lapack_double LAPACK= ../vasp.4.lib/lapack_double.o # use atlas optimized part of lapack #LAPACK= ../vasp.4.lib/lapack_atlas.o -llapack -lcblas # use the mkl Intel lapack #LAPACK= -lmkl_lapack #----------------------------------------------------------------------- LIB = -L../vasp.4.lib -ldmy \ ../vasp.4.lib/linpack_double.o $(LAPACK) \ $(BLAS) # options for linking (for compiler version 6.X, 7.1) nothing is required LINK = # compiler version 7.0 generates some vector statments which are located # in the svml library, add the LIBPATH and the library (just in case) #LINK = -L/opt/intel/compiler70/ia32/lib/ -lsvml #----------------------------------------------------------------------- # fft libraries: # VASP.4.6 can use fftw.3.0.X (http://www.fftw.org) # since this version is faster on P4 machines, we recommend to use it #----------------------------------------------------------------------- #FFT3D = fft3dfurth.o fft3dlib.o #FFT3D = fftw3d.o fft3dlib.o /opt/libs/fftw-3.0.1/lib/libfftw3.a #======================================================================= # MPI section, uncomment the following lines # # one comment for users of mpich or lam: # You must *not* compile mpi with g77/f77, because f77/g77 # appends *two* underscores to symbols that contain already an # underscore (i.e. MPI_SEND becomes mpi_send__). The pgf90/ifc # compilers however append only one underscore. # Precompiled mpi version will also not work !!! # # We found that mpich.1.2.1 and lam-6.5.X to lam-7.0.4 are stable # mpich.1.2.1 was configured with # ./configure -prefix=/usr/local/mpich_nodvdbg -fc="pgf77 -Mx,119,0x200000" \ # -f90="pgf90 " \ # --without-romio --without-mpe -opt=-O \ # # lam was configured with the line # ./configure -prefix /opt/libs/lam-7.0.4 --with-cflags=-O -with-fc=ifc \ # --with-f77flags=-O --without-romio # # please note that you might be able to use a lam or mpich version # compiled with f77/g77, but then you need to add the following # options: -Msecond_underscore (compilation) and -g77libs (linking) # # !!! Please do not send me any queries on how to install MPI, I will # certainly not answer them !!!! #======================================================================= #----------------------------------------------------------------------- # fortran linker for mpi: if you use LAM and compiled it with the options # suggested above, you can use the following line #----------------------------------------------------------------------- FC=mpif90 FCL=$(FC) #----------------------------------------------------------------------- # additional options for CPP in parallel version (see also above): # NGZhalf charge density reduced in Z direction # wNGZhalf gamma point only reduced in Z direction # scaLAPACK use scaLAPACK (usually slower on 100 Mbit Net) #----------------------------------------------------------------------- CPP = $(CPP_) -DMPI -DHOST=\"LinuxIFC\" -DIFC \ -Dkind8 -DNGZhalf -DCACHE_SIZE=4000 -DPGF90 -Davoidalloc \ -DMPI_BLOCK=500 \ -DRPROMU_DGEMV -DRACCMU_DGEMV #----------------------------------------------------------------------- # location of SCALAPACK # if you do not use SCALAPACK simply uncomment the line SCA #----------------------------------------------------------------------- BLACS=$(HOME)/archives/SCALAPACK/BLACS/ SCA_=$(HOME)/archives/SCALAPACK/SCALAPACK SCA= $(SCA_)/libscalapack.a \ $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a $(BLACS)/LIB/blacs_MPI-LINUX-0.a $(BLACS)/LIB/blacsF77init_MPI-LINUX-0.a SCA=/opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_scalapack_core.a /opt/intel/composerxe-2011.3.174/mkl/lib/ia32/libmkl_blacs_intelmpi.a #----------------------------------------------------------------------- # libraries for mpi #----------------------------------------------------------------------- #LIB = -L../vasp.4.lib -ldmy \ # ../vasp.4.lib/linpack_double.o $(LAPACK) \ # $(SCA) $(BLAS) # FFT: fftmpi.o with fft3dlib of Juergen Furthmueller FFT3D = fftmpi.o fftmpi_map.o fft3dlib.o # fftw.3.0.1 is slighly faster and should be used if available #FFT3D = fftmpiw.o fftmpi_map.o fft3dlib.o /opt/intel/composerxe-2011.3.174/mkl/interfaces/fftw3xf/libfftw3xf_intel.a #----------------------------------------------------------------------- # general rules and compile lines #----------------------------------------------------------------------- BASIC= symmetry.o symlib.o lattlib.o random.o SOURCE= base.o mpi.o smart_allocate.o xml.o \ constant.o jacobi.o main_mpi.o scala.o \ asa.o lattice.o poscar.o ini.o setex.o radial.o \ pseudo.o mgrid.o mkpoints.o wave.o wave_mpi.o $(BASIC) \ nonl.o nonlr.o dfast.o choleski2.o \ mix.o charge.o xcgrad.o xcspin.o potex1.o potex2.o \ metagga.o constrmag.o pot.o cl_shift.o force.o dos.o elf.o \ tet.o hamil.o steep.o \ chain.o dyna.o relativistic.o LDApU.o sphpro.o paw.o us.o \ ebs.o wavpre.o wavpre_noio.o broyden.o \ dynbr.o rmm-diis.o reader.o writer.o tutor.o xml_writer.o \ brent.o stufak.o fileio.o opergrid.o stepver.o \ dipol.o xclib.o chgloc.o subrot.o optreal.o davidson.o \ edtest.o electron.o shm.o pardens.o paircorrection.o \ optics.o constr_cell_relax.o stm.o finite_diff.o \ elpol.o setlocalpp.o INC= vasp: $(SOURCE) $(FFT3D) $(INC) main.o rm -f vasp $(FCL) -o vasp $(LINK) main.o $(SOURCE) $(FFT3D) $(LIB) makeparam: $(SOURCE) $(FFT3D) makeparam.o main.F $(INC) $(FCL) -o makeparam $(LINK) makeparam.o $(SOURCE) $(FFT3D) $(LIB) zgemmtest: zgemmtest.o base.o random.o $(INC) $(FCL) -o zgemmtest $(LINK) zgemmtest.o random.o base.o $(LIB) dgemmtest: dgemmtest.o base.o random.o $(INC) $(FCL) -o dgemmtest $(LINK) dgemmtest.o random.o base.o $(LIB) ffttest: base.o smart_allocate.o mpi.o mgrid.o random.o ffttest.o $(FFT3D) $(INC) $(FCL) -o ffttest $(LINK) ffttest.o mpi.o mgrid.o random.o smart_allocate.o base.o $(FFT3D) $(LIB) kpoints: $(SOURCE) $(FFT3D) makekpoints.o main.F $(INC) $(FCL) -o kpoints $(LINK) makekpoints.o $(SOURCE) $(FFT3D) $(LIB) clean: -rm -f *.g *.f *.o *.L *.mod ; touch *.F main.o: main$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c main$(SUFFIX) xcgrad.o: xcgrad$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcgrad$(SUFFIX) xcspin.o: xcspin$(SUFFIX) $(FC) $(FFLAGS) $(INLINE) $(INCS) -c xcspin$(SUFFIX) makeparam.o: makeparam$(SUFFIX) $(FC) $(FFLAGS)$(DEBUG) $(INCS) -c makeparam$(SUFFIX) makeparam$(SUFFIX): makeparam.F main.F # # MIND: I do not have a full dependency list for the include # and MODULES: here are only the minimal basic dependencies # if one strucuture is changed then touch_dep must be called # with the corresponding name of the structure # base.o: base.inc base.F mgrid.o: mgrid.inc mgrid.F constant.o: constant.inc constant.F lattice.o: lattice.inc lattice.F setex.o: setexm.inc setex.F pseudo.o: pseudo.inc pseudo.F poscar.o: poscar.inc poscar.F mkpoints.o: mkpoints.inc mkpoints.F wave.o: wave.inc wave.F nonl.o: nonl.inc nonl.F nonlr.o: nonlr.inc nonlr.F $(OBJ_HIGH): $(CPP) $(FC) $(FFLAGS) $(OFLAG_HIGH) $(INCS) -c $*$(SUFFIX) $(OBJ_NOOPT): $(CPP) $(FC) $(FFLAGS) $(INCS) -c $*$(SUFFIX) fft3dlib_f77.o: fft3dlib_f77.F $(CPP) $(F77) $(FFLAGS_F77) -c $*$(SUFFIX) .F.o: $(CPP) $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) .F$(SUFFIX): $(CPP) $(SUFFIX).o: $(FC) $(FFLAGS) $(OFLAG) $(INCS) -c $*$(SUFFIX) # special rules #----------------------------------------------------------------------- # these special rules are cummulative (that is once failed # in one compiler version, stays in the list forever) # -tpp5|6|7 P, PII-PIII, PIV # -xW use SIMD (does not pay of on PII, since fft3d uses double prec) # all other options do no affect the code performance since -O1 is used #----------------------------------------------------------------------- fft3dlib.o : fft3dlib.F $(CPP) $(FC) -FR -lowercase -O1 -xW -unroll0 -vec_report3 -c $*$(SUFFIX) fft3dfurth.o : fft3dfurth.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) radial.o : radial.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symlib.o : symlib.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) symmetry.o : symmetry.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) dynbr.o : dynbr.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) broyden.o : broyden.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) us.o : us.F $(CPP) $(FC) -FR -lowercase -O1 -c $*$(SUFFIX) wave.o : wave.F $(CPP) $(FC) -FR -lowercase -O0 -c $*$(SUFFIX) LDApU.o : LDApU.F $(CPP) $(FC) -FR -lowercase -O2 -c $*$(SUFFIX) |
10楼2011-05-02 16:00:49













回复此楼