| ²é¿´: 4151 | »Ø¸´: 9 | |||||
| ¡¾½±Àø¡¿ ±¾Ìû±»ÆÀ¼Û3´Î£¬×÷ÕßÁõÊ˳¿Ôö¼Ó½ð±Ò 2.2 ¸ö | |||||
[×ÊÔ´]
×÷Òµ¹ÜÀíϵͳTorque 2.4.16ÔÚLinuxmint 13 / Ubuntu12.04¹¤×÷Õ¾/µ¥»úÉϵݲװºÍʹÓÃ
|
|||||
|
СµÜǰ¶Îʱ¼äÔÚ¾À½áÈçºÎÔÚ×Ô¼ºµÄ¹¤×÷Õ¾Éϰ²×°PBSÈÎÎñ¹ÜÀíÈí¼þ£¬ÒòΪÈç¹ûÊÇ×Ô¼ºÐ´½Å±¾Í¶£¬ÆÕͨÈκλ¹¿ÉÒÔ£¬Èç¹ûÐèÒªÁ¬ÐøÍ¶ºÃ¼¸¸öÈÎÎñÄÇ»¹ÊÇÏ൱Âé·³µÄ£¬¾ÍÏëÆðÁ˼¯ÈºµÄPBS£¬µ«ÊÇ×Ô¼ºµ·¹ÄÁ˺þö¼Ã»¸ã¶¨£¬ºóÀ´×Ô¼ºµÄͬѧÔÚÍøÉÏÕÒµ½ÁËÕâÆª½Ì³Ì£¬³É¹¦ÁË£¬ÄóöÀ´ºÍ´ó¼Ò·ÖÏí£¬Èç¹û´ó¼Ò¸ÕºÃÐèÒª£¬¿ÉÒÔʡȥºÜ¶àÂé·³¡£ ÎÒ×Ô¼ºµÄϵͳ£ºlinuxmint14 Ò»¹²×°ÁË1̨¹¤×÷Õ¾£¨E5-2440*2£©2̨ÆǪ̃ͨʽ»ú£¨i7-3770£©Ò»Ì¨ÐéÄâ»ú£¨vmware£©¶¼³É¹¦ÁË£¬Ã»ÓгöÏÖÎÊÌ⣺ ½Ì³ÌÔÁ´½Ó£ºhttp://hi.baidu.com/xijunw/item/9a4e823959240af62684f426 ÕýÎÄ£º£¨ÔÎÄÕâÀïÓÐÒ»ÕÅͼƬ£¬µ«ÊÇÒòΪ²»Ó°ÏìÏÂÃæµÄ½Ì³Ì£¬ËùÒÔÎҾͲ»ÌùÁË£© ×¢Ò⣺£¨×Ô¼º°²×°µÄʱºò³öÏÖµÄÎÊÌ⣬ËùÒÔÒ»¶¨Òª×¢Ò⼸¸öµØ·½£© 1.¼ÆËã½Úµã½Ì³ÌÊÇcalnode1£¬Ç§Íò²»ÒªÐ´³Écalnodes1 2.ÏÂÃæµÄ½Ì³ÌÐÞÕýÁËÔÎÄÖеÄÒ»´¦±êµã´íÎó£¬Èç¹û²Î¿¼ÔÎÄ£¬ÐèÒª×ÔÐÐÐ޸ģ» TorqueÊÇ×÷Òµ¶ÓÁйÜÀíϵͳ£¬ÆäǰÉíÊÇopenPBS£¬µ«ºóÀ´openPBSµÄÄǰïÈË¿ª¹«Ë¾×öÉÌÒµPBS׬ǮȥÁË£¬openPBS¸ÄΪTorque¼ÌÐøÎª¿ªÔ´ÉçÇøÎ¬»¤¡£ ¹ØÓÚTorqueµÄ°²×°£¬ÍøÉϵĽ̳̺ܶࡣµ«ÊÇ£¬ÕâЩ½Ì³Ì´ó¶àÊÇÕë¶Ôcluster¼¯ÈºµÄ£¬ÓеÄÊÇtorque²»Í¬°æ±¾²»Í¬Æ½Ì¨µÄ£¬ËùÒÔ¸ø³öµÄÉèÖø÷²»Ïàͬ¡£Ò»µ©Äã°´ÕÕÒ»¸ö²»ºÏÊʵĽ̳ÌÈ¥²Ù×÷£¬¾Í»áµ¼ÖÂÖîÈçÕÒ²»µ½·þÎñÆ÷ÎÞ·¨°²×°£¬»òÕß°²×°ºó×÷ÒµÌá½»Á˲»ÄÜÔËÐУ¬»òÕßÄÜÌá½»×÷Òµµ«Á¢¼´½áÊøÍ˳ö£¬»òÕß×÷ÒµÄÜÕý³£ÔËÐе«²»¸ø³öoÎļþµÈ¸÷ÖÖÎÊÌâ¡£Õë¶Ô¸÷ÖÖÎÊÌâµÄ½â¾ö·½°¸¸üÊÇÇ§Ææ°Ù¹Ö£¬ÈÃÈËÔÆÀïÎíÀÎÞËùÊÊ´Ó¡£ ²»ÐÒµÄÊÇ£¬ÉÏÃæÌáµ½µÄÎÊÌâÎÒÈ«²¿Óöµ½ÁË¡£¸÷ÖÖ¹úÄÚÍâÍøÕ¾¸ø³öµÄ½â¾ö·½°¸¼¸ºõ¶¼ÕÛÌڱ飬²ÅÖð½¥Òâʶµ½£¬°²×°Torque£¬±ØÐë¸ãÇåTorqueµÄÈý¸ö²¿·Ö£¬pbs_server, pbs_mom, pbs_sched£¬ ËüÃÇÖ®¼äµÄ¹ØÏµºÍͨÐÅ»úÖÆµÄ¹Ø¼ü¡£ÕâÒ»²¿·ÖÍÆ¼öÈ¥°Ù¶È»òµÀ¿ÍÉÏËѼ¸Æª¹ØÓÚ¡°×÷Òµµ÷¶ÈϵͳPBS¡±½éÉܵÄppt¿´¿´¡£¼òµ¥Ëµ£¬ pbs_serverÊÇÁìµ¼£¬×øÔÚ×ܲ¿·þÎñÆ÷ÉϸºÔð½ÓÊÕÈÎÎñ£¬ pbs_schedÊǾÀí£¬¸ºÔð°Ñ¹¤×÷ÅÅÐò²¢·ÖÅäÏÂÈ¥£¬ pbs_momÊÇÃñ¹¤£¬ÔÚ¸÷¸ö¼ÆËã½ÚµãÐÁ¿à¹¤×÷£¬²¢°ÑÇé¿ö»ã±¨¸ø×ܲ¿¡£ Èý¸ö²¿·ÖµÄÅäÖÃÓÐ2¸ö¹Ø¼ü£¬ ¹Ø¼üÉèÖÃA£¬Äã±ØÐë¸æËßserverºÍschedÿ¸ö½ÚµãµÄÃû×ֺͺ˵ÄÊýÁ¿£¬ÒÔ±ãËüÃÇ·ÖÅäÈÎÎñ£» ¹Ø¼üÉèÖÃB£¬ÊÇÄã±ØÐë¸æËßmomÄĸö½ÚµãÊÇ·þÎñÆ÷½Úµã£¬ÒÔ±ãÆäÏò×ܲ¿»ã±¨¹¤×÷½øÕ¹¡£ ¾ßÌåÀ´Ëù£¬Á½¸ö¹Ø¼üÉèÖÃÉæ¼°ÈçϲÙ×÷£º£¨ÅäÖþùÔÚtorqueĬÈϰ²×°Ä¿Â¼Ï£º/var/spool/torque£© ¹Ø¼üÉèÖÃA£º ´´½¨»òÐÞ¸Äserver_priv/nodesÎļþ£¬Áгö¼ÆËã½ÚµãÃû³ÆºÍºËÐÄÊý£» ¹Ø¼üÉèÖÃB£º ´´½¨»òÐÞ¸Ämom_priv/configÎļþÁгöÖ÷½Úµãip£»´´½¨server_nameÎļþ£¬ÁгöÖ÷½Úµãhostname ¶ÔÓÚ¹¤×÷Õ¾À´Ëµ£¬ºÍcluster¼¯ÈºµÄÎ¨Ò»Çø±ðÊÇËüÖ»ÓÐÒ»¸ö¼ÆËã½Úµã£¬Ò²¾ÍÊÇÆä·þÎñ½Úµã£¬Í¨³£±¾»úIPºÍÖ÷»úÃûĬÈÏ·Ö±ðΪ127.0.0.1ºÍtorqueserver¡£ µ«ÊÇÄãÒªÔÙ¸øËüÉèÖÃÒ»¸ö±ðÃû£¬±ÈÈçcalnode1£¬×÷Ϊ¼ÆËã½ÚµãµÄÃû³Æ¡£ÈçÉÏÃæÍ¼Ëùʾ¡£ÕâÀï¸ã²»Ç壬ºÜÈÝÒ׳ö´í¡£ ºÃÁË£¬Ã÷°×ÁËÒÔÉϹؼü²¿·Ö£¬ÏÂÃæ¾Í¼òµ¥ÁË¡£ÔÚÎÒµÄ4ºËlinuxmint¹¤×÷վϣ¬Ê¹ÓÃÈí¼þÔ´°²×°µÄ°²×°Á÷³ÌÈçÏ£º 1. ÐÞ¸Ä/etc/hostsµÚÒ»ÐУ¬Ê¹ÆäΪ¡°127.0.0.1 localhost yourhostname torqueserver calnode1¡± (rootȨÏÞ) $ echo $HOSTNAME // find the hostname xxxxx // write this hostname into /etc/hosts $ sudo vi /etc/hosts 127.0.0.1 localhost xxxxx torqueserver calnode1 # 127.0.1.1 somename // ÕâÒ»ÐÐÒªcommentµô ºóÃæÓÐһЩipv6µÄ¶«Î÷£¬ÎÞÐ趯¡£ 2. °²×°torqueµÄ7¸öÏà¹Ø°ü $ sudo apt-get install torque-common libtorque2 libtorque2-dev torque-server torque-scheduler torque-mom torque-client °²×°Íê³Éºóserver, sched»á×Ô¶¯Æô¶¯ 3. ³õʼ»¯ $ sudo qterm // ÏÈÖÕÖ¹·þÎñ $ sudo bash /usr/share/doc/torque-common/torque.setup $USER torqueserver // ½¨Á¢Ä¬ÈÏ·þÎñÆ÷ºÍ¶ÓÁУ¬²¢°Ñ×Ô¼ºÁÐΪ¹ÜÀíÔ± $ qmgr -c 'print server' // ²é¿´Ä¬ÈÏÅäÖõķþÎñºÍ¶ÓÁÐ # # Create queues and set their attributes. # # # Create and define queue batch # create queue batch set queue batch queue_type = Execution set queue batch resources_default.nodes = 1 set queue batch resources_default.walltime = 01:00:00 set queue batch enabled = True set queue batch started = True # # Set server attributes. # set server scheduling = True set server acl_hosts = torqueserver set server default_queue = batch set server log_events = 511 set server mail_from = adm set server scheduler_iteration = 600 set server node_check_rate = 150 set server tcp_timeout = 6 set server mom_job_sync = True set server keep_completed = 300 // ×÷ÒµÍê³Éºó»áµÈ´ý300Ãë²ÅÏûʧ£¬ÕâÀïÐèÒª¸Ä³É1,¼û±¾Îĸ½Â¼¡£ 4. ÉèÖ÷þÎñ½Úµã (1) ´´½¨server_nameÎļþ£¬Ö¸Ã÷·þÎñ½ÚµãµÄÃû³ÆÎªtorqueserver $ sudo echo "torqueserver" > /var/spool/torque/server_name // ´ËÎļþÓ¦¸ÃÊÇĬÈÏÒѾ×Ô¶¯Éú³ÉµÄ (2) Ìí¼Ó¼ÆËã½Úµã ´´½¨server_priv/nodesÎļþ£¬Ö¸¶¨ÀûÓÃÃûΪcalnodeµÄ½ÚµãµÄ4¸öºË×ö¼ÆËã $ sudo echo "calnode1 np=4" > /var/spool/torque/server_priv/nodes 5. È¥¼ÆËã½ÚµãÅäÖᣠÓÉÓÚÎÒÃÇÊǹ¤×÷Õ¾£¬ËùÒÔʵ¼ÊÉϾÍÖ»ÊÇÔÚ±¾»úÉϲÙ×÷ ´´½¨mom_priv/configÎļþ£¬¸æËßmomÏòIPΪ127.0.0.1µÄ·þÎñ½Úµã»ã±¨ $ sudo echo "$pbs_server = 127.0.0.1" > /var/spool/torque/mom_priv/config 6. ½áÊøÅäÖã¬ÖØÆô·þÎñ ÏÈÆô¶¯¼ÆËã½Úµã·þÎñ£º $ sudo pbs_mom È»ºóÊÇ·þÎñ½Úµã $ sudo qterm -t quick // »òÕß $ sudo killall -r "pbs_*" $ sudo pbs_server // Æô¶¯server $ pbsnodes -a // ²é¿´ËùÓмÆËã½Úµã£¬freeΪÕý³£ 7. ÅäÖ÷þÎñµÄ¿ª»úÆô¶¯ $ sudo vi /etc/rc.local Ôö¼ÓÈýÁзֱðÊÇpbs_server pbs_sched pbs_mom 8. ²âÊÔ $ echo 'sleep 20' | qsub 9. ³ö´íºó¸ù¾Ý×÷ÒµºÅ×·²é×÷ÒµÏêÇé $ tracejob xx µäÐÍ×÷ÒµÌá½»½Å±¾£º #!/bin/bash #PBS -N test // job listÏÔʾµÄ×÷ÒµÃû³Æ¡£Í¨³£ÎÞÐèÖ¸¶¨£¬½«ÏÔʾ½Å±¾ÎļþÃû #PBS -l ncpus=2 // ÓÃ2¸öºË #PBS -l walltime=24:00:00 // ÔËÐÐʱ¼ä£¬Í¨³£ÔÚ×Ô¼ºµÄ¹¤×÷Õ¾ÉÏÎÞÐèÖ¸¶¨ #PBS -j oe // ºÏ²¢oÎļþºÍeÎļþΪoÎļþ£¬Õâ¸öºÜÓÐÓà #PBS -q batch // ½»µ½batch¶ÓÁУ¬Ò»°ãÎÞÐèÖ¸¶¨ #PBS -V // ʹÓÃ.bashrcÖÐÉèÖõĻ·¾³±äÁ¿£¬·Ç³£ÖØÒª cd $PBS_O_WORKDIR // ½øÈë½Å±¾Ìá½»µÄĿ¼Ϊ¹¤×÷Ŀ¼£¬ÕâÒ»ÐкÜÖØÒª¡£ g09 input.gjf output.log // ×÷ÒµÐÐ ¸½£º Ð޸ķþÎñºÍ¶ÓÁеij£ÓÃÃüÁî (1) ´´½¨ÓëÐÞ¸Ä×÷Òµ¶ÓÁÐbatch $ sudo qmgr -c 'create queue batch' // ´´½¨ÃûΪbatchµÄ¶ÓÁÐ $ sudo qmgr -c 'set queue batch queue_type = Execution' // ÀàÐÍΪ¼ÆËã $ sudo qmgr -c 'set queue batch enabled = True' // ¼¤»î $ sudo qmgr -c 'set queue batch started = True' // ¿ªÆô $ sudo qmgr -c 'set queue batch resources_default.walltime = 900:00:00' // ×ÔËÐÐʱ¼ä900Сʱ $ sudo qmgr -c 'set queue batch resources_default.ncpus = 1' // ĬÈÏÖ»ÓÃ1ºË $ sudo qmgr -c 'set queue batch resources_default.nodes = 1' // ĬÈÏʹÓÃ1¸ö½Úµã $ sudo qmgr -c 'set queue batch resources_default.nodect = 1' // Ö»·Å¿ª1¸ö½Úµã $ sudo qmgr -c 'set queue batch resources_max.ncpus = 4' // ×î¶àʹÓÃ4ºË $ sudo qmgr -c 'set queue batch resources_min.ncpus = 1' $ sudo qmgr -c 'set queue batch resources_max.nodes = 1' // Ö»ÓÐ1¸ö½Úµã $ sudo qmgr -c 'set queue batch max_running = 2' // ×î¶àͬʱÔËÐÐ2¸ö×÷Òµ (2) ÅäÖÃÓëÐ޸ķþÎñÆ÷server $ sudo qmgr -c 'set server scheduling = True' // Æô¶¯ÅŶӹÜÀí $ sudo qmgr -c 'set server default_queue = batch' // ¶¨ÒåĬÈ϶ÓÁÐ $ sudo qmgr -c 'set server allow_node_submit = True' // ÔÊÐíÏò·þÎñ½ÚµãÌá½»×÷Òµ£¬Õâ¸ö±ØÐëÉèÖà $ sudo qmgr -c 'set server query_other_jobs = True' // $ sudo qmgr -c 'set server acl_host_enable = True' $ sudo qmgr -c 'set server acl_hosts = calnode1' 1. ¹ØÓÚUnauthorized requestÎÊÌ⣺ ¿ÉÄÜÊÇÍüÁËʹÓùÜÀíԱȨÏÞ²Ù×÷£»Ò²¿ÉÄÜÊDzÙ×÷˳Ðò²»¶Ô£¬ÅäÖóåÍ»¡£¿ÉɱµôËùÓÐpbs_*·þÎñ£¬ÔÙ¿ªÆô¡£²»ÐÐÖØÆôÖ÷»ú¡£ 2. oÎļþÖеġ°Command not found¡± ºÜ¶àÈí¼þµÄÔËÐл·¾³ÊÇÔÚ.bashrcÖÐÉèÖ㬵«ÊÇtorqueÔÚqsubʱĬÈϲ¢²»Ö´ÐÐ.bashrc£¬¾Í»áµ¼ÖÂ×÷ÒµÌá½»ºóÁ¢¼´½áÊø£¬oÎļþÏÔʾcommand not found¡£½â¾ö´ËÎÊÌâÖ»ÐèÔÚ×÷Òµ½Å±¾ÖÐÔö¼ÓÒ»ÐÐ #PBS -V Job Checkpoint and Restart Create a checkpoint and stop: $ qhold Reboot server and restart job from the checkpoint: $ qrerun http://www.clusterresources.com/ ... jobcheckpoint.shtml |
» ÊÕ¼±¾ÌûµÄÌÔÌûר¼ÍƼö
VASP and MS | ºÃÌù | Linuxѧϰ |
» ²ÂÄãϲ»¶
Ò»Ö¾Ô¸Ìì´ó²ÄÁÏÓ뻯¹¤£¨085600£©×Ü·Ö338
ÒѾÓÐ4È˻ظ´
085700×ÊÔ´Óë»·¾³308Çóµ÷¼Á
ÒѾÓÐ3È˻ظ´
Çó²ÄÁϵ÷¼Á
ÒѾÓÐ8È˻ظ´
294Çóµ÷¼Á²ÄÁÏÓ뻯¹¤×¨Ë¶
ÒѾÓÐ5È˻ظ´
Ò»Ö¾Ô¸»ªÖпƼ¼´óѧ£¬080502£¬354·ÖÇóµ÷¼Á
ÒѾÓÐ4È˻ظ´
Ò»Ö¾Ô¸¼ªÁÖ´óѧ²ÄÁÏѧ˶321Çóµ÷¼Á
ÒѾÓÐ6È˻ظ´
085410È˹¤ÖÇÄÜר˶317Çóµ÷¼Á£¨0854¶¼¿ÉÒÔ£©
ÒѾÓÐ3È˻ظ´
330Çóµ÷¼Á
ÒѾÓÐ3È˻ظ´
Ò»Ö¾Ô¸Öк£Ñó²ÄÁϹ¤³Ìר˶330·ÖÇóµ÷¼Á
ÒѾÓÐ5È˻ظ´
304Çóµ÷¼Á
ÒѾÓÐ5È˻ظ´
| ÍüÁË˵ÁË£¬Õâ´ÎÂòÁË·þÎñÆ÷²ÅÖªµÀ£¬Ö®Ç°Ì¨Ê½»úÒ»Ö±¿ª×ų¬Ïß³ÌÅÜÈÎÎñ£¬×°ÁËPBS²Å·¢ÏÖ£¬³¬Ï̻߳áÍÏÂývasp£¬È»ºóÈ¥ÎÒÃÇ×éÀïµÄ¼¯ÈºÉÏ¿´ÁËÏ£¬ËùÓеĽڵ㶼ÊǹرÕÁ˳¬Ï̺߳ÍÐéÄ⻯£¬´ò¿ªÁËî£Æµ¡£ÎÒÒ²ÁªÏµÁËIBMµÄ¹¤³Ìʦ£¬È·ÊµÊÇÕâÑùµÄ£¬ËùÒÔÈç¹û»¹Óв»ÖªµÀµÄͬѧ£¬Èç¹û»úÆ÷½ö½öÊÇ×°ÁËvasp£¬À´ÅÜÈÎÎñ£¬½¨Ò鹨±Õ³¬Ï̺߳ÍÐéÄ⻯¡£Èç¹û»úÆ÷ÊÇ˫ϵͳ£¬»¹ÓÐһЩÆäËûÈÎÎñ£¬ÄÇô¾Ã²»Òª¹Ø±ÕÁË£¬µ«ÊÇÔÚÅÜÈÎÎñµÄʱºòʹÓõĺËÊýÒªµÈÓÚÄãµÄÎïÀíºËÐÄÊý£¬¶ø²»ÊÇÄãµÄÏß³ÌÊý£¬Ò²¾ÍÊÇCPUÀûÓÃÂÊ50%£¬µ±È»¼¯Èº¾Í²»Óõ£ÐÄÁË£¬ÒòΪ¹¤³Ìʦ¿Ï¶¨¾Í°ïÄãŪºÃµÄ£¬ËùÓеÄÕâЩ¶¼ÊÇÕë¶Ô×Ô¼ºµÄµ¥»ú»òÕß¹¤×÷Õ¾¡£ºÍ´ó¼Ò¹²Ïí¡£ |
2Â¥2013-11-07 16:26:22
3Â¥2013-11-07 18:05:10
4Â¥2013-11-07 22:13:28
5Â¥2013-11-08 00:57:03
6Â¥2013-11-08 23:06:02
7Â¥2013-11-08 23:07:09
8Â¥2013-11-09 15:06:19
9Â¥2013-11-09 23:19:01
10Â¥2015-12-02 19:15:06













»Ø¸´´ËÂ¥