Problem with interaction between PBS-Torque and Mpich2

Hello,

I installed Torque and Mpich2 on a cluster. I can run
jobs using Mpich2. Before running a job, I start the mpd
daemons on all the machines (mpdboot), and then run the job.

The problem is that I cannot do the same through
Torque: when I run more than one job in the
queue, some of the jobs die.

I think that the mpd daemon of one job kills the mpd daemon of the other.

I am using the following script to run in the queue:

-------------------------

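#PBS -l nodes=3
#PBS -j eo
#PBS -m bae
#PBS -M fokerman@xxxxxxxxx
#PBS -V

# Number of CPUs allocated to the job (one line per CPU in the node file)
NP=$(wc -l "$PBS_NODEFILE" | awk '{print $1}')

echo "nodes ($NP cpu total):"
sort "$PBS_NODEFILE" | uniq
echo

cd "$PBS_O_WORKDIR"

# By default mpdboot reads its host list from mpd.hosts in the current directory
cat "$PBS_NODEFILE" > mpd.hosts

# Start the mpd ring
mpdboot -n "$NP"

# Run the benchmark (one command, continued across two lines)
mpirun -machinefile "$PBS_NODEFILE" -np "$NP" \
    "$PBS_O_WORKDIR/IMB_3.0/src/IMB-MPI1"

# Shut down the mpd ring
mpdallexit

exit 0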

-------------------------
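
One thing worth checking in the script above: the -n option of mpdboot counts the mpd daemons to start, which is one per host, while the -np option of mpirun counts processes. With nodes=3 the two numbers coincide, but they would differ if several CPUs per node were requested. A minimal sketch, assuming a node file with one hostname per line, repeated once per allocated CPU; the helper names count_unique_hosts and count_cpus are hypothetical and not part of Torque or Mpich2:

```shell
#!/bin/sh
# Hypothetical helpers (not part of Torque or Mpich2).

# Number of distinct hosts in a node file: one mpd daemon is needed per host.
count_unique_hosts() {
    sort "$1" | uniq | wc -l | awk '{print $1}'
}

# Number of allocated CPUs: the node file has one line per CPU.
count_cpus() {
    wc -l < "$1" | awk '{print $1}'
}
```

Inside a job these could be used as mpdboot -n "$(count_unique_hosts "$PBS_NODEFILE")" and mpirun -np "$(count_cpus "$PBS_NODEFILE")". This only separates the two counts; it does not by itself explain why concurrent jobs die.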

Thanks a lot!

David.

__________________________________________________

"whoever says many, says infinite..."
