[AMBER] ´ð¸´: Can a Nudged elastic band (NEB) job run on a single GPU?

From: Li, Dailin <d.li.northeastern.edu>
Date: Fri, 7 Dec 2018 22:16:11 +0000

Hi,

I have tried running with pmemd.cuda.MPI by the following commands and everything is ok.

export CUDA_VISIBLE_DEVICES=0,1
mpirun -np 2 pmemd.cuda.MPI -O -i ....

For the NEB job, I tried the following commands and got the same error as before:

export CUDA_VISIBLE_DEVICES=0,1
mpirun -np 16 pmemd.cuda.MPI -ng 16 -groupfile groupfile

Anyway, thanks a lot for your reply.

Regards,
Dailin

-----ÓʼþÔ­¼þ-----
·¢¼þÈË: Ghoreishi, Delaram <delaram.phys.ufl.edu>
·¢ËÍʱ¼ä: 2018Äê12ÔÂ7ÈÕ 16:10
ÊÕ¼þÈË: AMBER Mailing List <amber.ambermd.org>
Ö÷Ìâ: Re: [AMBER] Can a Nudged elastic band (NEB) job run on a single GPU?

This seems like an Open MPI error, not an Amber one. Apparently, you are oversubscribing your processors. Look here for info:
https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.open-mpi.org%2Ffaq%2F%3Fcategory%3Drunning%23oversubscribing&amp;data=02%7C01%7Cd.li%40northeastern.edu%7Cabc50ddf04e54e890bc508d65c886fd1%7Ca8eec281aaa34daeac9b9a398b9215e7%7C0%7C0%7C636798138453770686&amp;sdata=Xk0elhTK0IQP%2BkmZrsqFn90D1iNdxdSWKaYzfY1sZx0%3D&amp;reserved=0
Have you tried running your simulations with pmemd.MPI? If yes, do you receive the same error message?

I suggest trying the NEB simulation with a fewer number of replicas, maybe 16.

Best,
Delaram



________________________________________
From: Li, Dailin <d.li.northeastern.edu>
Sent: Friday, December 7, 2018 3:09 PM
To: AMBER Mailing List
Subject: [AMBER] ´ð¸´: Can a Nudged elastic band (NEB) job run on a single GPU?

Hi Delaram,



Thanks a lot for your reply. I have issued mpirun -np 32 $AMBERHOME/bin/pmemd.cuda.MPI -ng 32 -groupfile groupfile on the 2-GPU platform and got the error as below:



There are not enough slots available in the system to satisfy the 32 slots that were requested by the application:

  pmemd.cuda.MPI

Either request fewer slots for your application, or make more slots available for use.



Is there any methods to let the command work? Thanks.



Regards,

Dailin



-----ÓʼþÔ­¼þ-----

·¢¼þÈË: Ghoreishi, Delaram <delaram.phys.ufl.edu>

·¢ËÍʱ¼ä: 2018Äê12ÔÂ7ÈÕ 14:49

ÊÕ¼þÈË: AMBER Mailing List <amber.ambermd.org>

Ö÷Ìâ: Re: [AMBER] Can a Nudged elastic band (NEB) job run on a single GPU?



Hi Dailin,



1) The MPI size (number of processors that you ask in the -np flag) should be a multiple of the number of replicas (that you state in the -ng flag). With 32 replicas your command should look like this:

mpirun -np 32 $AMBERHOME/bin/pmemd.cuda.MPI -ng 32 -groupfile groupfile For optimum Amber performance, you need to have 32 GPUs available when you execute this command, however, this command is still going to work if you have a fewer number of GPUs. But the computational performance is not going to be efficient. As you are overloading a single GPU with calculations that would otherwise be executed on other cards, you should see a drastic decrease in the performance. Thus, not a good idea.



2) NEB is an MPI job and is set to run with pmemd.MPI or pmemd.cuda.MPI only.



All the best,

Delaram



________________________________________

From: Li, Dailin <d.li.northeastern.edu>

Sent: Friday, December 7, 2018 2:11 PM

To: amber.ambermd.org

Subject: [AMBER] Can a Nudged elastic band (NEB) job run on a single GPU?



Hi,



I want to do NEB computations on GPUs. There are 32 images in the NEB job and only 2 GPUs could be used. When the job was submitted to the 2 GPUs, error concerning number of GPUs is not multiple will appear.



(1) Is it possible to do the NEB job on 2 GPUs? If yes, then how? Amber18 manual says "In case pmemd.cuda.MPI is used, it is best that the number of GPUs is equal to the number of images". Does "it is best" mean "it is required"?



(2) Is it possible to do the NEB job with pmemd.cuda, which means only 1GPU is used?



Thanks.



Regards,



Dailin

_______________________________________________

AMBER mailing list

AMBER.ambermd.org

https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttps-3A__na01.safelinks.protection.outlook.com_-3Furl-3Dhttps-253A-252F-252Furldefense.proofpoint.com-252Fv2-252Furl-253Fu-253Dhttp-2D3A-5F-5Flists.ambermd.org-5Fmailman-5Flistinfo-5Famber-2526d-253DDwICAg-2526c-253DpZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM-2526r-253DVvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw-5FbbTkzg-2526m-253DibBdszPwsZmEN-5Fq8N-2DAHz7Xzl1dVl7S8Kq-5F57Q1lyYo-2526s-253DNqs9gIeICaKZzWuGUxc6cg5-2DAgjWYZOh1rZqnsDAmdE-2526e-26amp-3Bdata-3D02-257C01-257Cd.li-2540northeastern.edu-257Cb291c330f3f9462f04c708d65c7d139e-257Ca8eec281aaa34daeac9b9a398b9215e7-257C0-257C0-257C636798089605633529-26amp-3Bsdata-3DNLFWdmPSXeEcvWUbv7A3EGAuIkeKDJb6OjaKcily2nw-253D-26amp-3Breserved-3D0-3D%26d%3DDwIGbw%26c%3DpZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM%26r%3DVvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg%26m%3Dsi65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg%26s%3DFCOYfxICeJdqyXfU9cE1d0EX9VTYOp6rMmr_EXnXDZQ%26e&amp;data=02%7C01%7Cd.li%40northeastern.edu%7Cabc50ddf04e54e890bc508d65c886fd1%7Ca8eec281aaa34daeac9b9a398b9215e7%7C0%7C0%7C636798138453770686&amp;sdata=8NSC3BZktHnqGEyh0vbqv0Wz2HjChVMW9Y8AGSwGuCs%3D&amp;reserved=0=



_______________________________________________

AMBER mailing list

AMBER.ambermd.org

https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttps-3A__na01.safelinks.protection.outlook.com_-3Furl-3Dhttp-253A-252F-252Flists.ambermd.org-252Fmailman-252Flistinfo-252Famber-26amp-3Bdata-3D02-257C01-257Cd.li-2540northeastern.edu-257Cb291c330f3f9462f04c708d65c7d139e-257Ca8eec281aaa34daeac9b9a398b9215e7-257C0-257C0-257C636798089605643533-26amp-3Bsdata-3DDrL23y8uvjmHvuNgqDDkMbDW-252Fyrv4Tz89AovVioEOWw-253D-26amp-3Breserved-3D0%26d%3DDwIGbw%26c%3DpZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM%26r%3DVvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg%26m%3Dsi65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg%26s%3DboHjpGDAkUhU1KQdAJ_G6rVkTbHenrBwBEyh63bpcgE%26e&amp;data=02%7C01%7Cd.li%40northeastern.edu%7Cabc50ddf04e54e890bc508d65c886fd1%7Ca8eec281aaa34daeac9b9a398b9215e7%7C0%7C0%7C636798138453780694&amp;sdata=HiTrzOUSYebXpDDF06XPbwz6cQmzga%2FpVa5zmIFyQno%3D&amp;reserved=0=

_______________________________________________
AMBER mailing list
AMBER.ambermd.org
https://na01.safelinks.protection.outlook.com/?url=https%3A%2F%2Furldefense.proofpoint.com%2Fv2%2Furl%3Fu%3Dhttp-3A__lists.ambermd.org_mailman_listinfo_amber%26d%3DDwIGbw%26c%3DpZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM%26r%3DVvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg%26m%3Dsi65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg%26s%3DiC8PaSgwlpo0_PPzwALUukgGx9qm9kctYZy_DRBCmrk%26e&amp;data=02%7C01%7Cd.li%40northeastern.edu%7Cabc50ddf04e54e890bc508d65c886fd1%7Ca8eec281aaa34daeac9b9a398b9215e7%7C0%7C0%7C636798138453780694&amp;sdata=O6oCWOhtg21BYaMCrfmz9FFvOfGL3WuSLVI0RzyS1hg%3D&amp;reserved=0=
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
https://na01.safelinks.protection.outlook.com/?url=http%3A%2F%2Flists.ambermd.org%2Fmailman%2Flistinfo%2Famber&amp;data=02%7C01%7Cd.li%40northeastern.edu%7Cabc50ddf04e54e890bc508d65c886fd1%7Ca8eec281aaa34daeac9b9a398b9215e7%7C0%7C0%7C636798138453780694&amp;sdata=80fha4gtCfPpvK9jCAvh8GRrksNE%2FmSEvxB1sdLN6xg%3D&amp;reserved=0
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Fri Dec 07 2018 - 14:30:02 PST
Custom Search