This seems like an Open MPI error, not an Amber one. Apparently, you are oversubscribing your processors. Look here for info:
https://www.open-mpi.org/faq/?category=running#oversubscribing
Have you tried running your simulations with pmemd.MPI? If yes, do you receive the same error message?
I suggest trying the NEB simulation with a fewer number of replicas, maybe 16.
Best,
Delaram
________________________________________
From: Li, Dailin <d.li.northeastern.edu>
Sent: Friday, December 7, 2018 3:09 PM
To: AMBER Mailing List
Subject: [AMBER] 答复: Can a Nudged elastic band (NEB) job run on a single GPU?
Hi Delaram,
Thanks a lot for your reply. I have issued mpirun -np 32 $AMBERHOME/bin/pmemd.cuda.MPI -ng 32 -groupfile groupfile on the 2-GPU platform and got the error as below:
There are not enough slots available in the system to satisfy the 32 slots that were requested by the application:
pmemd.cuda.MPI
Either request fewer slots for your application, or make more slots available for use.
Is there any methods to let the command work? Thanks.
Regards,
Dailin
-----邮件原件-----
发件人: Ghoreishi, Delaram <delaram.phys.ufl.edu>
发送时间: 2018年12月7日 14:49
收件人: AMBER Mailing List <amber.ambermd.org>
主题: Re: [AMBER] Can a Nudged elastic band (NEB) job run on a single GPU?
Hi Dailin,
1) The MPI size (number of processors that you ask in the -np flag) should be a multiple of the number of replicas (that you state in the -ng flag). With 32 replicas your command should look like this:
mpirun -np 32 $AMBERHOME/bin/pmemd.cuda.MPI -ng 32 -groupfile groupfile For optimum Amber performance, you need to have 32 GPUs available when you execute this command, however, this command is still going to work if you have a fewer number of GPUs. But the computational performance is not going to be efficient. As you are overloading a single GPU with calculations that would otherwise be executed on other cards, you should see a drastic decrease in the performance. Thus, not a good idea.
2) NEB is an MPI job and is set to run with pmemd.MPI or pmemd.cuda.MPI only.
All the best,
Delaram
________________________________________
From: Li, Dailin <d.li.northeastern.edu>
Sent: Friday, December 7, 2018 2:11 PM
To: amber.ambermd.org
Subject: [AMBER] Can a Nudged elastic band (NEB) job run on a single GPU?
Hi,
I want to do NEB computations on GPUs. There are 32 images in the NEB job and only 2 GPUs could be used. When the job was submitted to the 2 GPUs, error concerning number of GPUs is not multiple will appear.
(1) Is it possible to do the NEB job on 2 GPUs? If yes, then how? Amber18 manual says "In case pmemd.cuda.MPI is used, it is best that the number of GPUs is equal to the number of images". Does "it is best" mean "it is required"?
(2) Is it possible to do the NEB job with pmemd.cuda, which means only 1GPU is used?
Thanks.
Regards,
Dailin
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
https://urldefense.proofpoint.com/v2/url?u=https-3A__na01.safelinks.protection.outlook.com_-3Furl-3Dhttps-253A-252F-252Furldefense.proofpoint.com-252Fv2-252Furl-253Fu-253Dhttp-2D3A-5F-5Flists.ambermd.org-5Fmailman-5Flistinfo-5Famber-2526d-253DDwICAg-2526c-253DpZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM-2526r-253DVvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw-5FbbTkzg-2526m-253DibBdszPwsZmEN-5Fq8N-2DAHz7Xzl1dVl7S8Kq-5F57Q1lyYo-2526s-253DNqs9gIeICaKZzWuGUxc6cg5-2DAgjWYZOh1rZqnsDAmdE-2526e-26amp-3Bdata-3D02-257C01-257Cd.li-2540northeastern.edu-257Cb291c330f3f9462f04c708d65c7d139e-257Ca8eec281aaa34daeac9b9a398b9215e7-257C0-257C0-257C636798089605633529-26amp-3Bsdata-3DNLFWdmPSXeEcvWUbv7A3EGAuIkeKDJb6OjaKcily2nw-253D-26amp-3Breserved-3D0-3D&d=DwIGbw&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=VvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg&m=si65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg&s=FCOYfxICeJdqyXfU9cE1d0EX9VTYOp6rMmr_EXnXDZQ&e=
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
https://urldefense.proofpoint.com/v2/url?u=https-3A__na01.safelinks.protection.outlook.com_-3Furl-3Dhttp-253A-252F-252Flists.ambermd.org-252Fmailman-252Flistinfo-252Famber-26amp-3Bdata-3D02-257C01-257Cd.li-2540northeastern.edu-257Cb291c330f3f9462f04c708d65c7d139e-257Ca8eec281aaa34daeac9b9a398b9215e7-257C0-257C0-257C636798089605643533-26amp-3Bsdata-3DDrL23y8uvjmHvuNgqDDkMbDW-252Fyrv4Tz89AovVioEOWw-253D-26amp-3Breserved-3D0&d=DwIGbw&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=VvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg&m=si65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg&s=boHjpGDAkUhU1KQdAJ_G6rVkTbHenrBwBEyh63bpcgE&e=
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.ambermd.org_mailman_listinfo_amber&d=DwIGbw&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=VvQy5PCXKJaGqwIFOxrZfrBLHWzuw9VxhPTw_bbTkzg&m=si65W6oEe1zR3ZorByc-mJU-_ed7auYQ3wNDWc22DEg&s=iC8PaSgwlpo0_PPzwALUukgGx9qm9kctYZy_DRBCmrk&e=
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Fri Dec 07 2018 - 13:30:03 PST