Re: [AMBER] Fwd: simulation terminates for no apparent reason

From: Sergio R Aragon <aragons.sfsu.edu>
Date: Fri, 10 Dec 2010 18:15:33 +0000

Hello Gabriel,

The error you are encountering has been the subject of a large amount of discussion in this list. Unfortunately there is no fix for this problem at the present time. It appears that all GTX 400 series cards AND the 580 have this error. Nvidia is working on it, but we have no assurance of a workaround. We are hopeful that one will be found.

Meanwhile, we cannot use these cards to run pmemd.cuda. If you can, move over to the Tesla cards such as the C2050 to do your work.

Sincerely,

Sergio Aragon
Professor of Chemistry
San Francisco State University


-----Original Message-----
From: Gabriel Urquiza [mailto:urquizagabes.gmail.com]
Sent: Friday, December 10, 2010 5:14 AM
To: AMBER Mailing List
Subject: [AMBER] Fwd: simulation terminates for no apparent reason

Greetings amberists,

I am having some problems running PMEMD simulations of a protein in a water
box. The system has around 60.000 atoms and I'm running it CUDA on a GeForce
GTX 480 1.4Ghz board with a global memory size of 1535 MB. I am running
several short dynamics with this system and it turns out that when I check
at the human-readable output files I notice that some of them just
abnormally terminated. The step in which the simulations finish doesn't
follow any apparent pattern, some terminate near the end, others right at
the beginning. Most of the simulations runs through though but if I remove
every file, restart the machine and resubmit all the simulations, using the
same script, in the same order, it is almost certain that the simulations
that crashed earlier won't be exactly the ones that will crash now. It seems
a very random thing and I find it very weird. I am placing my bets on the
narrow memory size of the board I'm using, but I've decided to ask for some
light on the matter.

The machine is running on double precision and it has an i7 processor. The
nohup pops one out of two error messages down below, whenever a dynamic
crashes:

Error: the launch timed out and was terminated launching kernel
kCalculatePMENonbondForces
Error: the launch timed out and was terminated launching kernel
kPMEGetGridWeights

I hope you can assist me with this.


Gabriel Urquiza

----------------

Universidade Federal da Paraíba - UFPB
Departamento de Química - Programa de Pós-Graduação
_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber



_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Fri Dec 10 2010 - 10:30:07 PST
Custom Search