Re: [AMBER] GPU kernel error

From: Ross Walker <ross.rosswalker.co.uk>
Date: Fri, 15 Jun 2012 15:33:37 -0700

Hi Fernando,

Is this the only output you get, is there anything in the mdout file. Also is it reproducible as in it always occurs at the same step every time? What about if you try a slightly different system or simulation parameters?

My suspicion is that your M2090 card is locking up or otherwise going weird. Most likely due to overheating. I have similar situations in desktop test machines I have M2090s in. These do not have the required ducked cooling needed for the passively cooled M2090s and so are only good for about 2 mins or so per run (for basic testing) before the card shuts down, often with some kind of kernel segfault or other OS based fatal event.

All the best
Ross

> -----Original Message-----
> From: Fernando Martín García [mailto:fmgarcia.cbm.uam.es]
> Sent: Friday, June 15, 2012 3:01 AM
> To: amber.ambermd.org
> Subject: [AMBER] GPU kernel error
>
> Hello,
>
> We are in some trouble with with the execution of pmemd.cuda. We have
> installed Amber12 in a CentOS 6 machine with two M2090. All test
> worked
> fine, but when we run a MD, we have next error after 4000 steps of
> 10000:
>
> Message from syslogd.francio at Jun 15 11:32:31 ...
> kernel:Stack:
>
> Message from syslogd.francio at Jun 15 11:32:31 ...
> kernel:Call Trace:
>
> Message from syslogd.francio at Jun 15 11:32:31 ...
> kernel:Code: c0 00 00 00 02 74 11 48 c7 c7 50 b5 aa a0 e8 a4 00 6b 00
> e8 6a 36 6e 00 b8 00 00 00 00 eb 12 89 c0 ba 01 00 00 00 d3 e2 85 14
> 87
> <0f> 95 c0 0f b6 c0 48 83 c5 08 5b c3 41 55 41 54 53 48 89 fb 49
>
> have any idea where could it be the problem?
>
> Thank you
>
> Best regards,
> --
> ==============================================
> Fernando Martín García
> Molecular Modelling Group - Lab 312.1
> Molecular Biology Center "Severo Ochoa"
> C/ NICOLáS CABRERA, 5.
> UAM University. Cantoblanco, 28049 Madrid. Spain.
> TEL: (+34) 91-196-4662 FAX: (+34) 91-196-4420
> Web: http://fertoledo.wordpress.com/
> ==============================================
>
>
> _______________________________________________
> AMBER mailing list
> AMBER.ambermd.org
> http://lists.ambermd.org/mailman/listinfo/amber


_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Fri Jun 15 2012 - 16:00:04 PDT
Custom Search