It would be interesting to know whether the water-cooled Titans work, but it depends on whether they water-cool the memory or not.
On Jul 11, 2013, at 7:52, ET <sketchfoot.gmail.com> wrote:
> That is very interesting, Scott. So if it is heat related, then the
> water-cooled GPUs that Ross mentioned in another thread would avoid the errors?
>
> On 11 July 2013 14:56, Scott Le Grand <varelse2005.gmail.com> wrote:
>
>> The problem with memtest is that it just exercises memory. Memory on its
>> own is usually fine. What seems to be going on is that the memory starts
>> giving errors when the GPU heats up the system while number-crunching. I
>> have found all sorts of GPUs, from all AMBER-compatible generations, that
>> pass memtest and even run n-body, only to blow up running JAC NVE for 2
>> minutes.
>>
>> We know Titan has issues and I suspect at least some 780s will as well.
>> What's interesting is the people who have gotten around this by modding
>> their GPU heatsinks and downclocking memory by a factor of 2. Not for the
>> faint of heart, but a lot cheaper than buying Teslas. That said, I suspect
>> the future will lead to contexts where Teslas better prove their worth
>> running AMBER.
>>
>> Scott
>>
>> On Thu, Jul 11, 2013 at 5:05 AM, ET <sketchfoot.gmail.com> wrote:
>>
>>> I never got any memtest errors with my TITANS either, so it is a
>>> combination of hardware and the CUDA code IMO,
>>>
>> _______________________________________________
>> AMBER mailing list
>> AMBER.ambermd.org
>> http://lists.ambermd.org/mailman/listinfo/amber
>>
Received on Thu Jul 11 2013 - 12:00:03 PDT