Re: [AMBER] pmemd.cuda segfaults

From: <pavel.banas.upol.cz>
Date: Mon, 03 Mar 2014 21:44:24 +0100 (CET)

        charset=us-ascii;
        name="gdb_error_message.txt"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: attachment; sizeQ36; filename="gdb_error_message.txt"

# RUN 1

banas.v01:~/test/test4$ gdb --args pmemd.cuda -O -i NVT.in -o production1_GPU0.out -p test1.top -c production0.restart -x production1_GPU0.traj -r production1_GPU0.restart
GNU gdb (GDB) 7.4.1-debian
Copyright (C) 2012 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>...
Reading symbols from /usr/local/programs/common/amber/amber12-pl21-at13-pl22/arch/x86_64-intel_13.1.0.146-cuda_5.5-debug/amber12/bin/pmemd.cuda...done.
(gdb) run
Starting program: /usr/local/programs/common/amber/amber12-pl21-at13-pl22/arch/x86_64-intel_13.1.0.146-cuda_5.5-debug/amber12/bin/pmemd.cuda -O -i NVT.in -o production1_GPU0.out -p test1.top -c production0.restart -x production1_GPU0.traj -r production1_GPU0.restart
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
[New Thread 0x7fffed594700 (LWP 3074)]
version_line NVRM version: NVIDIA UNIX x86_64 Kernel Module 331.38 Wed Jan 8 19:32:30 PST 2014

Program received signal SIGSEGV, Segmentation fault.
0x0000000000000000 in ?? ()
(gdb) backtrace
#0 0x0000000000000000 in ?? ()
#1 0x00007fffed9f821a in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#2 0x00007fffed9156aa in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#3 0x00007fffed9fc229 in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#4 0x00007fffed9fc3e2 in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#5 0x00007fffed9158af in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#6 0x00007fffed905eba in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#7 0x00007fffed8f1e2f in cuCtxSynchronize () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#8 0x00007ffff0e9fc99 in ?? () from /usr/local/programs/cuda/cuda-5.5/cuda/lib64/libcudart.so.5.5
#9 0x00007ffff0ec1509 in cudaThreadSynchronize () from /usr/local/programs/cuda/cuda-5.5/cuda/lib64/libcudart.so.5.5
#10 0x000000000055ecb8 in gpu_calculate_kinetic_energy_ ()
#11 0x000000000049cf3c in runmd_mod_mp_runmd_ ()
#12 0x00000000004db2e1 in MAIN__ ()

# RUN 2

banas.v01:~/test/test4$ gdb --args pmemd.cuda -O -i NVT.in -o production1_GPU0.out -p test1.top -c production0.restart -x production1_GPU0.traj -r production1_GPU0.restart
GNU gdb (GDB) 7.4.1-debian
Copyright (C) 2012 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>...
Reading symbols from /usr/local/programs/common/amber/amber12-pl21-at13-pl22/arch/x86_64-intel_13.1.0.146-cuda_5.5-debug/amber12/bin/pmemd.cuda...done.
(gdb) run
Starting program: /usr/local/programs/common/amber/amber12-pl21-at13-pl22/arch/x86_64-intel_13.1.0.146-cuda_5.5-debug/amber12/bin/pmemd.cuda -O -i NVT.in -o production1_GPU0.out -p test1.top -c production0.restart -x production1_GPU0.traj -r production1_GPU0.restart
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
[New Thread 0x7fffed594700 (LWP 3147)]
version_line NVRM version: NVIDIA UNIX x86_64 Kernel Module 331.38 Wed Jan 8 19:32:30 PST 2014

Program received signal SIGSEGV, Segmentation fault.
0x00007fffed9cf9d6 in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
(gdb) backtrace
#0 0x00007fffed9cf9d6 in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#1 0x00007fffed9d501f in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#2 0x00007fffed9ed57c in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#3 0x00007fffed907c25 in ?? () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#4 0x00007fffed8e401c in cuLaunchKernel () from /usr/local/programs/cuda/driver-331.38/usr/lib/libcuda.so
#5 0x00007ffff0ea4108 in ?? () from /usr/local/programs/cuda/cuda-5.5/cuda/lib64/libcudart.so.5.5
#6 0x00007ffff0ec2bbd in cudaLaunch () from /usr/local/programs/cuda/cuda-5.5/cuda/lib64/libcudart.so.5.5
#7 0x000000000058ba67 in cudaError (anonymous namespace)::cudaLaunch<char>(char*) ()
#8 0x000000000058b12a in __device_stub__Z18kNLSkinTest_kernelv() ()
#9 0x000000000058b135 in kNLSkinTest_kernel() ()
#10 0x000000000058ad66 in kNLSkinTest ()
#11 0x0000000000556091 in gpu_skin_test_ ()
#12 0x000000000049da3f in runmd_mod_mp_runmd_ ()
#13 0x00000000004db2e1 in MAIN__ ()

_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Mon Mar 03 2014 - 13:00:08 PST
Custom Search