[AMBER] Surprisingly Poor Performance of Quadro RTX5000 -- Any Ideas on Reasons...

From: 石谷沁 <guqin.shi.qilu-pharma.com>
Date: Mon, 17 May 2021 01:53:56 +0000

Dear Amber Community,

I recently installed Amber20/AmberTools21 on this system:
CentOS 7, 32 CPUs, 1 Quadro RTX5000 (built with cmake3, MPICH 3.3.2, CUDA 11.1).

I downloaded the AMBER 20 Benchmark Suite and ran a short test of JAC Production NPT 4fs (23,558 atoms) with pmemd.cuda.
The average speed is 28.54 ns/day, which is far slower than I expected. (From top, I can confirm that pmemd.cuda was running.)

I then installed and tested Amber20/AmberTools21 on another system:
Ubuntu, 4 A100-SXM4, CUDA 11.1
I ran the same JAC Production NPT 4fs test with pmemd.cuda.
The average speed is 1035 ns/day, which is the expected performance.
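
To put the two results side by side, here is a small sketch that converts the reported ns/day figures into a speed ratio and a wall-clock time per MD step. It assumes the 4 fs timestep implied by the benchmark name; the helper name steps_per_second is only illustrative and is not part of Amber.

```python
SECONDS_PER_DAY = 86_400


def steps_per_second(ns_per_day, timestep_fs=4.0):
    """Convert throughput in ns/day to MD steps per second.

    timestep_fs defaults to 4 fs, an assumption taken from the
    benchmark name "JAC Production NPT 4fs".
    """
    fs_per_day = ns_per_day * 1.0e6  # 1 ns = 1e6 fs
    return fs_per_day / timestep_fs / SECONDS_PER_DAY


rtx5000_ns_day = 28.54  # reported for the Quadro RTX5000 system
a100_ns_day = 1035.0    # reported for the A100 system

ratio = a100_ns_day / rtx5000_ns_day
rtx5000_ms_per_step = 1000.0 / steps_per_second(rtx5000_ns_day)
a100_ms_per_step = 1000.0 / steps_per_second(a100_ns_day)

print(f"A100 / RTX5000 speed ratio: {ratio:.1f}x")
print(f"RTX5000: {rtx5000_ms_per_step:.2f} ms per step")
print(f"A100:    {a100_ms_per_step:.3f} ms per step")
```

With these numbers the gap is roughly 36x, or about 12 ms per step on the RTX5000 against about 0.33 ms per step on the A100.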

Any ideas why the RTX5000 performance is so poor? Could you share a few bullet points that I should troubleshoot one by one?
Also, the RTX5000 temperature is at 70C, which is also a concern, because a normal temperature should be around 30C.
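
One thing that could be checked first is whether the card is running at reduced clocks while the benchmark is active. The following is a minimal sketch, assuming nvidia-smi is on the PATH and that the driver supports the listed --query-gpu fields; the function names (parse_gpu_line, query_gpus, clock_fraction) are hypothetical helpers, not part of any NVIDIA or Amber tool.

```python
import subprocess

# Fields requested from nvidia-smi's --query-gpu interface.
FIELDS = [
    "name",
    "pstate",
    "temperature.gpu",
    "utilization.gpu",
    "clocks.sm",
    "clocks.max.sm",
    "power.draw",
]


def parse_gpu_line(line):
    """Turn one CSV line of nvidia-smi output into a dict keyed by FIELDS."""
    values = [value.strip() for value in line.split(",")]
    return dict(zip(FIELDS, values))


def query_gpus():
    """Run nvidia-smi once and return one dict per GPU."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=" + ",".join(FIELDS),
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return [parse_gpu_line(line) for line in result.stdout.splitlines() if line.strip()]


def clock_fraction(gpu):
    """Current SM clock as a fraction of the maximum SM clock."""
    return float(gpu["clocks.sm"]) / float(gpu["clocks.max.sm"])


# Example use while pmemd.cuda is running:
#     for gpu in query_gpus():
#         print(gpu["name"], gpu["pstate"], gpu["temperature.gpu"], clock_fraction(gpu))
```

A clock fraction well below 1 under full load would point toward thermal or power throttling; a value near 1 would suggest looking elsewhere, for example at the build or the run configuration.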

Any thoughts are appreciated! Thanks!
-Guqin

nvidia-smi output, Quadro RTX5000 system:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 455.23.05    Driver Version: 455.23.05    CUDA Version: 11.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro RTX 5000     Off  | 00000000:73:00.0 Off |                  Off |
| 47%   71C    P2   164W / 230W |   4530MiB / 16125MiB |    100%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A    118393      C   ...are/man/driver/nvidia-smi     4457MiB |
+-----------------------------------------------------------------------------+

nvidia-smi output, A100 system:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 455.23.05    Driver Version: 455.23.05    CUDA Version: 11.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  A100-SXM4-80GB      Off  | 00000000:01:00.0 Off |                    0 |
| N/A   30C    P0    58W / 500W |      4MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   1  A100-SXM4-80GB      Off  | 00000000:41:00.0 Off |                    0 |
| N/A   29C    P0    58W / 500W |      4MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   2  A100-SXM4-80GB      Off  | 00000000:81:00.0 Off |                    0 |
| N/A   30C    P0    57W / 500W |      4MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+
|   3  A100-SXM4-80GB      Off  | 00000000:C1:00.0 Off |                    0 |
| N/A   28C    P0    60W / 500W |      4MiB / 81251MiB |      0%      Default |
|                               |                      |             Disabled |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      3516      G   /usr/lib/xorg/Xorg                  4MiB |
|    1   N/A  N/A      3516      G   /usr/lib/xorg/Xorg                  4MiB |
|    2   N/A  N/A      3516      G   /usr/lib/xorg/Xorg                  4MiB |
|    3   N/A  N/A      3516      G   /usr/lib/xorg/Xorg                  4MiB |
+-----------------------------------------------------------------------------+



_______________________________________________
AMBER mailing list
AMBER.ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber
Received on Sun May 16 2021 - 19:00:02 PDT