求助:显存没有占用,但是功耗拉满, 风扇狂转...

2022-12-14 10:49:17 +08:00
 fenffef
服务器最近经常满负载,风扇狂转,炼不了丹,但是 nvidia-smi 显示显存没有占用,只能重启解决。
nvidia-smi
Wed Dec 14 10:42:47 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 510.108.03 Driver Version: 510.108.03 CUDA Version: 11.6 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:73:00.0 On | N/A |
| 74% 77C P2 349W / 350W | 710MiB / 24576MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 NVIDIA GeForce ... Off | 00000000:D5:00.0 Off | N/A |
| 94% 86C P2 309W / 350W | 516MiB / 24576MiB | 100% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 2050 G /usr/lib/xorg/Xorg 90MiB |
| 0 N/A N/A 2390 G /usr/bin/gnome-shell 63MiB |
| 0 N/A N/A 31515 G ...892221889725268949,131072 43MiB |
| 1 N/A N/A 2050 G /usr/lib/xorg/Xorg 4MiB |
+-----------------------------------------------------------------------------+
请问这是什么原因造成的?除了 reboot 有没有什么解决办法呢?
694 次点击
所在节点    机器学习
1 条回复
fenffef
2022-12-14 10:51:12 +08:00
nvidia-smi
Wed Dec 14 10:42:47 2022
| NVIDIA-SMI 510.108.03 Driver Version: 510.108.03 CUDA Version: 11.6 |
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| 0 NVIDIA GeForce ... Off | 00000000:73:00.0 On | N/A |
| 74% 77C P2 349W / 350W | 710MiB / 24576MiB | 100% Default |
| 1 NVIDIA GeForce ... Off | 00000000:D5:00.0 Off | N/A |
| 94% 86C P2 309W / 350W | 516MiB / 24576MiB | 100% Default |
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 2050 G /usr/lib/xorg/Xorg 90MiB |
| 0 N/A N/A 2390 G /usr/bin/gnome-shell 63MiB |
| 0 N/A N/A 31515 G ...892221889725268949,131072 43MiB |
| 1 N/A N/A 2050 G /usr/lib/xorg/Xorg 4MiB |
+-----------------------------------------------------------------------------+

这是一个专为移动设备优化的页面(即为了让你能够在 Google 搜索结果里秒开这个页面),如果你希望参与 V2EX 社区的讨论,你可以继续到 V2EX 上打开本讨论主题的完整版本。

https://www.v2ex.com/t/902397

V2EX 是创意工作者们的社区,是一个分享自己正在做的有趣事物、交流想法,可以遇见新朋友甚至新机会的地方。

V2EX is a community of developers, designers and creative people.

© 2021 V2EX