This repository was archived by the owner on Jan 22, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
This repository was archived by the owner on Jan 22, 2024. It is now read-only.
Failures on Debian with ldconfig #1399
Copy link
Copy link
Closed
Description
The template below is mostly useful for bug reports and support questions. Feel free to remove anything which doesn't apply to you and add more information where it makes sense.
Also, before reporting a new issue, please make sure that:
- You read carefully the documentation and frequently asked questions.
- You searched for a similar issue and this is not a duplicate of an existing one.
- This issue is not related to NGC, otherwise, please use the devtalk forums instead.
- You went through the troubleshooting steps.
1. Issue or feature description
On Debian 10 and Debian unstable, nvidia-docker fails to run programs that use CUDA inside of containers UNLESS ldconfig is run first in the container to rebuild the the ldconfig cache.
Example failure:
[17:07:54]:wash@voyager:/home/wash/development/nvidia/cuda_linux_p4/sw/gpgpu/thrust/ci:0:$ sudo docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi NVIDIA-SMI couldn't find libnvidia-ml.so library in your system. Please make sure that the NVIDIA Display Driver is properly installed and present in your system.
Please also try adding directory that contains libnvidia-ml.so to your system PATH.
[17:07:57]:wash@voyager:/home/wash/development/nvidia/cuda_linux_p4/sw/gpgpu/thrust/ci:12:$ tail -n 5 /var/log/nvidia-container-toolkit.log
I1014 00:07:56.925518 4001429 nvc_ldcache.c:359] executing /sbin/ldconfig from host at /var/lib/docker/overlay2/1b23287eb935d89df1baab6e66ded34209ac3f6a371ccb11c307a553bd11cff4/merged
E1014 00:07:56.926236 1 nvc_ldcache.c:390] could not start /sbin/ldconfig: process execution failed: no such file or directory
I1014 00:07:56.943973 4001429 nvc.c:337] shutting down library context
I1014 00:07:56.944378 4001435 driver.c:156] terminating driver service
I1014 00:07:56.944613 4001429 driver.c:196] driver service terminated successfully
If I run ldconfig within the container to rebuild ld.so.cache first, everything works:
[17:09:00]:wash@voyager:/home/wash/development/nvidia/cuda_linux_p4/sw/gpgpu/thrust/ci:0:$ sudo docker run --rm --gpus all nvidia/cuda:11.0-base bash -c "ldconfig && nvidia-smi"
Wed Oct 14 00:11:34 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 455.10 Driver Version: 455.10 CUDA Version: 11.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 GeForce GT 710 On | 00000000:04:00.0 N/A | N/A |
| 40% 41C P8 N/A / N/A | 1MiB / 2002MiB | N/A Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 1 TITAN Xp On | 00000000:17:00.0 Off | N/A |
| 23% 30C P8 8W / 250W | 1MiB / 12196MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
| 2 Quadro GV100 On | 00000000:65:00.0 On | Off |
| 32% 44C P0 26W / 250W | 0MiB / 32505MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
This seems related to:
- Failed to run nvidia/cuda:8.0 via nvidia-docker2. nvidia-container-cli: ldcache error: process /sbin/ldconfig failed with error code: 127 #587
- Debian buster: nnvidia-container-cli: ldcache error: process /sbin/ldconfig failed with error code: 1 #660
- Couldn't find libnvidia-ml.so library in your system nvidia-container-toolkit#299
2. Steps to reproduce the issue
3. Information to attach (optional if deemed irrelevant)
- Some nvidia-container information:
nvidia-container-cli -k -d /dev/tty info
I1014 00:13:02.642845 4001640 nvc.c:282] initializing library context (version=1.3.0, build=16315ebdf4b9728e899f615e208b50c41d7a5d15)
I1014 00:13:02.642869 4001640 nvc.c:256] using root /
I1014 00:13:02.642873 4001640 nvc.c:257] using ldcache /etc/ld.so.cache
I1014 00:13:02.642876 4001640 nvc.c:258] using unprivileged user 1000:1000
I1014 00:13:02.642887 4001640 nvc.c:299] attempting to load dxcore to see if we are running under Windows Subsystem for Linux (WSL)
I1014 00:13:02.642982 4001640 nvc.c:301] dxcore initialization failed, continuing assuming a non-WSL environment
W1014 00:13:02.644169 4001641 nvc.c:187] failed to set inheritable capabilities
W1014 00:13:02.644192 4001641 nvc.c:188] skipping kernel modules load due to failure
I1014 00:13:02.644291 4001642 driver.c:101] starting driver service
I1014 00:13:02.645319 4001640 nvc_info.c:680] requesting driver information with ''
I1014 00:13:02.646053 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/vdpau/libvdpau_nvidia.so.455.10
I1014 00:13:02.646118 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvoptix.so.455.10
I1014 00:13:02.646144 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-tls.so.455.10
I1014 00:13:02.646162 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-rtcore.so.455.10
I1014 00:13:02.646182 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ptxjitcompiler.so.455.10
I1014 00:13:02.646210 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.455.10
I1014 00:13:02.646239 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opencl.so.455.10
I1014 00:13:02.646258 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ngx.so.455.10
I1014 00:13:02.646278 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.455.10
I1014 00:13:02.646305 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ifr.so.455.10
I1014 00:13:02.646335 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glvkspirv.so.455.10
I1014 00:13:02.646354 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glsi.so.455.10
I1014 00:13:02.646374 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glcore.so.455.10
I1014 00:13:02.646393 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-fbc.so.455.10
I1014 00:13:02.646421 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.455.10
I1014 00:13:02.646448 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-eglcore.so.455.10
I1014 00:13:02.646467 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-compiler.so.455.10
I1014 00:13:02.646487 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-cfg.so.455.10
I1014 00:13:02.646514 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-cbl.so.455.10
I1014 00:13:02.646532 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.455.10
I1014 00:13:02.646562 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libnvcuvid.so.455.10
I1014 00:13:02.646670 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libcuda.so.455.10
I1014 00:13:02.646765 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libGLX_nvidia.so.455.10
I1014 00:13:02.646786 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libGLESv2_nvidia.so.455.10
I1014 00:13:02.646807 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libGLESv1_CM_nvidia.so.455.10
I1014 00:13:02.646828 4001640 nvc_info.c:169] selecting /usr/lib/x86_64-linux-gnu/libEGL_nvidia.so.455.10
I1014 00:13:02.646853 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/vdpau/libvdpau_nvidia.so.455.10
I1014 00:13:02.646880 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-tls.so.455.10
I1014 00:13:02.646898 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-ptxjitcompiler.so.455.10
I1014 00:13:02.646925 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-opticalflow.so.455.10
I1014 00:13:02.646952 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-opencl.so.455.10
I1014 00:13:02.646971 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-ml.so.455.10
I1014 00:13:02.646998 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-ifr.so.455.10
I1014 00:13:02.647027 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-glvkspirv.so.455.10
I1014 00:13:02.647045 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-glsi.so.455.10
I1014 00:13:02.647064 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-glcore.so.455.10
I1014 00:13:02.647083 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-fbc.so.455.10
I1014 00:13:02.647110 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-encode.so.455.10
I1014 00:13:02.647137 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-eglcore.so.455.10
I1014 00:13:02.647156 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-compiler.so.455.10
I1014 00:13:02.647175 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvidia-allocator.so.455.10
I1014 00:13:02.647204 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libnvcuvid.so.455.10
I1014 00:13:02.647242 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libcuda.so.455.10
I1014 00:13:02.647278 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libGLX_nvidia.so.455.10
I1014 00:13:02.647297 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libGLESv2_nvidia.so.455.10
I1014 00:13:02.647317 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libGLESv1_CM_nvidia.so.455.10
I1014 00:13:02.647337 4001640 nvc_info.c:169] selecting /usr/lib/i386-linux-gnu/libEGL_nvidia.so.455.10
W1014 00:13:02.647351 4001640 nvc_info.c:350] missing library libnvidia-fatbinaryloader.so
W1014 00:13:02.647356 4001640 nvc_info.c:354] missing compat32 library libnvidia-cfg.so
W1014 00:13:02.647360 4001640 nvc_info.c:354] missing compat32 library libnvidia-fatbinaryloader.so
W1014 00:13:02.647363 4001640 nvc_info.c:354] missing compat32 library libnvidia-ngx.so
W1014 00:13:02.647366 4001640 nvc_info.c:354] missing compat32 library libnvidia-rtcore.so
W1014 00:13:02.647370 4001640 nvc_info.c:354] missing compat32 library libnvoptix.so
W1014 00:13:02.647373 4001640 nvc_info.c:354] missing compat32 library libnvidia-cbl.so
I1014 00:13:02.655816 4001640 nvc_info.c:276] selecting /usr/bin/nvidia-smi
I1014 00:13:02.655827 4001640 nvc_info.c:276] selecting /usr/bin/nvidia-debugdump
I1014 00:13:02.655839 4001640 nvc_info.c:276] selecting /usr/bin/nvidia-persistenced
I1014 00:13:02.655849 4001640 nvc_info.c:276] selecting /usr/bin/nvidia-cuda-mps-control
I1014 00:13:02.655859 4001640 nvc_info.c:276] selecting /usr/bin/nvidia-cuda-mps-server
I1014 00:13:02.655874 4001640 nvc_info.c:438] listing device /dev/nvidiactl
I1014 00:13:02.655877 4001640 nvc_info.c:438] listing device /dev/nvidia-uvm
I1014 00:13:02.655882 4001640 nvc_info.c:438] listing device /dev/nvidia-uvm-tools
I1014 00:13:02.655887 4001640 nvc_info.c:438] listing device /dev/nvidia-modeset
I1014 00:13:02.655928 4001640 nvc_info.c:317] listing ipc /run/nvidia-persistenced/socket
W1014 00:13:02.655937 4001640 nvc_info.c:321] missing ipc /tmp/nvidia-mps
I1014 00:13:02.655941 4001640 nvc_info.c:745] requesting device information with ''
I1014 00:13:02.661724 4001640 nvc_info.c:628] listing device /dev/nvidia0 (GPU-858ec672-5669-6e20-d0e8-194029d32d2c at 00000000:04:00.0)
I1014 00:13:02.667422 4001640 nvc_info.c:628] listing device /dev/nvidia1 (GPU-2da062d6-3b80-9750-0af9-85d39d0b010b at 00000000:17:00.0)
I1014 00:13:02.673130 4001640 nvc_info.c:628] listing device /dev/nvidia2 (GPU-58a70c9d-1070-2a96-e5b3-cbee8d19d9e3 at 00000000:65:00.0)
NVRM version: 455.10
CUDA version: 11.1
Device Index: 0
Device Minor: 0
Model: GeForce GT 710
Brand: GeForce
GPU UUID: GPU-858ec672-5669-6e20-d0e8-194029d32d2c
Bus Location: 00000000:04:00.0
Architecture: 3.5
Device Index: 1
Device Minor: 1
Model: TITAN Xp
Brand: GeForce
GPU UUID: GPU-2da062d6-3b80-9750-0af9-85d39d0b010b
Bus Location: 00000000:17:00.0
Architecture: 6.1
Device Index: 2
Device Minor: 2
Model: Quadro GV100
Brand: Quadro
GPU UUID: GPU-58a70c9d-1070-2a96-e5b3-cbee8d19d9e3
Bus Location: 00000000:65:00.0
Architecture: 7.0
I1014 00:13:02.673174 4001640 nvc.c:337] shutting down library context
I1014 00:13:02.673531 4001642 driver.c:156] terminating driver service
I1014 00:13:02.673681 4001640 driver.c:196] driver service terminated successfully
- Kernel version from
uname -a
Linux voyager 5.5.0-1-amd64 NVIDIA/nvidia-docker#1 SMP Debian 5.5.13-2 (2020-03-30) x86_64 GNU/Linux
- Driver information from
nvidia-smi -a
Timestamp : Tue Oct 13 17:15:06 2020
Driver Version : 455.10
CUDA Version : 11.1
Attached GPUs : 3
GPU 00000000:04:00.0
Product Name : GeForce GT 710
Product Brand : GeForce
Display Mode : N/A
Display Active : N/A
Persistence Mode : Enabled
MIG Mode
Current : N/A
Pending : N/A
Accounting Mode : N/A
Accounting Mode Buffer Size : N/A
Driver Model
Current : N/A
Pending : N/A
Serial Number : N/A
GPU UUID : GPU-858ec672-5669-6e20-d0e8-194029d32d2c
Minor Number : 0
VBIOS Version : 80.28.A6.00.12
MultiGPU Board : N/A
Board ID : N/A
GPU Part Number : N/A
Inforom Version
Image Version : N/A
OEM Object : N/A
ECC Object : N/A
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization Mode : N/A
Host VGPU Mode : N/A
IBMNPU
Relaxed Ordering Mode : N/A
PCI
Bus : 0x04
Device : 0x00
Domain : 0x0000
Device Id : 0x128B10DE
Bus Id : 00000000:04:00.0
Sub System Id : 0x27123842
GPU Link Info
PCIe Generation
Max : N/A
Current : N/A
Link Width
Max : N/A
Current : N/A
Bridge Chip
Type : N/A
Firmware : N/A
Replays Since Reset : 0
Replay Number Rollovers : 0
Tx Throughput : N/A
Rx Throughput : N/A
Fan Speed : 40 %
Performance State : P8
Clocks Throttle Reasons : N/A
FB Memory Usage
Total : 2002 MiB
Used : 1 MiB
Free : 2001 MiB
BAR1 Memory Usage
Total : N/A
Used : N/A
Free : N/A
Compute Mode : Default
Utilization
Gpu : N/A
Memory : N/A
Encoder : N/A
Decoder : N/A
Encoder Stats
Active Sessions : N/A
Average FPS : N/A
Average Latency : N/A
FBC Stats
Active Sessions : N/A
Average FPS : N/A
Average Latency : N/A
Ecc Mode
Current : N/A
Pending : N/A
ECC Errors
Volatile
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Retired Pages
Single Bit ECC : N/A
Double Bit ECC : N/A
Pending Page Blacklist : N/A
Remapped Rows : N/A
Temperature
GPU Current Temp : 41 C
GPU Shutdown Temp : N/A
GPU Slowdown Temp : N/A
GPU Max Operating Temp : N/A
Memory Current Temp : N/A
Memory Max Operating Temp : N/A
Power Readings
Power Management : N/A
Power Draw : N/A
Power Limit : N/A
Default Power Limit : N/A
Enforced Power Limit : N/A
Min Power Limit : N/A
Max Power Limit : N/A
Clocks
Graphics : N/A
SM : N/A
Memory : N/A
Video : N/A
Applications Clocks
Graphics : 954 MHz
Memory : 900 MHz
Default Applications Clocks
Graphics : 954 MHz
Memory : 900 MHz
Max Clocks
Graphics : N/A
SM : N/A
Memory : N/A
Video : N/A
Max Customer Boost Clocks
Graphics : N/A
Clock Policy
Auto Boost : N/A
Auto Boost Default : N/A
Processes : None
GPU 00000000:17:00.0
Product Name : TITAN Xp
Product Brand : GeForce
Display Mode : Disabled
Display Active : Disabled
Persistence Mode : Enabled
MIG Mode
Current : N/A
Pending : N/A
Accounting Mode : Disabled
Accounting Mode Buffer Size : 4000
Driver Model
Current : N/A
Pending : N/A
Serial Number : N/A
GPU UUID : GPU-2da062d6-3b80-9750-0af9-85d39d0b010b
Minor Number : 1
VBIOS Version : 86.02.49.00.00
MultiGPU Board : No
Board ID : 0x1700
GPU Part Number : N/A
Inforom Version
Image Version : G001.0000.01.04
OEM Object : 1.1
ECC Object : N/A
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization Mode : None
Host VGPU Mode : N/A
IBMNPU
Relaxed Ordering Mode : N/A
PCI
Bus : 0x17
Device : 0x00
Domain : 0x0000
Device Id : 0x1B0210DE
Bus Id : 00000000:17:00.0
Sub System Id : 0x11DF10DE
GPU Link Info
PCIe Generation
Max : 3
Current : 1
Link Width
Max : 16x
Current : 8x
Bridge Chip
Type : N/A
Firmware : N/A
Replays Since Reset : 0
Replay Number Rollovers : 0
Tx Throughput : 0 KB/s
Rx Throughput : 0 KB/s
Fan Speed : 23 %
Performance State : P8
Clocks Throttle Reasons
Idle : Active
Applications Clocks Setting : Not Active
SW Power Cap : Not Active
HW Slowdown : Not Active
HW Thermal Slowdown : Not Active
HW Power Brake Slowdown : Not Active
Sync Boost : Not Active
SW Thermal Slowdown : Not Active
Display Clock Setting : Not Active
FB Memory Usage
Total : 12196 MiB
Used : 1 MiB
Free : 12195 MiB
BAR1 Memory Usage
Total : 256 MiB
Used : 5 MiB
Free : 251 MiB
Compute Mode : Default
Utilization
Gpu : 0 %
Memory : 0 %
Encoder : 0 %
Decoder : 0 %
Encoder Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
FBC Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
Ecc Mode
Current : N/A
Pending : N/A
ECC Errors
Volatile
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Retired Pages
Single Bit ECC : N/A
Double Bit ECC : N/A
Pending Page Blacklist : N/A
Remapped Rows : N/A
Temperature
GPU Current Temp : 30 C
GPU Shutdown Temp : 99 C
GPU Slowdown Temp : 96 C
GPU Max Operating Temp : N/A
Memory Current Temp : N/A
Memory Max Operating Temp : N/A
Power Readings
Power Management : Supported
Power Draw : 9.77 W
Power Limit : 250.00 W
Default Power Limit : 250.00 W
Enforced Power Limit : 250.00 W
Min Power Limit : 125.00 W
Max Power Limit : 300.00 W
Clocks
Graphics : 139 MHz
SM : 139 MHz
Memory : 405 MHz
Video : 544 MHz
Applications Clocks
Graphics : 1404 MHz
Memory : 5705 MHz
Default Applications Clocks
Graphics : 1404 MHz
Memory : 5705 MHz
Max Clocks
Graphics : 1911 MHz
SM : 1911 MHz
Memory : 5705 MHz
Video : 1620 MHz
Max Customer Boost Clocks
Graphics : N/A
Clock Policy
Auto Boost : N/A
Auto Boost Default : N/A
Processes : None
GPU 00000000:65:00.0
Product Name : Quadro GV100
Product Brand : Quadro
Display Mode : Enabled
Display Active : Enabled
Persistence Mode : Enabled
MIG Mode
Current : N/A
Pending : N/A
Accounting Mode : Disabled
Accounting Mode Buffer Size : 4000
Driver Model
Current : N/A
Pending : N/A
Serial Number : 0320319013520
GPU UUID : GPU-58a70c9d-1070-2a96-e5b3-cbee8d19d9e3
Minor Number : 2
VBIOS Version : 88.00.5A.00.03
MultiGPU Board : No
Board ID : 0x6500
GPU Part Number : 900-5G500-0000-000
Inforom Version
Image Version : G500.0500.00.05
OEM Object : 1.1
ECC Object : 5.0
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization Mode : None
Host VGPU Mode : N/A
IBMNPU
Relaxed Ordering Mode : N/A
PCI
Bus : 0x65
Device : 0x00
Domain : 0x0000
Device Id : 0x1DBA10DE
Bus Id : 00000000:65:00.0
Sub System Id : 0x121A10DE
GPU Link Info
PCIe Generation
Max : 3
Current : 3
Link Width
Max : 16x
Current : 16x
Bridge Chip
Type : N/A
Firmware : N/A
Replays Since Reset : 0
Replay Number Rollovers : 0
Tx Throughput : 0 KB/s
Rx Throughput : 0 KB/s
Fan Speed : 32 %
Performance State : P0
Clocks Throttle Reasons
Idle : Active
Applications Clocks Setting : Not Active
SW Power Cap : Not Active
HW Slowdown : Not Active
HW Thermal Slowdown : Not Active
HW Power Brake Slowdown : Not Active
Sync Boost : Not Active
SW Thermal Slowdown : Not Active
Display Clock Setting : Not Active
FB Memory Usage
Total : 32505 MiB
Used : 0 MiB
Free : 32505 MiB
BAR1 Memory Usage
Total : 256 MiB
Used : 3 MiB
Free : 253 MiB
Compute Mode : Default
Utilization
Gpu : 0 %
Memory : 0 %
Encoder : 0 %
Decoder : 0 %
Encoder Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
FBC Stats
Active Sessions : 0
Average FPS : 0
Average Latency : 0
Ecc Mode
Current : Disabled
Pending : Disabled
ECC Errors
Volatile
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
CBU : N/A
Total : N/A
Retired Pages
Single Bit ECC : 0
Double Bit ECC : 0
Pending Page Blacklist : No
Remapped Rows : N/A
Temperature
GPU Current Temp : 44 C
GPU Shutdown Temp : 90 C
GPU Slowdown Temp : 88 C
GPU Max Operating Temp : 87 C
Memory Current Temp : 42 C
Memory Max Operating Temp : 95 C
Power Readings
Power Management : Supported
Power Draw : 26.83 W
Power Limit : 250.00 W
Default Power Limit : 250.00 W
Enforced Power Limit : 250.00 W
Min Power Limit : 100.00 W
Max Power Limit : 250.00 W
Clocks
Graphics : 135 MHz
SM : 135 MHz
Memory : 850 MHz
Video : 555 MHz
Applications Clocks
Graphics : 1132 MHz
Memory : 850 MHz
Default Applications Clocks
Graphics : 1132 MHz
Memory : 850 MHz
Max Clocks
Graphics : 1912 MHz
SM : 1912 MHz
Memory : 850 MHz
Video : 1717 MHz
Max Customer Boost Clocks
Graphics : 1912 MHz
Clock Policy
Auto Boost : N/A
Auto Boost Default : N/A
Processes : None
- Docker version from
docker version
Client: Docker Engine - Community
Version: 19.03.8
API version: 1.40
Go version: go1.12.17
Git commit: afacb8b7f0
Built: Wed Mar 11 01:26:02 2020
OS/Arch: linux/amd64
Experimental: false
Server: Docker Engine - Community
Engine:
Version: 19.03.12
API version: 1.40 (minimum version 1.12)
Go version: go1.13.10
Git commit: 48a66213fe
Built: Mon Jun 22 15:44:23 2020
OS/Arch: linux/amd64
Experimental: false
containerd:
Version: 1.2.13
GitCommit: 7ad184331fa3e55e52b890ea95e65ba581ae3429
runc:
Version: 1.0.0-rc10
GitCommit: dc9208a3303feef5b3839f4323d9beb36df0a9dd
docker-init:
Version: 0.18.0
GitCommit: fec3683
- NVIDIA packages version from
dpkg -l '*nvidia*'orrpm -qa '*nvidia*'
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-=============================-============-============-=====================================================
un libgldispatch0-nvidia <none> <none> (no description available)
ii libnvidia-container-tools 1.3.0-1 amd64 NVIDIA container runtime library (command-line tools)
ii libnvidia-container1:amd64 1.3.0-1 amd64 NVIDIA container runtime library
ii nvidia-container-runtime 3.4.0-1 amd64 NVIDIA container runtime
un nvidia-container-runtime-hook <none> <none> (no description available)
ii nvidia-container-toolkit 1.3.0-1 amd64 NVIDIA container runtime hook
un nvidia-docker <none> <none> (no description available)
ii nvidia-docker2 2.5.0-1 all nvidia-docker CLI wrapper
un nvidia-libopencl1-dev <none> <none> (no description available)
My installation of the display driver and CUDA is a local debug build from source and is rougly CUDA 11.0 / R455.
- NVIDIA container library version from
nvidia-container-cli -V
version: 1.3.0
build date: 2020-09-16T12:33+00:00
build revision: 16315ebdf4b9728e899f615e208b50c41d7a5d15
build compiler: x86_64-linux-gnu-gcc-8 8.3.0
build platform: x86_64
build flags: -D_GNU_SOURCE -D_FORTIFY_SOURCE=2 -DNDEBUG -std=gnu11 -O2 -g -fdata-sections -ffunction-sections -fstack-protector -fno-strict-aliasing -fvisibility=hidden -Wall -Wextra -Wcast-align -Wpointer-arith -Wmissing-prototypes -Wnonnull -Wwrite-strings -Wlogical-op -Wformat=2 -Wmissing-format-attribute -Winit-self -Wshadow -Wstrict-prototypes -Wunreachable-code -Wconversion -Wsign-conversion -Wno-unknown-warning-option -Wno-format-extra-args -Wno-gnu-alignof-expression -Wl,-zrelro -Wl,-znow -Wl,-zdefs -Wl,--gc-sections
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels