Documentation
¶
Overview ¶
Package accelerator contains the accelerator components and its query interface.
Directories
¶
| Path | Synopsis |
|---|---|
|
Package nvidia contains the NVIDIA accelerator components and its query interface.
|
Package nvidia contains the NVIDIA accelerator components and its query interface. |
|
clock-speed
Package clockspeed tracks the NVIDIA per-GPU clock speed.
|
Package clockspeed tracks the NVIDIA per-GPU clock speed. |
|
ecc
Package ecc tracks the NVIDIA per-GPU ECC errors and other ECC related information.
|
Package ecc tracks the NVIDIA per-GPU ECC errors and other ECC related information. |
|
fabric-manager
Package fabricmanager tracks NVIDIA fabric manager and fabric health monitoring services.
|
Package fabricmanager tracks NVIDIA fabric manager and fabric health monitoring services. |
|
gpm
Package gpm tracks the NVIDIA per-GPU GPM metrics.
|
Package gpm tracks the NVIDIA per-GPU GPM metrics. |
|
gpu-counts
Package gpucounts monitors the GPU count of the system.
|
Package gpucounts monitors the GPU count of the system. |
|
hw-slowdown
Package hwslowdown monitors NVIDIA GPU hardware clock events of all GPUs, such as HW Slowdown events.
|
Package hwslowdown monitors NVIDIA GPU hardware clock events of all GPUs, such as HW Slowdown events. |
|
infiniband
Package infiniband monitors the infiniband status of the system.
|
Package infiniband monitors the infiniband status of the system. |
|
memory
Package memory tracks the NVIDIA per-GPU memory usage.
|
Package memory tracks the NVIDIA per-GPU memory usage. |
|
nccl
Package nccl monitors the NCCL status.
|
Package nccl monitors the NCCL status. |
|
nvlink
Package nvlink monitors the NVIDIA per-GPU nvlink devices.
|
Package nvlink monitors the NVIDIA per-GPU nvlink devices. |
|
peermem
Package peermem monitors the peermem module status.
|
Package peermem monitors the peermem module status. |
|
persistence-mode
Package persistencemode tracks the NVIDIA persistence mode.
|
Package persistencemode tracks the NVIDIA persistence mode. |
|
power
Package power tracks the NVIDIA per-GPU power usage.
|
Package power tracks the NVIDIA per-GPU power usage. |
|
processes
Package processes tracks the NVIDIA per-GPU processes.
|
Package processes tracks the NVIDIA per-GPU processes. |
|
remapped-rows
Package remappedrows tracks the NVIDIA per-GPU remapped rows.
|
Package remappedrows tracks the NVIDIA per-GPU remapped rows. |
|
sxid
Package sxid tracks the NVIDIA GPU SXid errors scanning the kmsg.
|
Package sxid tracks the NVIDIA GPU SXid errors scanning the kmsg. |
|
temperature
Package temperature tracks the NVIDIA per-GPU temperatures.
|
Package temperature tracks the NVIDIA per-GPU temperatures. |
|
utilization
Package utilization tracks the NVIDIA per-GPU utilization.
|
Package utilization tracks the NVIDIA per-GPU utilization. |
|
xid
Package xid tracks the NVIDIA GPU Xid errors scanning the kmsg See Xid messages https://docs.nvidia.com/deploy/gpu-debug-guidelines/index.html#xid-messages.
|
Package xid tracks the NVIDIA GPU Xid errors scanning the kmsg See Xid messages https://docs.nvidia.com/deploy/gpu-debug-guidelines/index.html#xid-messages. |
Click to show internal directories.
Click to hide internal directories.