Documentation
Overview
Package nvidia contains the NVIDIA accelerator components and their query interface.
Directories
| Path | Synopsis |
|---|---|
| clockspeed | Package clockspeed tracks the NVIDIA per-GPU clock speed. |
| ecc | Package ecc tracks the NVIDIA per-GPU ECC errors and other ECC related information. |
| fabricmanager | Package fabricmanager tracks NVIDIA fabric manager and fabric health monitoring services. |
| gpm | Package gpm tracks the NVIDIA per-GPU GPM metrics. |
| gpucounts | Package gpucounts monitors the GPU count of the system. |
| hwslowdown | Package hwslowdown monitors NVIDIA GPU hardware clock events of all GPUs, such as HW Slowdown events. |
| infiniband | Package infiniband monitors the infiniband status of the system. |
| infiniband/class | Package class implements the infiniband class sysfs interface. |
| infiniband/store | Package store stores infiniband states in time-series. |
| infiniband/types | Package types contains shared types for the infiniband package to avoid import cycles. |
| memory | Package memory tracks the NVIDIA per-GPU memory usage. |
| nccl | Package nccl monitors the NCCL status. |
| nvlink | Package nvlink monitors the NVIDIA per-GPU nvlink devices. |
| peermem | Package peermem monitors the peermem module status. |
| persistencemode | Package persistencemode tracks the NVIDIA persistence mode. |
| power | Package power tracks the NVIDIA per-GPU power usage. |
| processes | Package processes tracks the NVIDIA per-GPU processes. |
| remappedrows | Package remappedrows tracks the NVIDIA per-GPU remapped rows. |
| sxid | Package sxid tracks the NVIDIA GPU SXid errors scanning the kmsg. |
| temperature | Package temperature tracks the NVIDIA per-GPU temperatures. |
| utilization | Package utilization tracks the NVIDIA per-GPU utilization. |
| xid | Package xid tracks the NVIDIA GPU Xid errors scanning the kmsg. See Xid messages: https://docs.nvidia.com/deploy/gpu-debug-guidelines/index.html#xid-messages |
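
The xid synopsis above mentions scanning the kmsg for Xid errors. As a rough illustration of what such a scan involves, the sketch below extracts the device address and Xid code from a single kernel log line. It is not the xid package's implementation: the names `xidPattern`, `xidEvent`, and `parseXid` are hypothetical, and the line layout (`NVRM: Xid (PCI:<address>): <code>, ...`) is an assumption based on how the NVIDIA driver commonly reports Xid events.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// xidPattern matches kernel log lines such as
// "NVRM: Xid (PCI:0000:3b:00): 79, pid=1234, ..." and captures the
// device address and the numeric Xid code. The "PCI:" prefix is
// treated as optional (assumed line layout, see the lead-in).
var xidPattern = regexp.MustCompile(`NVRM: Xid \(((?:PCI:)?[0-9a-fA-F:.]+)\): (\d+),`)

// xidEvent is a hypothetical record type used only in this sketch.
type xidEvent struct {
	PCI  string // device address as printed in the log line
	Code int    // numeric Xid code
}

// parseXid extracts an Xid event from one kernel log line.
// It returns false when the line does not contain an Xid message.
func parseXid(line string) (xidEvent, bool) {
	m := xidPattern.FindStringSubmatch(line)
	if m == nil {
		return xidEvent{}, false
	}
	code, err := strconv.Atoi(m[2])
	if err != nil {
		return xidEvent{}, false
	}
	return xidEvent{PCI: m[1], Code: code}, true
}

func main() {
	line := "NVRM: Xid (PCI:0000:3b:00): 79, pid=1234, GPU has fallen off the bus."
	if ev, ok := parseXid(line); ok {
		fmt.Printf("xid=%d device=%s\n", ev.Code, ev.PCI)
	}
}
```

A real monitor would read lines continuously from the kernel log and map each code to a description and suggested action; this sketch covers only the per-line parsing step.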