GetSafetensorsDtype returns the quantization type for a safetensors model.
If the model is quantized (has _scale tensors), returns the quantization type (e.g., "FP8").
Otherwise returns the torch_dtype from config.json.
GetSafetensorsLLMInfo extracts model information from safetensors LLM models.
It reads the config.json layer and returns a map compatible with GGML's KV format.
GetSafetensorsTensorInfo extracts tensor information from safetensors model layers.
Each tensor is stored as a minimal safetensors file with an 88-byte header containing metadata.