api

package

v0.10.12 Latest Latest Go to latest Published: Mar 12, 2026 License: Apache-2.0 Imports: 8 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/NVIDIA/aicr

Links

Open Source Insights

Documentation ¶

Overview ¶

Package api provides the HTTP API layer for the AICR Recipe Generation service.

This package acts as a thin wrapper around the reusable pkg/server package, configuring it with application-specific routes and handlers. It exposes the recipe generation functionality (Step 2 of the four-stage workflow) via REST API. Note: The API server does not support snapshot capture (Step 1) or validation (Step 3); use the CLI for these operations.

Usage ¶

To start the API server:

package main

import (
    "log"
    "github.com/NVIDIA/aicr/pkg/api"
)

func main() {
    if err := api.Serve(); err != nil {
        log.Fatalf("server error: %v", err)
    }
}

Architecture ¶

The API layer is responsible for:

Configuring structured logging with application name and version
Setting up route handlers (e.g., /v1/recipe)
Delegating server lifecycle management to pkg/server

The pkg/server package handles:

HTTP server setup and graceful shutdown
Middleware (rate limiting, logging, metrics, panic recovery)
Health and readiness endpoints
Prometheus metrics

Endpoints ¶

Application Endpoints (with rate limiting):

GET /v1/recipe - Generate configuration recipe based on query parameters
POST /v1/recipe - Generate configuration recipe from criteria body (JSON/YAML)

System Endpoints (no rate limiting):

GET /health - Health check (liveness probe)
GET /ready - Readiness check
GET /metrics - Prometheus metrics

Query Parameters (GET /v1/recipe) ¶

The /v1/recipe endpoint accepts these query parameters for GET requests:

os: Operating system (ubuntu, cos, rhel, any)
osv: OS version (e.g., 24.04)
kernel: Kernel version (supports vendor suffixes)
service: Kubernetes service (eks, gke, aks, self-managed, any)
k8s: Kubernetes version (supports vendor suffixes)
gpu: GPU type (h100, gb200, a100, l40, any)
intent: Workload intent (training, inference, any)
context: Include context metadata (true/false)

Request Body (POST /v1/recipe) ¶

POST requests accept a RecipeCriteria resource in the request body. Supports both JSON (application/json) and YAML (application/x-yaml) formats.

Example request body:

kind: RecipeCriteria
apiVersion: aicr.nvidia.com/v1alpha1
metadata:
  name: my-criteria
spec:
  service: eks
  accelerator: gb200
  os: ubuntu
  intent: training

Example curl command:

curl -X POST http://localhost:8080/v1/recipe \
  -H "Content-Type: application/yaml" \
  -d @criteria.yaml

Configuration ¶

The server is configured via environment variables:

PORT: HTTP server port (default: 8080)
LOG_LEVEL: Logging level (debug, info, warn, error)

Version information is set at build time using ldflags:

go build -ldflags="-X 'github.com/NVIDIA/aicr/pkg/api.version=1.0.0'"

Index ¶

func Serve() error

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func Serve ¶

func Serve() error

Serve starts the API server and blocks until shutdown. It configures logging, sets up routes, and handles graceful shutdown. Returns an error if the server fails to start or encounters a fatal error.

Types ¶

This section is empty.

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL