Documentation
¶
Overview ¶
Package v1beta1 contains API Schema definitions for the serving v1beta1 API group +k8s:openapi-gen=true +k8s:deepcopy-gen=package,register +k8s:conversion-gen=kfserving/pkg/apis/serving +k8s:defaulter-gen=TypeMeta +groupName=serving.kubeflow.org
Index ¶
- Constants
- Variables
- func ExactlyOneErrorFor(component Component) error
- func GetIntReference(number int) *int
- func GetOpenAPIDefinitions(ref common.ReferenceCallback) map[string]common.OpenAPIDefinition
- func IsPrefixStorageURISupported(storageURI string, supportedStorageURIPrefixes []string) bool
- func Resource(resource string) schema.GroupResource
- func ValidateMaxArgumentWorkers(slice []string, maxWorkers int64) error
- type AIXExplainerSpec
- func (in *AIXExplainerSpec) DeepCopy() *AIXExplainerSpec
- func (in *AIXExplainerSpec) DeepCopyInto(out *AIXExplainerSpec)
- func (s *AIXExplainerSpec) Default(config *InferenceServicesConfig)
- func (s *AIXExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (s *AIXExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
- func (s *AIXExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
- func (s *AIXExplainerSpec) GetStorageUri() *string
- func (s *AIXExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
- func (s *AIXExplainerSpec) Validate() error
- type AIXExplainerType
- type ARTExplainerSpec
- func (in *ARTExplainerSpec) DeepCopy() *ARTExplainerSpec
- func (in *ARTExplainerSpec) DeepCopyInto(out *ARTExplainerSpec)
- func (s *ARTExplainerSpec) Default(config *InferenceServicesConfig)
- func (s *ARTExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (s *ARTExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
- func (s *ARTExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
- func (s *ARTExplainerSpec) GetStorageUri() *string
- func (s *ARTExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
- func (s *ARTExplainerSpec) Validate() error
- type ARTExplainerType
- type AlibiExplainerSpec
- func (in *AlibiExplainerSpec) DeepCopy() *AlibiExplainerSpec
- func (in *AlibiExplainerSpec) DeepCopyInto(out *AlibiExplainerSpec)
- func (s *AlibiExplainerSpec) Default(config *InferenceServicesConfig)
- func (s *AlibiExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (s *AlibiExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
- func (s *AlibiExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
- func (s *AlibiExplainerSpec) GetStorageUri() *string
- func (s *AlibiExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
- func (s *AlibiExplainerSpec) Validate() error
- type AlibiExplainerType
- type Batcher
- type Component
- type ComponentExtensionSpec
- type ComponentImplementation
- type ComponentStatusSpec
- type ComponentType
- type CustomExplainer
- func (in *CustomExplainer) DeepCopy() *CustomExplainer
- func (in *CustomExplainer) DeepCopyInto(out *CustomExplainer)
- func (c *CustomExplainer) Default(config *InferenceServicesConfig)
- func (c *CustomExplainer) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (c *CustomExplainer) GetProtocol() constants.InferenceServiceProtocol
- func (c *CustomExplainer) GetStorageUri() *string
- func (c *CustomExplainer) IsMMS(config *InferenceServicesConfig) bool
- func (s *CustomExplainer) Validate() error
- type CustomPredictor
- func (in *CustomPredictor) DeepCopy() *CustomPredictor
- func (in *CustomPredictor) DeepCopyInto(out *CustomPredictor)
- func (c *CustomPredictor) Default(config *InferenceServicesConfig)
- func (c *CustomPredictor) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (c *CustomPredictor) GetProtocol() constants.InferenceServiceProtocol
- func (c *CustomPredictor) GetStorageUri() *string
- func (c *CustomPredictor) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (c *CustomPredictor) IsMMS(config *InferenceServicesConfig) bool
- func (c *CustomPredictor) Validate() error
- type CustomTransformer
- func (in *CustomTransformer) DeepCopy() *CustomTransformer
- func (in *CustomTransformer) DeepCopyInto(out *CustomTransformer)
- func (c *CustomTransformer) Default(config *InferenceServicesConfig)
- func (c *CustomTransformer) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (c *CustomTransformer) GetProtocol() constants.InferenceServiceProtocol
- func (c *CustomTransformer) GetStorageUri() *string
- func (c *CustomTransformer) IsMMS(config *InferenceServicesConfig) bool
- func (c *CustomTransformer) Validate() error
- type ExplainerConfig
- type ExplainerExtensionSpec
- type ExplainerSpec
- func (in *ExplainerSpec) DeepCopy() *ExplainerSpec
- func (in *ExplainerSpec) DeepCopyInto(out *ExplainerSpec)
- func (s *ExplainerSpec) GetExtensions() *ComponentExtensionSpec
- func (s *ExplainerSpec) GetImplementation() ComponentImplementation
- func (s *ExplainerSpec) GetImplementations() []ComponentImplementation
- type ExplainersConfig
- type InferenceService
- func (in *InferenceService) DeepCopy() *InferenceService
- func (in *InferenceService) DeepCopyInto(out *InferenceService)
- func (in *InferenceService) DeepCopyObject() runtime.Object
- func (isvc *InferenceService) Default()
- func (isvc *InferenceService) DefaultInferenceService(config *InferenceServicesConfig)
- func (*InferenceService) Hub()
- func (isvc *InferenceService) ValidateCreate() error
- func (isvc *InferenceService) ValidateDelete() error
- func (isvc *InferenceService) ValidateUpdate(old runtime.Object) error
- type InferenceServiceList
- type InferenceServiceSpec
- type InferenceServiceStatus
- func (in *InferenceServiceStatus) DeepCopy() *InferenceServiceStatus
- func (in *InferenceServiceStatus) DeepCopyInto(out *InferenceServiceStatus)
- func (ss *InferenceServiceStatus) GetCondition(t apis.ConditionType) *apis.Condition
- func (ss *InferenceServiceStatus) InitializeConditions()
- func (ss *InferenceServiceStatus) IsConditionReady(t apis.ConditionType) bool
- func (ss *InferenceServiceStatus) IsReady() bool
- func (ss *InferenceServiceStatus) PropagateStatus(component ComponentType, serviceStatus *knservingv1.ServiceStatus)
- func (ss *InferenceServiceStatus) SetCondition(conditionType apis.ConditionType, condition *apis.Condition)
- type InferenceServicesConfig
- type IngressConfig
- type LightGBMSpec
- func (in *LightGBMSpec) DeepCopy() *LightGBMSpec
- func (in *LightGBMSpec) DeepCopyInto(out *LightGBMSpec)
- func (x *LightGBMSpec) Default(config *InferenceServicesConfig)
- func (x *LightGBMSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (x *LightGBMSpec) GetProtocol() constants.InferenceServiceProtocol
- func (x *LightGBMSpec) GetStorageUri() *string
- func (x *LightGBMSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (x *LightGBMSpec) IsMMS(config *InferenceServicesConfig) bool
- func (x *LightGBMSpec) Validate() error
- type LoggerSpec
- type LoggerType
- type ONNXRuntimeSpec
- func (in *ONNXRuntimeSpec) DeepCopy() *ONNXRuntimeSpec
- func (in *ONNXRuntimeSpec) DeepCopyInto(out *ONNXRuntimeSpec)
- func (o *ONNXRuntimeSpec) Default(config *InferenceServicesConfig)
- func (o *ONNXRuntimeSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (o *ONNXRuntimeSpec) GetProtocol() constants.InferenceServiceProtocol
- func (o *ONNXRuntimeSpec) GetStorageUri() *string
- func (o *ONNXRuntimeSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (o *ONNXRuntimeSpec) IsMMS(config *InferenceServicesConfig) bool
- func (o *ONNXRuntimeSpec) Validate() error
- type PMMLSpec
- func (in *PMMLSpec) DeepCopy() *PMMLSpec
- func (in *PMMLSpec) DeepCopyInto(out *PMMLSpec)
- func (p *PMMLSpec) Default(config *InferenceServicesConfig)
- func (p *PMMLSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (p *PMMLSpec) GetProtocol() constants.InferenceServiceProtocol
- func (p *PMMLSpec) GetStorageUri() *string
- func (p *PMMLSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (p *PMMLSpec) IsMMS(config *InferenceServicesConfig) bool
- func (p *PMMLSpec) Validate() error
- type PaddleServerSpec
- func (in *PaddleServerSpec) DeepCopy() *PaddleServerSpec
- func (in *PaddleServerSpec) DeepCopyInto(out *PaddleServerSpec)
- func (p *PaddleServerSpec) Default(config *InferenceServicesConfig)
- func (p *PaddleServerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (p *PaddleServerSpec) GetProtocol() constants.InferenceServiceProtocol
- func (p *PaddleServerSpec) GetStorageUri() *string
- func (p *PaddleServerSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (p *PaddleServerSpec) IsMMS(config *InferenceServicesConfig) bool
- func (p *PaddleServerSpec) Validate() error
- type PodSpec
- type PredictorConfig
- type PredictorExtensionSpec
- type PredictorImplementation
- type PredictorProtocols
- type PredictorSpec
- func (in *PredictorSpec) DeepCopy() *PredictorSpec
- func (in *PredictorSpec) DeepCopyInto(out *PredictorSpec)
- func (s *PredictorSpec) GetExtensions() *ComponentExtensionSpec
- func (s *PredictorSpec) GetImplementation() ComponentImplementation
- func (s *PredictorSpec) GetImplementations() []ComponentImplementation
- func (s *PredictorSpec) GetPredictorImplementation() *PredictorImplementation
- func (s *PredictorSpec) GetPredictorImplementations() []PredictorImplementation
- type PredictorsConfig
- type SKLearnSpec
- func (in *SKLearnSpec) DeepCopy() *SKLearnSpec
- func (in *SKLearnSpec) DeepCopyInto(out *SKLearnSpec)
- func (k *SKLearnSpec) Default(config *InferenceServicesConfig)
- func (k *SKLearnSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (k *SKLearnSpec) GetProtocol() constants.InferenceServiceProtocol
- func (k *SKLearnSpec) GetStorageUri() *string
- func (k *SKLearnSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (k *SKLearnSpec) IsMMS(config *InferenceServicesConfig) bool
- func (k *SKLearnSpec) Validate() error
- type TFServingSpec
- func (in *TFServingSpec) DeepCopy() *TFServingSpec
- func (in *TFServingSpec) DeepCopyInto(out *TFServingSpec)
- func (t *TFServingSpec) Default(config *InferenceServicesConfig)
- func (t *TFServingSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (t *TFServingSpec) GetProtocol() constants.InferenceServiceProtocol
- func (t *TFServingSpec) GetStorageUri() *string
- func (t *TFServingSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (t *TFServingSpec) IsMMS(config *InferenceServicesConfig) bool
- func (t *TFServingSpec) Validate() error
- type TorchServeSpec
- func (in *TorchServeSpec) DeepCopy() *TorchServeSpec
- func (in *TorchServeSpec) DeepCopyInto(out *TorchServeSpec)
- func (t *TorchServeSpec) Default(config *InferenceServicesConfig)
- func (t *TorchServeSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (t *TorchServeSpec) GetContainerV1(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (t *TorchServeSpec) GetContainerV2(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (t *TorchServeSpec) GetProtocol() constants.InferenceServiceProtocol
- func (t *TorchServeSpec) GetStorageUri() *string
- func (t *TorchServeSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (t *TorchServeSpec) IsMMS(config *InferenceServicesConfig) bool
- func (t *TorchServeSpec) Validate() error
- type TransformerConfig
- type TransformerSpec
- func (in *TransformerSpec) DeepCopy() *TransformerSpec
- func (in *TransformerSpec) DeepCopyInto(out *TransformerSpec)
- func (s *TransformerSpec) GetExtensions() *ComponentExtensionSpec
- func (s *TransformerSpec) GetImplementation() ComponentImplementation
- func (s *TransformerSpec) GetImplementations() []ComponentImplementation
- type TransformersConfig
- type TritonSpec
- func (in *TritonSpec) DeepCopy() *TritonSpec
- func (in *TritonSpec) DeepCopyInto(out *TritonSpec)
- func (t *TritonSpec) Default(config *InferenceServicesConfig)
- func (t *TritonSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (t *TritonSpec) GetProtocol() constants.InferenceServiceProtocol
- func (t *TritonSpec) GetStorageUri() *string
- func (t *TritonSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (t *TritonSpec) IsMMS(config *InferenceServicesConfig) bool
- func (t *TritonSpec) Validate() error
- type XGBoostSpec
- func (in *XGBoostSpec) DeepCopy() *XGBoostSpec
- func (in *XGBoostSpec) DeepCopyInto(out *XGBoostSpec)
- func (x *XGBoostSpec) Default(config *InferenceServicesConfig)
- func (x *XGBoostSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, ...) *v1.Container
- func (x *XGBoostSpec) GetProtocol() constants.InferenceServiceProtocol
- func (x *XGBoostSpec) GetStorageUri() *string
- func (x *XGBoostSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
- func (x *XGBoostSpec) IsMMS(config *InferenceServicesConfig) bool
- func (x *XGBoostSpec) Validate() error
Constants ¶
const ( MinReplicasShouldBeLessThanMaxError = "MinReplicas cannot be greater than MaxReplicas." MinReplicasLowerBoundExceededError = "MinReplicas cannot be less than 0." MaxReplicasLowerBoundExceededError = "MaxReplicas cannot be less than 0." ParallelismLowerBoundExceededError = "Parallelism cannot be less than 0." UnsupportedStorageURIFormatError = "" /* 156-byte string literal not displayed */ InvalidLoggerType = "Invalid logger type" InvalidISVCNameFormatError = "" /* 235-byte string literal not displayed */ MaxWorkersShouldBeLessThanMaxError = "Workers cannot be greater than %d" InvalidWorkerArgument = "Invalid workers argument" InvalidProtocol = "Invalid protocol %s. Must be one of [%s]" )
Known error messages
const ( PredictorConfigKeyName = "predictors" TransformerConfigKeyName = "transformers" ExplainerConfigKeyName = "explainers" )
ConfigMap Keys
const ( // PredictorRouteReady is set when network configuration has completed. PredictorRouteReady apis.ConditionType = "PredictorRouteReady" // TransformerRouteReady is set when network configuration has completed. TransformerRouteReady apis.ConditionType = "TransformerRouteReady" // ExplainerRoutesReady is set when network configuration has completed. ExplainerRoutesReady apis.ConditionType = "ExplainerRoutesReady" // PredictorConfigurationReady is set when predictor pods are ready. PredictorConfigurationReady apis.ConditionType = "PredictorConfigurationReady" // TransformerConfigurationeReady is set when transformer pods are ready. TransformerConfigurationeReady apis.ConditionType = "TransformerConfigurationeReady" // ExplainerConfigurationReady is set when explainer pods are ready. ExplainerConfigurationReady apis.ConditionType = "ExplainerConfigurationReady" // PredictorReady is set when predictor has reported readiness. PredictorReady apis.ConditionType = "PredictorReady" // TransformerReady is set when transformer has reported readiness. TransformerReady apis.ConditionType = "TransformerReady" // ExplainerReady is set when explainer has reported readiness. ExplainerReady apis.ConditionType = "ExplainerReady" // IngressReady is set when the ingress has been created. IngressReady apis.ConditionType = "IngressReady" )
ConditionType represents a Service condition value
const ( DefaultPyTorchModelClassName = "PyTorchModel" PyTorchServingGPUSuffix = "-gpu" InvalidPyTorchRuntimeIncludesGPU = "PyTorch RuntimeVersion is not GPU enabled but GPU resources are requested. " InvalidPyTorchRuntimeExcludesGPU = "PyTorch RuntimeVersion is GPU enabled but GPU resources are not requested. " InvalidInferenceProtocolVersion = "PyTorch ProtocolVersion v2 is not supported" )
const (
IngressConfigKeyName = "ingress"
)
const (
IsvcNameFmt string = "[a-z]([-a-z0-9]*[a-z0-9])?"
)
regular expressions for validation of isvc name
Variables ¶
var ( SupportedStorageURIPrefixList = []string{"gs://", "s3://", "pvc://", "file://", "https://", "http://"} AzureBlobURL = "blob.core.windows.net" AzureBlobURIRegEx = "https://(.+?).blob.core.windows.net/(.+)" )
Constants
var ( ONNXServingRestPort = "8080" ONNXServingGRPCPort = "9000" ONNXFileExt = ".onnx" DefaultONNXFileName = "model.onnx" )
var ( TensorflowEntrypointCommand = "/usr/bin/tensorflow_model_server" TensorflowServingGRPCPort = "9000" TensorflowServingRestPort = "8080" TensorflowServingGPUSuffix = "-gpu" InvalidTensorflowRuntimeVersionError = "Tensorflow RuntimeVersion must be one of %s" InvalidTensorflowRuntimeIncludesGPU = "Tensorflow RuntimeVersion is not GPU enabled but GPU resources are requested. " + InvalidTensorflowRuntimeVersionError InvalidTensorflowRuntimeExcludesGPU = "Tensorflow RuntimeVersion is GPU enabled but GPU resources are not requested. " + InvalidTensorflowRuntimeVersionError )
var ( TritonISGRPCPort = int32(9000) TritonISRestPort = int32(8080) )
var ( // APIVersion is the current API version used to register these objects APIVersion = "v1beta1" // SchemeGroupVersion is group version used to register these objects SchemeGroupVersion = schema.GroupVersion{Group: constants.KFServingAPIGroupName, Version: APIVersion} // SchemeBuilder is used to add go types to the GroupVersionKind scheme SchemeBuilder = &scheme.Builder{GroupVersion: SchemeGroupVersion} // AddToScheme is required by pkg/client/... AddToScheme = SchemeBuilder.AddToScheme )
var ( // regular expressions for validation of isvc name IsvcRegexp = regexp.MustCompile("^" + IsvcNameFmt + "$") )
Functions ¶
func ExactlyOneErrorFor ¶
ExactlyOneErrorFor creates an error for the component's one-of semantic.
func GetIntReference ¶
GetIntReference returns the pointer for the integer input
func GetOpenAPIDefinitions ¶
func GetOpenAPIDefinitions(ref common.ReferenceCallback) map[string]common.OpenAPIDefinition
func IsPrefixStorageURISupported ¶ added in v0.5.1
func Resource ¶
func Resource(resource string) schema.GroupResource
Resource is required by pkg/client/listers/...
func ValidateMaxArgumentWorkers ¶ added in v0.6.0
ValidateMaxArgumentWorkers validates that the workers count does not exceed the allowed maximum.
Types ¶
type AIXExplainerSpec ¶
type AIXExplainerSpec struct {
// The type of AIX explainer
Type AIXExplainerType `json:"type"`
// Contains fields shared across all explainers
ExplainerExtensionSpec `json:",inline"`
}
AIXExplainerSpec defines the arguments for configuring an AIX Explanation Server
func (*AIXExplainerSpec) DeepCopy ¶
func (in *AIXExplainerSpec) DeepCopy() *AIXExplainerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new AIXExplainerSpec.
func (*AIXExplainerSpec) DeepCopyInto ¶
func (in *AIXExplainerSpec) DeepCopyInto(out *AIXExplainerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*AIXExplainerSpec) Default ¶
func (s *AIXExplainerSpec) Default(config *InferenceServicesConfig)
func (*AIXExplainerSpec) GetContainer ¶
func (s *AIXExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
func (*AIXExplainerSpec) GetProtocol ¶
func (s *AIXExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
func (*AIXExplainerSpec) GetResourceRequirements ¶
func (s *AIXExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
func (*AIXExplainerSpec) GetStorageUri ¶
func (s *AIXExplainerSpec) GetStorageUri() *string
func (*AIXExplainerSpec) IsMMS ¶
func (s *AIXExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
type AIXExplainerType ¶
type AIXExplainerType string
const (
AIXLimeImageExplainer AIXExplainerType = "LimeImages"
)
type ARTExplainerSpec ¶
type ARTExplainerSpec struct {
// The type of ART explainer
Type ARTExplainerType `json:"type"`
// Contains fields shared across all explainers
ExplainerExtensionSpec `json:",inline"`
}
ARTExplainerSpec defines the arguments for configuring an ART Explanation Server
func (*ARTExplainerSpec) DeepCopy ¶
func (in *ARTExplainerSpec) DeepCopy() *ARTExplainerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ARTExplainerSpec.
func (*ARTExplainerSpec) DeepCopyInto ¶
func (in *ARTExplainerSpec) DeepCopyInto(out *ARTExplainerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*ARTExplainerSpec) Default ¶
func (s *ARTExplainerSpec) Default(config *InferenceServicesConfig)
func (*ARTExplainerSpec) GetContainer ¶
func (s *ARTExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
func (*ARTExplainerSpec) GetProtocol ¶
func (s *ARTExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
func (*ARTExplainerSpec) GetResourceRequirements ¶
func (s *ARTExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
func (*ARTExplainerSpec) GetStorageUri ¶
func (s *ARTExplainerSpec) GetStorageUri() *string
func (*ARTExplainerSpec) IsMMS ¶
func (s *ARTExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
type ARTExplainerType ¶
type ARTExplainerType string
const (
ARTSquareAttackExplainer ARTExplainerType = "SquareAttack"
)
type AlibiExplainerSpec ¶
type AlibiExplainerSpec struct {
// The type of Alibi explainer <br />
// Valid values are: <br />
// - "AnchorTabular"; <br />
// - "AnchorImages"; <br />
// - "AnchorText"; <br />
// - "Counterfactuals"; <br />
// - "Contrastive"; <br />
Type AlibiExplainerType `json:"type"`
// Contains fields shared across all explainers
ExplainerExtensionSpec `json:",inline"`
}
AlibiExplainerSpec defines the arguments for configuring an Alibi Explanation Server
func (*AlibiExplainerSpec) DeepCopy ¶
func (in *AlibiExplainerSpec) DeepCopy() *AlibiExplainerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new AlibiExplainerSpec.
func (*AlibiExplainerSpec) DeepCopyInto ¶
func (in *AlibiExplainerSpec) DeepCopyInto(out *AlibiExplainerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*AlibiExplainerSpec) Default ¶
func (s *AlibiExplainerSpec) Default(config *InferenceServicesConfig)
func (*AlibiExplainerSpec) GetContainer ¶
func (s *AlibiExplainerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
func (*AlibiExplainerSpec) GetProtocol ¶
func (s *AlibiExplainerSpec) GetProtocol() constants.InferenceServiceProtocol
func (*AlibiExplainerSpec) GetResourceRequirements ¶
func (s *AlibiExplainerSpec) GetResourceRequirements() *v1.ResourceRequirements
func (*AlibiExplainerSpec) GetStorageUri ¶
func (s *AlibiExplainerSpec) GetStorageUri() *string
func (*AlibiExplainerSpec) IsMMS ¶
func (s *AlibiExplainerSpec) IsMMS(config *InferenceServicesConfig) bool
func (*AlibiExplainerSpec) Validate ¶
func (s *AlibiExplainerSpec) Validate() error
Validate the spec
type AlibiExplainerType ¶
type AlibiExplainerType string
AlibiExplainerType is the explanation method
const ( AlibiAnchorsTabularExplainer AlibiExplainerType = "AnchorTabular" AlibiAnchorsImageExplainer AlibiExplainerType = "AnchorImages" AlibiAnchorsTextExplainer AlibiExplainerType = "AnchorText" AlibiCounterfactualsExplainer AlibiExplainerType = "Counterfactuals" AlibiContrastiveExplainer AlibiExplainerType = "Contrastive" )
AlibiExplainerType Enum
type Batcher ¶
type Batcher struct {
// Specifies the max number of requests to trigger a batch
// +optional
MaxBatchSize *int `json:"maxBatchSize,omitempty"`
// Specifies the max latency to trigger a batch
// +optional
MaxLatency *int `json:"maxLatency,omitempty"`
// Specifies the timeout of a batch
// +optional
Timeout *int `json:"timeout,omitempty"`
}
Batcher specifies optional payload batching available for all components
func (*Batcher) DeepCopy ¶
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new Batcher.
func (*Batcher) DeepCopyInto ¶
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type Component ¶
type Component interface {
GetImplementation() ComponentImplementation
GetImplementations() []ComponentImplementation
GetExtensions() *ComponentExtensionSpec
}
Component interface is implemented by all specs that contain component implementations, e.g. PredictorSpec, ExplainerSpec, TransformerSpec. +kubebuilder:object:generate=false
type ComponentExtensionSpec ¶
type ComponentExtensionSpec struct {
// Minimum number of replicas, defaults to 1 but can be set to 0 to enable scale-to-zero.
// +optional
MinReplicas *int `json:"minReplicas,omitempty"`
// Maximum number of replicas for autoscaling.
// +optional
MaxReplicas int `json:"maxReplicas,omitempty"`
// ContainerConcurrency specifies how many requests can be processed concurrently, this sets the hard limit of the container
// concurrency(https://knative.dev/docs/serving/autoscaling/concurrency).
// +optional
ContainerConcurrency *int64 `json:"containerConcurrency,omitempty"`
// TimeoutSeconds specifies the number of seconds to wait before timing out a request to the component.
// +optional
TimeoutSeconds *int64 `json:"timeout,omitempty"`
// CanaryTrafficPercent defines the traffic split percentage between the candidate revision and the last ready revision
// +optional
CanaryTrafficPercent *int64 `json:"canaryTrafficPercent,omitempty"`
// Activate request/response logging and logger configurations
// +optional
Logger *LoggerSpec `json:"logger,omitempty"`
// Activate request batching and batching configurations
// +optional
Batcher *Batcher `json:"batcher,omitempty"`
}
ComponentExtensionSpec defines the deployment configuration for a given InferenceService component
func (*ComponentExtensionSpec) DeepCopy ¶
func (in *ComponentExtensionSpec) DeepCopy() *ComponentExtensionSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ComponentExtensionSpec.
func (*ComponentExtensionSpec) DeepCopyInto ¶
func (in *ComponentExtensionSpec) DeepCopyInto(out *ComponentExtensionSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*ComponentExtensionSpec) Default ¶
func (s *ComponentExtensionSpec) Default(config *InferenceServicesConfig)
Default the ComponentExtensionSpec
func (*ComponentExtensionSpec) Validate ¶
func (s *ComponentExtensionSpec) Validate() error
Validate the ComponentExtensionSpec
type ComponentImplementation ¶
type ComponentImplementation interface {
Default(config *InferenceServicesConfig)
Validate() error
GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetStorageUri() *string
GetProtocol() constants.InferenceServiceProtocol
IsMMS(config *InferenceServicesConfig) bool
}
ComponentImplementation interface is implemented by predictor, transformer, and explainer implementations +kubebuilder:object:generate=false
func FirstNonNilComponent ¶
func FirstNonNilComponent(objects []ComponentImplementation) ComponentImplementation
FirstNonNilComponent returns the first non nil object or returns nil
func NonNilComponents ¶
func NonNilComponents(objects []ComponentImplementation) (results []ComponentImplementation)
NonNilComponents returns components that are not nil
type ComponentStatusSpec ¶
type ComponentStatusSpec struct {
// Latest revision name that is in ready state
// +optional
LatestReadyRevision string `json:"latestReadyRevision,omitempty"`
// Latest revision name that is created
// +optional
LatestCreatedRevision string `json:"latestCreatedRevision,omitempty"`
// Previous revision name that is rolled out with 100 percent traffic
// +optional
PreviousRolledoutRevision string `json:"previousRolledoutRevision,omitempty"`
// Latest revision name that is rolled out with 100 percent traffic
// +optional
LatestRolledoutRevision string `json:"latestRolledoutRevision,omitempty"`
// Traffic holds the configured traffic distribution for latest ready revision and previous rolled out revision.
// +optional
Traffic []knservingv1.TrafficTarget `json:"traffic,omitempty"`
// URL holds the url that will distribute traffic over the provided traffic targets.
// It generally has the form http[s]://{route-name}.{route-namespace}.{cluster-level-suffix}
// +optional
URL *apis.URL `json:"url,omitempty"`
// Addressable endpoint for the InferenceService
// +optional
Address *duckv1.Addressable `json:"address,omitempty"`
}
ComponentStatusSpec describes the state of the component
func (*ComponentStatusSpec) DeepCopy ¶
func (in *ComponentStatusSpec) DeepCopy() *ComponentStatusSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ComponentStatusSpec.
func (*ComponentStatusSpec) DeepCopyInto ¶
func (in *ComponentStatusSpec) DeepCopyInto(out *ComponentStatusSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type ComponentType ¶
type ComponentType string
ComponentType contains the different types of components of the service
const ( PredictorComponent ComponentType = "predictor" ExplainerComponent ComponentType = "explainer" TransformerComponent ComponentType = "transformer" )
ComponentType Enum
type CustomExplainer ¶
CustomExplainer defines arguments for configuring a custom explainer.
func NewCustomExplainer ¶
func NewCustomExplainer(podSpec *PodSpec) *CustomExplainer
func (*CustomExplainer) DeepCopy ¶
func (in *CustomExplainer) DeepCopy() *CustomExplainer
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new CustomExplainer.
func (*CustomExplainer) DeepCopyInto ¶
func (in *CustomExplainer) DeepCopyInto(out *CustomExplainer)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*CustomExplainer) Default ¶
func (c *CustomExplainer) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*CustomExplainer) GetContainer ¶
func (c *CustomExplainer) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*CustomExplainer) GetProtocol ¶
func (c *CustomExplainer) GetProtocol() constants.InferenceServiceProtocol
func (*CustomExplainer) GetStorageUri ¶
func (c *CustomExplainer) GetStorageUri() *string
func (*CustomExplainer) IsMMS ¶
func (c *CustomExplainer) IsMMS(config *InferenceServicesConfig) bool
type CustomPredictor ¶
CustomPredictor defines arguments for configuring a custom server.
func NewCustomPredictor ¶
func NewCustomPredictor(podSpec *PodSpec) *CustomPredictor
func (*CustomPredictor) DeepCopy ¶
func (in *CustomPredictor) DeepCopy() *CustomPredictor
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new CustomPredictor.
func (*CustomPredictor) DeepCopyInto ¶
func (in *CustomPredictor) DeepCopyInto(out *CustomPredictor)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*CustomPredictor) Default ¶
func (c *CustomPredictor) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*CustomPredictor) GetContainer ¶
func (c *CustomPredictor) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*CustomPredictor) GetProtocol ¶
func (c *CustomPredictor) GetProtocol() constants.InferenceServiceProtocol
func (*CustomPredictor) GetStorageUri ¶
func (c *CustomPredictor) GetStorageUri() *string
func (*CustomPredictor) IsFrameworkSupported ¶ added in v0.6.0
func (c *CustomPredictor) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*CustomPredictor) IsMMS ¶
func (c *CustomPredictor) IsMMS(config *InferenceServicesConfig) bool
func (*CustomPredictor) Validate ¶
func (c *CustomPredictor) Validate() error
Validate returns an error if invalid
type CustomTransformer ¶
CustomTransformer defines arguments for configuring a custom transformer.
func NewCustomTransformer ¶
func NewCustomTransformer(podSpec *PodSpec) *CustomTransformer
func (*CustomTransformer) DeepCopy ¶
func (in *CustomTransformer) DeepCopy() *CustomTransformer
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new CustomTransformer.
func (*CustomTransformer) DeepCopyInto ¶
func (in *CustomTransformer) DeepCopyInto(out *CustomTransformer)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*CustomTransformer) Default ¶
func (c *CustomTransformer) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*CustomTransformer) GetContainer ¶
func (c *CustomTransformer) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*CustomTransformer) GetProtocol ¶
func (c *CustomTransformer) GetProtocol() constants.InferenceServiceProtocol
func (*CustomTransformer) GetStorageUri ¶
func (c *CustomTransformer) GetStorageUri() *string
func (*CustomTransformer) IsMMS ¶
func (c *CustomTransformer) IsMMS(config *InferenceServicesConfig) bool
func (*CustomTransformer) Validate ¶
func (c *CustomTransformer) Validate() error
Validate returns an error if invalid
type ExplainerConfig ¶
type ExplainerConfig struct {
// explainer docker image name
ContainerImage string `json:"image"`
// default explainer docker image version
DefaultImageVersion string `json:"defaultImageVersion"`
}
+kubebuilder:object:generate=false
type ExplainerExtensionSpec ¶
type ExplainerExtensionSpec struct {
// The location of a trained explanation model
StorageURI string `json:"storageUri,omitempty"`
// Defaults to latest Explainer Version
RuntimeVersion *string `json:"runtimeVersion,omitempty"`
// Inline custom parameter settings for explainer
Config map[string]string `json:"config,omitempty"`
// Container enables overrides for the explainer.
// Each framework will have different defaults that are populated in the underlying container spec.
// +optional
v1.Container `json:",inline"`
}
ExplainerExtensionSpec defines configuration shared across all explainer frameworks
func (*ExplainerExtensionSpec) DeepCopy ¶
func (in *ExplainerExtensionSpec) DeepCopy() *ExplainerExtensionSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ExplainerExtensionSpec.
func (*ExplainerExtensionSpec) DeepCopyInto ¶
func (in *ExplainerExtensionSpec) DeepCopyInto(out *ExplainerExtensionSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type ExplainerSpec ¶
type ExplainerSpec struct {
// Spec for alibi explainer
Alibi *AlibiExplainerSpec `json:"alibi,omitempty"`
// Spec for AIX explainer
AIX *AIXExplainerSpec `json:"aix,omitempty"`
// Spec for ART explainer
ART *ARTExplainerSpec `json:"art,omitempty"`
// This spec is dual purpose.
// 1) Users may choose to provide a full PodSpec for their custom explainer.
// The field PodSpec.Containers is mutually exclusive with other explainers (i.e. Alibi).
// 2) Users may choose to provide an Explainer (i.e. Alibi) and specify PodSpec
// overrides in the PodSpec. They must not provide PodSpec.Containers in this case.
PodSpec `json:",inline"`
// Component extension defines the deployment configurations for explainer
ComponentExtensionSpec `json:",inline"`
}
ExplainerSpec defines the container spec for a model explanation server. The following fields follow a "1-of" semantic. Users must specify exactly one spec.
func (*ExplainerSpec) DeepCopy ¶
func (in *ExplainerSpec) DeepCopy() *ExplainerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ExplainerSpec.
func (*ExplainerSpec) DeepCopyInto ¶
func (in *ExplainerSpec) DeepCopyInto(out *ExplainerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*ExplainerSpec) GetExtensions ¶
func (s *ExplainerSpec) GetExtensions() *ComponentExtensionSpec
GetExtensions returns the extensions for the component
func (*ExplainerSpec) GetImplementation ¶
func (s *ExplainerSpec) GetImplementation() ComponentImplementation
GetImplementation returns the implementation for the component
func (*ExplainerSpec) GetImplementations ¶
func (s *ExplainerSpec) GetImplementations() []ComponentImplementation
GetImplementations returns the implementations for the component
type ExplainersConfig ¶
type ExplainersConfig struct {
AlibiExplainer ExplainerConfig `json:"alibi,omitempty"`
AIXExplainer ExplainerConfig `json:"aix,omitempty"`
ARTExplainer ExplainerConfig `json:"art,omitempty"`
}
+kubebuilder:object:generate=false
type InferenceService ¶
type InferenceService struct {
metav1.TypeMeta `json:",inline"`
metav1.ObjectMeta `json:"metadata,omitempty"`
Spec InferenceServiceSpec `json:"spec,omitempty"`
Status InferenceServiceStatus `json:"status,omitempty"`
}
InferenceService is the Schema for the InferenceServices API +k8s:openapi-gen=true +kubebuilder:object:root=true +kubebuilder:subresource:status +kubebuilder:printcolumn:name="URL",type="string",JSONPath=".status.url" +kubebuilder:printcolumn:name="Ready",type="string",JSONPath=".status.conditions[?(@.type=='Ready')].status" +kubebuilder:printcolumn:name="Prev",type="integer",JSONPath=".status.components.predictor.traffic[?(@.tag=='prev')].percent" +kubebuilder:printcolumn:name="Latest",type="integer",JSONPath=".status.components.predictor.traffic[?(@.latestRevision==true)].percent" +kubebuilder:printcolumn:name="PrevRolledoutRevision",type="string",JSONPath=".status.components.predictor.traffic[?(@.tag=='prev')].revisionName" +kubebuilder:printcolumn:name="LatestReadyRevision",type="string",JSONPath=".status.components.predictor.traffic[?(@.latestRevision==true)].revisionName" +kubebuilder:printcolumn:name="Age",type="date",JSONPath=".metadata.creationTimestamp" +kubebuilder:resource:path=inferenceservices,shortName=isvc +kubebuilder:storageversion
func (*InferenceService) DeepCopy ¶
func (in *InferenceService) DeepCopy() *InferenceService
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new InferenceService.
func (*InferenceService) DeepCopyInto ¶
func (in *InferenceService) DeepCopyInto(out *InferenceService)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*InferenceService) DeepCopyObject ¶
func (in *InferenceService) DeepCopyObject() runtime.Object
DeepCopyObject is an autogenerated deepcopy function, copying the receiver, creating a new runtime.Object.
func (*InferenceService) Default ¶
func (isvc *InferenceService) Default()
func (*InferenceService) DefaultInferenceService ¶
func (isvc *InferenceService) DefaultInferenceService(config *InferenceServicesConfig)
func (*InferenceService) Hub ¶
func (*InferenceService) Hub()
func (*InferenceService) ValidateCreate ¶
func (isvc *InferenceService) ValidateCreate() error
ValidateCreate implements webhook.Validator so a webhook will be registered for the type
func (*InferenceService) ValidateDelete ¶
func (isvc *InferenceService) ValidateDelete() error
ValidateDelete implements webhook.Validator so a webhook will be registered for the type
func (*InferenceService) ValidateUpdate ¶
func (isvc *InferenceService) ValidateUpdate(old runtime.Object) error
ValidateUpdate implements webhook.Validator so a webhook will be registered for the type
type InferenceServiceList ¶
type InferenceServiceList struct {
metav1.TypeMeta `json:",inline"`
metav1.ListMeta `json:"metadata,omitempty"`
// +listType=set
Items []InferenceService `json:"items"`
}
InferenceServiceList contains a list of Service +kubebuilder:object:root=true
func (*InferenceServiceList) DeepCopy ¶
func (in *InferenceServiceList) DeepCopy() *InferenceServiceList
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new InferenceServiceList.
func (*InferenceServiceList) DeepCopyInto ¶
func (in *InferenceServiceList) DeepCopyInto(out *InferenceServiceList)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*InferenceServiceList) DeepCopyObject ¶
func (in *InferenceServiceList) DeepCopyObject() runtime.Object
DeepCopyObject is an autogenerated deepcopy function, copying the receiver, creating a new runtime.Object.
type InferenceServiceSpec ¶
type InferenceServiceSpec struct {
// Predictor defines the model serving spec
// +required
Predictor PredictorSpec `json:"predictor"`
// Explainer defines the model explanation service spec,
// explainer service calls to predictor or transformer if it is specified.
// +optional
Explainer *ExplainerSpec `json:"explainer,omitempty"`
// Transformer defines the pre/post processing before and after the predictor call,
// transformer service calls to predictor service.
// +optional
Transformer *TransformerSpec `json:"transformer,omitempty"`
}
InferenceServiceSpec is the top level type for this resource
func (*InferenceServiceSpec) DeepCopy ¶
func (in *InferenceServiceSpec) DeepCopy() *InferenceServiceSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new InferenceServiceSpec.
func (*InferenceServiceSpec) DeepCopyInto ¶
func (in *InferenceServiceSpec) DeepCopyInto(out *InferenceServiceSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type InferenceServiceStatus ¶
type InferenceServiceStatus struct {
// Conditions for the InferenceService <br/>
// - PredictorReady: predictor readiness condition; <br/>
// - TransformerReady: transformer readiness condition; <br/>
// - ExplainerReady: explainer readiness condition; <br/>
// - RoutesReady: aggregated routing condition; <br/>
// - Ready: aggregated condition; <br/>
duckv1.Status `json:",inline"`
// Addressable endpoint for the InferenceService
// +optional
Address *duckv1.Addressable `json:"address,omitempty"`
// URL holds the url that will distribute traffic over the provided traffic targets.
// It generally has the form http[s]://{route-name}.{route-namespace}.{cluster-level-suffix}
// +optional
URL *apis.URL `json:"url,omitempty"`
// Statuses for the components of the InferenceService
Components map[ComponentType]ComponentStatusSpec `json:"components,omitempty"`
}
InferenceServiceStatus defines the observed state of InferenceService
func (*InferenceServiceStatus) DeepCopy ¶
func (in *InferenceServiceStatus) DeepCopy() *InferenceServiceStatus
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new InferenceServiceStatus.
func (*InferenceServiceStatus) DeepCopyInto ¶
func (in *InferenceServiceStatus) DeepCopyInto(out *InferenceServiceStatus)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*InferenceServiceStatus) GetCondition ¶
func (ss *InferenceServiceStatus) GetCondition(t apis.ConditionType) *apis.Condition
GetCondition returns the condition by name.
func (*InferenceServiceStatus) InitializeConditions ¶
func (ss *InferenceServiceStatus) InitializeConditions()
func (*InferenceServiceStatus) IsConditionReady ¶
func (ss *InferenceServiceStatus) IsConditionReady(t apis.ConditionType) bool
IsConditionReady returns the readiness for a given condition
func (*InferenceServiceStatus) IsReady ¶
func (ss *InferenceServiceStatus) IsReady() bool
IsReady returns if the service is ready to serve the requested configuration.
func (*InferenceServiceStatus) PropagateStatus ¶
func (ss *InferenceServiceStatus) PropagateStatus(component ComponentType, serviceStatus *knservingv1.ServiceStatus)
func (*InferenceServiceStatus) SetCondition ¶
func (ss *InferenceServiceStatus) SetCondition(conditionType apis.ConditionType, condition *apis.Condition)
type InferenceServicesConfig ¶
type InferenceServicesConfig struct {
// Transformer configurations
Transformers TransformersConfig `json:"transformers"`
// Predictor configurations
Predictors PredictorsConfig `json:"predictors"`
// Explainer configurations
Explainers ExplainersConfig `json:"explainers"`
}
+kubebuilder:object:generate=false
func NewInferenceServicesConfig ¶
func NewInferenceServicesConfig(cli client.Client) (*InferenceServicesConfig, error)
type IngressConfig ¶
type IngressConfig struct {
IngressGateway string `json:"ingressGateway,omitempty"`
IngressServiceName string `json:"ingressService,omitempty"`
LocalGateway string `json:"localGateway,omitempty"`
LocalGatewayServiceName string `json:"localGatewayService,omitempty"`
}
+kubebuilder:object:generate=false
func NewIngressConfig ¶
func NewIngressConfig(cli client.Client) (*IngressConfig, error)
type LightGBMSpec ¶
type LightGBMSpec struct {
// Contains fields shared across all predictors
PredictorExtensionSpec `json:",inline"`
}
LightGBMSpec defines arguments for configuring LightGBM model serving.
func (*LightGBMSpec) DeepCopy ¶
func (in *LightGBMSpec) DeepCopy() *LightGBMSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new LightGBMSpec.
func (*LightGBMSpec) DeepCopyInto ¶
func (in *LightGBMSpec) DeepCopyInto(out *LightGBMSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*LightGBMSpec) Default ¶
func (x *LightGBMSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*LightGBMSpec) GetContainer ¶
func (x *LightGBMSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*LightGBMSpec) GetProtocol ¶
func (x *LightGBMSpec) GetProtocol() constants.InferenceServiceProtocol
func (*LightGBMSpec) GetStorageUri ¶
func (x *LightGBMSpec) GetStorageUri() *string
func (*LightGBMSpec) IsFrameworkSupported ¶ added in v0.6.0
func (x *LightGBMSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*LightGBMSpec) IsMMS ¶
func (x *LightGBMSpec) IsMMS(config *InferenceServicesConfig) bool
func (*LightGBMSpec) Validate ¶
func (x *LightGBMSpec) Validate() error
Validate returns an error if invalid
type LoggerSpec ¶
type LoggerSpec struct {
// URL to send logging events
// +optional
URL *string `json:"url,omitempty"`
// Specifies the scope of the loggers. <br />
// Valid values are: <br />
// - "all" (default): log both request and response; <br />
// - "request": log only request; <br />
// - "response": log only response <br />
// +optional
Mode LoggerType `json:"mode,omitempty"`
}
LoggerSpec specifies optional payload logging available for all components
func (*LoggerSpec) DeepCopy ¶
func (in *LoggerSpec) DeepCopy() *LoggerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new LoggerSpec.
func (*LoggerSpec) DeepCopyInto ¶
func (in *LoggerSpec) DeepCopyInto(out *LoggerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type LoggerType ¶
type LoggerType string
LoggerType controls the scope of log publishing +kubebuilder:validation:Enum=all;request;response
const ( // Logger mode to log both request and response LogAll LoggerType = "all" // Logger mode to log only request LogRequest LoggerType = "request" // Logger mode to log only response LogResponse LoggerType = "response" )
LoggerType Enum
type ONNXRuntimeSpec ¶
type ONNXRuntimeSpec struct {
// Contains fields shared across all predictors
PredictorExtensionSpec `json:",inline"`
}
ONNXRuntimeSpec defines arguments for configuring ONNX model serving.
func (*ONNXRuntimeSpec) DeepCopy ¶
func (in *ONNXRuntimeSpec) DeepCopy() *ONNXRuntimeSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new ONNXRuntimeSpec.
func (*ONNXRuntimeSpec) DeepCopyInto ¶
func (in *ONNXRuntimeSpec) DeepCopyInto(out *ONNXRuntimeSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*ONNXRuntimeSpec) Default ¶
func (o *ONNXRuntimeSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*ONNXRuntimeSpec) GetContainer ¶
func (o *ONNXRuntimeSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*ONNXRuntimeSpec) GetProtocol ¶
func (o *ONNXRuntimeSpec) GetProtocol() constants.InferenceServiceProtocol
func (*ONNXRuntimeSpec) GetStorageUri ¶
func (o *ONNXRuntimeSpec) GetStorageUri() *string
func (*ONNXRuntimeSpec) IsFrameworkSupported ¶ added in v0.6.0
func (o *ONNXRuntimeSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*ONNXRuntimeSpec) IsMMS ¶
func (o *ONNXRuntimeSpec) IsMMS(config *InferenceServicesConfig) bool
func (*ONNXRuntimeSpec) Validate ¶
func (o *ONNXRuntimeSpec) Validate() error
Validate returns an error if invalid
type PMMLSpec ¶
type PMMLSpec struct {
// Contains fields shared across all predictors
PredictorExtensionSpec `json:",inline"`
}
PMMLSpec defines arguments for configuring PMML model serving.
func (*PMMLSpec) DeepCopy ¶
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new PMMLSpec.
func (*PMMLSpec) DeepCopyInto ¶
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*PMMLSpec) Default ¶
func (p *PMMLSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*PMMLSpec) GetContainer ¶
func (p *PMMLSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*PMMLSpec) GetProtocol ¶
func (p *PMMLSpec) GetProtocol() constants.InferenceServiceProtocol
func (*PMMLSpec) GetStorageUri ¶
func (*PMMLSpec) IsFrameworkSupported ¶ added in v0.6.0
func (p *PMMLSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*PMMLSpec) IsMMS ¶
func (p *PMMLSpec) IsMMS(config *InferenceServicesConfig) bool
type PaddleServerSpec ¶ added in v0.6.0
type PaddleServerSpec struct {
PredictorExtensionSpec `json:",inline"`
}
func (*PaddleServerSpec) DeepCopy ¶ added in v0.6.0
func (in *PaddleServerSpec) DeepCopy() *PaddleServerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new PaddleServerSpec.
func (*PaddleServerSpec) DeepCopyInto ¶ added in v0.6.0
func (in *PaddleServerSpec) DeepCopyInto(out *PaddleServerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*PaddleServerSpec) Default ¶ added in v0.6.0
func (p *PaddleServerSpec) Default(config *InferenceServicesConfig)
func (*PaddleServerSpec) GetContainer ¶ added in v0.6.0
func (p *PaddleServerSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*PaddleServerSpec) GetProtocol ¶ added in v0.6.0
func (p *PaddleServerSpec) GetProtocol() constants.InferenceServiceProtocol
func (*PaddleServerSpec) GetStorageUri ¶ added in v0.6.0
func (p *PaddleServerSpec) GetStorageUri() *string
func (*PaddleServerSpec) IsFrameworkSupported ¶ added in v0.6.0
func (p *PaddleServerSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*PaddleServerSpec) IsMMS ¶ added in v0.6.0
func (p *PaddleServerSpec) IsMMS(config *InferenceServicesConfig) bool
func (*PaddleServerSpec) Validate ¶ added in v0.6.0
func (p *PaddleServerSpec) Validate() error
type PodSpec ¶
type PodSpec struct {
// List of volumes that can be mounted by containers belonging to the pod.
// More info: https://kubernetes.io/docs/concepts/storage/volumes
// +optional
// +patchMergeKey=name
// +patchStrategy=merge,retainKeys
Volumes []v1.Volume `json:"volumes,omitempty" patchStrategy:"merge,retainKeys" patchMergeKey:"name" protobuf:"bytes,1,rep,name=volumes"`
// List of initialization containers belonging to the pod.
// Init containers are executed in order prior to containers being started. If any
// init container fails, the pod is considered to have failed and is handled according
// to its restartPolicy. The name for an init container or normal container must be
// unique among all containers.
// Init containers may not have Lifecycle actions, Readiness probes, Liveness probes, or Startup probes.
// The resourceRequirements of an init container are taken into account during scheduling
// by finding the highest request/limit for each resource type, and then using the max of
// that value or the sum of the normal containers. Limits are applied to init containers
// in a similar fashion.
// Init containers cannot currently be added or removed.
// Cannot be updated.
// More info: https://kubernetes.io/docs/concepts/workloads/pods/init-containers/
// +patchMergeKey=name
// +patchStrategy=merge
InitContainers []v1.Container `json:"initContainers,omitempty" patchStrategy:"merge" patchMergeKey:"name" protobuf:"bytes,20,rep,name=initContainers"`
// List of containers belonging to the pod.
// Containers cannot currently be added or removed.
// There must be at least one container in a Pod.
// Cannot be updated.
// +patchMergeKey=name
// +patchStrategy=merge
Containers []v1.Container `json:"containers,omitempty" patchStrategy:"merge" patchMergeKey:"name" protobuf:"bytes,2,rep,name=containers"`
// List of ephemeral containers run in this pod. Ephemeral containers may be run in an existing
// pod to perform user-initiated actions such as debugging. This list cannot be specified when
// creating a pod, and it cannot be modified by updating the pod spec. In order to add an
// ephemeral container to an existing pod, use the pod's ephemeralcontainers subresource.
// This field is alpha-level and is only honored by servers that enable the EphemeralContainers feature.
// +optional
// +patchMergeKey=name
// +patchStrategy=merge
// +kubebuilder:pruning:PreserveUnknownFields
EphemeralContainers []v1.EphemeralContainer `` /* 128-byte string literal not displayed */
// Restart policy for all containers within the pod.
// One of Always, OnFailure, Never.
// Default to Always.
// More info: https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#restart-policy
// +optional
RestartPolicy v1.RestartPolicy `json:"restartPolicy,omitempty" protobuf:"bytes,3,opt,name=restartPolicy,casttype=RestartPolicy"`
// Optional duration in seconds the pod needs to terminate gracefully. May be decreased in delete request.
// Value must be non-negative integer. The value zero indicates delete immediately.
// If this value is nil, the default grace period will be used instead.
// The grace period is the duration in seconds after the processes running in the pod are sent
// a termination signal and the time when the processes are forcibly halted with a kill signal.
// Set this value longer than the expected cleanup time for your process.
// Defaults to 30 seconds.
// +optional
TerminationGracePeriodSeconds *int64 `json:"terminationGracePeriodSeconds,omitempty" protobuf:"varint,4,opt,name=terminationGracePeriodSeconds"`
// Optional duration in seconds the pod may be active on the node relative to
// StartTime before the system will actively try to mark it failed and kill associated containers.
// Value must be a positive integer.
// +optional
ActiveDeadlineSeconds *int64 `json:"activeDeadlineSeconds,omitempty" protobuf:"varint,5,opt,name=activeDeadlineSeconds"`
// Set DNS policy for the pod.
// Defaults to "ClusterFirst".
// Valid values are 'ClusterFirstWithHostNet', 'ClusterFirst', 'Default' or 'None'.
// DNS parameters given in DNSConfig will be merged with the policy selected with DNSPolicy.
// To have DNS options set along with hostNetwork, you have to specify DNS policy
// explicitly to 'ClusterFirstWithHostNet'.
// +optional
DNSPolicy v1.DNSPolicy `json:"dnsPolicy,omitempty" protobuf:"bytes,6,opt,name=dnsPolicy,casttype=DNSPolicy"`
// NodeSelector is a selector which must be true for the pod to fit on a node.
// Selector which must match a node's labels for the pod to be scheduled on that node.
// More info: https://kubernetes.io/docs/concepts/configuration/assign-pod-node/
// +optional
NodeSelector map[string]string `json:"nodeSelector,omitempty" protobuf:"bytes,7,rep,name=nodeSelector"`
// ServiceAccountName is the name of the ServiceAccount to use to run this pod.
// More info: https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/
// +optional
ServiceAccountName string `json:"serviceAccountName,omitempty" protobuf:"bytes,8,opt,name=serviceAccountName"`
// DeprecatedServiceAccount is a deprecated alias for ServiceAccountName.
// Deprecated: Use serviceAccountName instead.
// +k8s:conversion-gen=false
// +optional
DeprecatedServiceAccount string `json:"serviceAccount,omitempty" protobuf:"bytes,9,opt,name=serviceAccount"`
// AutomountServiceAccountToken indicates whether a service account token should be automatically mounted.
// +optional
AutomountServiceAccountToken *bool `json:"automountServiceAccountToken,omitempty" protobuf:"varint,21,opt,name=automountServiceAccountToken"`
// NodeName is a request to schedule this pod onto a specific node. If it is non-empty,
// the scheduler simply schedules this pod onto that node, assuming that it fits resource
// requirements.
// +optional
NodeName string `json:"nodeName,omitempty" protobuf:"bytes,10,opt,name=nodeName"`
// Host networking requested for this pod. Use the host's network namespace.
// If this option is set, the ports that will be used must be specified.
// Default to false.
// +k8s:conversion-gen=false
// +optional
HostNetwork bool `json:"hostNetwork,omitempty" protobuf:"varint,11,opt,name=hostNetwork"`
// Use the host's pid namespace.
// Optional: Default to false.
// +k8s:conversion-gen=false
// +optional
HostPID bool `json:"hostPID,omitempty" protobuf:"varint,12,opt,name=hostPID"`
// Use the host's ipc namespace.
// Optional: Default to false.
// +k8s:conversion-gen=false
// +optional
HostIPC bool `json:"hostIPC,omitempty" protobuf:"varint,13,opt,name=hostIPC"`
// When this is set containers will be able to view and signal processes from other containers
// in the same pod, and the first process in each container will not be assigned PID 1.
// HostPID and ShareProcessNamespace cannot both be set.
// Optional: Default to false.
// +k8s:conversion-gen=false
// +optional
ShareProcessNamespace *bool `json:"shareProcessNamespace,omitempty" protobuf:"varint,27,opt,name=shareProcessNamespace"`
// SecurityContext holds pod-level security attributes and common container settings.
// Optional: Defaults to empty. See type description for default values of each field.
// +optional
SecurityContext *v1.PodSecurityContext `json:"securityContext,omitempty" protobuf:"bytes,14,opt,name=securityContext"`
// ImagePullSecrets is an optional list of references to secrets in the same namespace to use for pulling any of the images used by this PodSpec.
// If specified, these secrets will be passed to individual puller implementations for them to use. For example,
// in the case of docker, only DockerConfig type secrets are honored.
// More info: https://kubernetes.io/docs/concepts/containers/images#specifying-imagepullsecrets-on-a-pod
// +optional
// +patchMergeKey=name
// +patchStrategy=merge
ImagePullSecrets []v1.LocalObjectReference `json:"imagePullSecrets,omitempty" patchStrategy:"merge" patchMergeKey:"name" protobuf:"bytes,15,rep,name=imagePullSecrets"`
// Specifies the hostname of the Pod
// If not specified, the pod's hostname will be set to a system-defined value.
// +optional
Hostname string `json:"hostname,omitempty" protobuf:"bytes,16,opt,name=hostname"`
// If specified, the fully qualified Pod hostname will be "<hostname>.<subdomain>.<pod namespace>.svc.<cluster domain>".
// If not specified, the pod will not have a domainname at all.
// +optional
Subdomain string `json:"subdomain,omitempty" protobuf:"bytes,17,opt,name=subdomain"`
// If specified, the pod's scheduling constraints
// +optional
Affinity *v1.Affinity `json:"affinity,omitempty" protobuf:"bytes,18,opt,name=affinity"`
// If specified, the pod will be dispatched by specified scheduler.
// If not specified, the pod will be dispatched by default scheduler.
// +optional
SchedulerName string `json:"schedulerName,omitempty" protobuf:"bytes,19,opt,name=schedulerName"`
// If specified, the pod's tolerations.
// +optional
Tolerations []v1.Toleration `json:"tolerations,omitempty" protobuf:"bytes,22,opt,name=tolerations"`
// HostAliases is an optional list of hosts and IPs that will be injected into the pod's hosts
// file if specified. This is only valid for non-hostNetwork pods.
// +optional
// +patchMergeKey=ip
// +patchStrategy=merge
HostAliases []v1.HostAlias `json:"hostAliases,omitempty" patchStrategy:"merge" patchMergeKey:"ip" protobuf:"bytes,23,rep,name=hostAliases"`
// If specified, indicates the pod's priority. "system-node-critical" and
// "system-cluster-critical" are two special keywords which indicate the
// highest priorities with the former being the highest priority. Any other
// name must be defined by creating a PriorityClass object with that name.
// If not specified, the pod priority will be default or zero if there is no
// default.
// +optional
PriorityClassName string `json:"priorityClassName,omitempty" protobuf:"bytes,24,opt,name=priorityClassName"`
// The priority value. Various system components use this field to find the
// priority of the pod. When Priority Admission Controller is enabled, it
// prevents users from setting this field. The admission controller populates
// this field from PriorityClassName.
// The higher the value, the higher the priority.
// +optional
Priority *int32 `json:"priority,omitempty" protobuf:"bytes,25,opt,name=priority"`
// Specifies the DNS parameters of a pod.
// Parameters specified here will be merged to the generated DNS
// configuration based on DNSPolicy.
// +optional
DNSConfig *v1.PodDNSConfig `json:"dnsConfig,omitempty" protobuf:"bytes,26,opt,name=dnsConfig"`
// If specified, all readiness gates will be evaluated for pod readiness.
// A pod is ready when all its containers are ready AND
// all conditions specified in the readiness gates have status equal to "True"
// More info: https://git.k8s.io/enhancements/keps/sig-network/0007-pod-ready%2B%2B.md
// +optional
ReadinessGates []v1.PodReadinessGate `json:"readinessGates,omitempty" protobuf:"bytes,28,opt,name=readinessGates"`
// RuntimeClassName refers to a RuntimeClass object in the node.k8s.io group, which should be used
// to run this pod. If no RuntimeClass resource matches the named class, the pod will not be run.
// If unset or empty, the "legacy" RuntimeClass will be used, which is an implicit class with an
// empty definition that uses the default runtime handler.
// More info: https://git.k8s.io/enhancements/keps/sig-node/runtime-class.md
// This is a beta feature as of Kubernetes v1.14.
// +optional
RuntimeClassName *string `json:"runtimeClassName,omitempty" protobuf:"bytes,29,opt,name=runtimeClassName"`
// EnableServiceLinks indicates whether information about services should be injected into pod's
// environment variables, matching the syntax of Docker links.
// Optional: Defaults to true.
// +optional
EnableServiceLinks *bool `json:"enableServiceLinks,omitempty" protobuf:"varint,30,opt,name=enableServiceLinks"`
// PreemptionPolicy is the Policy for preempting pods with lower priority.
// One of Never, PreemptLowerPriority.
// Defaults to PreemptLowerPriority if unset.
// This field is beta-level, gated by the NonPreemptingPriority feature-gate.
// +optional
PreemptionPolicy *v1.PreemptionPolicy `json:"preemptionPolicy,omitempty" protobuf:"bytes,31,opt,name=preemptionPolicy"`
// Overhead represents the resource overhead associated with running a pod for a given RuntimeClass.
// This field will be autopopulated at admission time by the RuntimeClass admission controller. If
// the RuntimeClass admission controller is enabled, overhead must not be set in Pod create requests.
// The RuntimeClass admission controller will reject Pod create requests which have the overhead already
// set. If RuntimeClass is configured and selected in the PodSpec, Overhead will be set to the value
// defined in the corresponding RuntimeClass, otherwise it will remain unset and treated as zero.
// More info: https://git.k8s.io/enhancements/keps/sig-node/20190226-pod-overhead.md
// This field is alpha-level as of Kubernetes v1.16, and is only honored by servers that enable the PodOverhead feature.
// +optional
Overhead v1.ResourceList `json:"overhead,omitempty" protobuf:"bytes,32,opt,name=overhead"`
// TopologySpreadConstraints describes how a group of pods ought to spread across topology
// domains. Scheduler will schedule pods in a way which abides by the constraints.
// All topologySpreadConstraints are ANDed.
// +optional
// +patchMergeKey=topologyKey
// +patchStrategy=merge
// +listType=map
// +listMapKey=topologyKey
// +listMapKey=whenUnsatisfiable
TopologySpreadConstraints []v1.TopologySpreadConstraint `` /* 147-byte string literal not displayed */
// If true the pod's hostname will be configured as the pod's FQDN, rather than the leaf name (the default).
// In Linux containers, this means setting the FQDN in the hostname field of the kernel (the nodename field of struct utsname).
// In Windows containers, this means setting the registry value of hostname for the registry key HKEY_LOCAL_MACHINE\\SYSTEM\\CurrentControlSet\\Services\\Tcpip\\Parameters to FQDN.
// If a pod does not have FQDN, this has no effect.
// Default to false.
// +optional
SetHostnameAsFQDN *bool `json:"setHostnameAsFQDN,omitempty" protobuf:"varint,35,opt,name=setHostnameAsFQDN"`
}
PodSpec is a description of a pod.
func (*PodSpec) DeepCopy ¶
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new PodSpec.
func (*PodSpec) DeepCopyInto ¶
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type PredictorConfig ¶
type PredictorConfig struct {
// ContainerImage is the predictor docker image name (repository, without tag).
ContainerImage string `json:"image"`
// DefaultImageVersion is the default predictor docker image version (tag) used on cpu.
DefaultImageVersion string `json:"defaultImageVersion"`
// DefaultGpuImageVersion is the default predictor docker image version (tag) used on gpu.
DefaultGpuImageVersion string `json:"defaultGpuImageVersion"`
// DefaultTimeout is the default timeout of the predictor for serving a request, in seconds.
// The ",string" option makes it (un)marshal as a JSON string rather than a number.
DefaultTimeout int64 `json:"defaultTimeout,string,omitempty"`
// MultiModelServer flags whether this server supports multi-model serving.
// NOTE(review): ",boolean" is not a tag option recognized by encoding/json
// (only "string" and "omitempty" are); it is silently ignored — TODO confirm intent.
MultiModelServer bool `json:"multiModelServer,boolean,omitempty"`
// SupportedFrameworks lists the model frameworks the model agent is able to run.
SupportedFrameworks []string `json:"supportedFrameworks"`
}
+kubebuilder:object:generate=false
type PredictorExtensionSpec ¶
// PredictorExtensionSpec defines configuration shared across all predictor frameworks.
type PredictorExtensionSpec struct {
// StorageURI points to the location of the trained model which is mounted onto the pod.
// +optional
StorageURI *string `json:"storageUri,omitempty"`
// RuntimeVersion is the runtime version (image tag) of the predictor docker image.
// +optional
RuntimeVersion *string `json:"runtimeVersion,omitempty"`
// ProtocolVersion is the inference protocol version to use for the predictor (i.e. v1 or v2).
// +optional
ProtocolVersion *constants.InferenceServiceProtocol `json:"protocolVersion,omitempty"`
// Container enables per-field overrides for the predictor's container.
// Each framework populates different defaults in the underlying container spec.
// +optional
v1.Container `json:",inline"`
}
PredictorExtensionSpec defines configuration shared across all predictor frameworks
func (*PredictorExtensionSpec) DeepCopy ¶
func (in *PredictorExtensionSpec) DeepCopy() *PredictorExtensionSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new PredictorExtensionSpec.
func (*PredictorExtensionSpec) DeepCopyInto ¶
func (in *PredictorExtensionSpec) DeepCopyInto(out *PredictorExtensionSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
type PredictorImplementation ¶ added in v0.6.0
// PredictorImplementation defines common functions for all predictors,
// e.g. Tensorflow, Triton, etc.
type PredictorImplementation interface {
// IsFrameworkSupported reports whether this predictor can serve models
// of the given framework, according to the supplied configuration.
IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
}
PredictorImplementation defines common functions for all predictors, e.g. Tensorflow, Triton, etc. +kubebuilder:object:generate=false
func NonNilPredictors ¶ added in v0.6.0
func NonNilPredictors(objects []PredictorImplementation) (results []PredictorImplementation)
type PredictorProtocols ¶
// PredictorProtocols holds per-protocol predictor configurations for a
// framework that supports more than one inference protocol.
type PredictorProtocols struct {
// V1 is the predictor configuration for inference protocol v1, if supported.
V1 *PredictorConfig `json:"v1,omitempty"`
// V2 is the predictor configuration for inference protocol v2, if supported.
V2 *PredictorConfig `json:"v2,omitempty"`
}
+kubebuilder:object:generate=false
type PredictorSpec ¶
// PredictorSpec defines the configuration for a predictor. The framework
// fields below follow a "1-of" semantic: users must specify exactly one.
type PredictorSpec struct {
// Spec for SKLearn model server
SKLearn *SKLearnSpec `json:"sklearn,omitempty"`
// Spec for XGBoost model server
XGBoost *XGBoostSpec `json:"xgboost,omitempty"`
// Spec for TFServing (https://github.com/tensorflow/serving)
Tensorflow *TFServingSpec `json:"tensorflow,omitempty"`
// Spec for TorchServe (https://pytorch.org/serve)
PyTorch *TorchServeSpec `json:"pytorch,omitempty"`
// Spec for Triton Inference Server (https://github.com/triton-inference-server/server)
Triton *TritonSpec `json:"triton,omitempty"`
// Spec for ONNX runtime (https://github.com/microsoft/onnxruntime)
ONNX *ONNXRuntimeSpec `json:"onnx,omitempty"`
// Spec for PMML (http://dmg.org/pmml/v4-1/GeneralStructure.html)
PMML *PMMLSpec `json:"pmml,omitempty"`
// Spec for LightGBM model server
LightGBM *LightGBMSpec `json:"lightgbm,omitempty"`
// Spec for Paddle model server (https://github.com/PaddlePaddle/Serving)
Paddle *PaddleServerSpec `json:"paddle,omitempty"`
// This embedded spec is dual purpose. <br />
// 1) Provide a full PodSpec for a custom predictor.
// The field PodSpec.Containers is mutually exclusive with other predictors (i.e. TFServing). <br />
// 2) Provide a predictor (i.e. TFServing) and specify PodSpec
// overrides; you must not provide PodSpec.Containers in this case. <br />
PodSpec `json:",inline"`
// ComponentExtensionSpec defines the deployment configurations for a predictor
// (e.g. scaling, timeout, logger, batcher).
ComponentExtensionSpec `json:",inline"`
}
PredictorSpec defines the configuration for a predictor. The following fields follow a "1-of" semantic. Users must specify exactly one spec.
func (*PredictorSpec) DeepCopy ¶
func (in *PredictorSpec) DeepCopy() *PredictorSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new PredictorSpec.
func (*PredictorSpec) DeepCopyInto ¶
func (in *PredictorSpec) DeepCopyInto(out *PredictorSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*PredictorSpec) GetExtensions ¶
func (s *PredictorSpec) GetExtensions() *ComponentExtensionSpec
GetExtensions returns the extensions for the component
func (*PredictorSpec) GetImplementation ¶
func (s *PredictorSpec) GetImplementation() ComponentImplementation
GetImplementation returns the implementation for the component
func (*PredictorSpec) GetImplementations ¶
func (s *PredictorSpec) GetImplementations() []ComponentImplementation
GetImplementations returns the implementations for the component
func (*PredictorSpec) GetPredictorImplementation ¶ added in v0.6.0
func (s *PredictorSpec) GetPredictorImplementation() *PredictorImplementation
func (*PredictorSpec) GetPredictorImplementations ¶ added in v0.6.0
func (s *PredictorSpec) GetPredictorImplementations() []PredictorImplementation
GetPredictorImplementations returns the implementations for the predictor
type PredictorsConfig ¶
// PredictorsConfig maps each supported model server framework to its
// configuration. Frameworks that support both inference protocols use
// PredictorProtocols (per-protocol configs); the rest use a single
// PredictorConfig.
type PredictorsConfig struct {
Tensorflow PredictorConfig `json:"tensorflow,omitempty"`
Triton PredictorConfig `json:"triton,omitempty"`
XGBoost PredictorProtocols `json:"xgboost,omitempty"`
SKlearn PredictorProtocols `json:"sklearn,omitempty"`
PyTorch PredictorProtocols `json:"pytorch,omitempty"`
ONNX PredictorConfig `json:"onnx,omitempty"`
PMML PredictorConfig `json:"pmml,omitempty"`
LightGBM PredictorConfig `json:"lightgbm,omitempty"`
Paddle PredictorConfig `json:"paddle,omitempty"`
}
+kubebuilder:object:generate=false
type SKLearnSpec ¶
// SKLearnSpec defines arguments for configuring SKLearn model serving.
type SKLearnSpec struct {
// PredictorExtensionSpec contains fields shared across all predictors.
PredictorExtensionSpec `json:",inline"`
}
SKLearnSpec defines arguments for configuring SKLearn model serving.
func (*SKLearnSpec) DeepCopy ¶
func (in *SKLearnSpec) DeepCopy() *SKLearnSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new SKLearnSpec.
func (*SKLearnSpec) DeepCopyInto ¶
func (in *SKLearnSpec) DeepCopyInto(out *SKLearnSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*SKLearnSpec) Default ¶
func (k *SKLearnSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*SKLearnSpec) GetContainer ¶
func (k *SKLearnSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*SKLearnSpec) GetProtocol ¶
func (k *SKLearnSpec) GetProtocol() constants.InferenceServiceProtocol
func (*SKLearnSpec) GetStorageUri ¶
func (k *SKLearnSpec) GetStorageUri() *string
func (*SKLearnSpec) IsFrameworkSupported ¶ added in v0.6.0
func (k *SKLearnSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*SKLearnSpec) IsMMS ¶
func (k *SKLearnSpec) IsMMS(config *InferenceServicesConfig) bool
func (*SKLearnSpec) Validate ¶
func (k *SKLearnSpec) Validate() error
Validate returns an error if invalid
type TFServingSpec ¶
// TFServingSpec defines arguments for configuring Tensorflow model serving.
type TFServingSpec struct {
// PredictorExtensionSpec contains fields shared across all predictors.
PredictorExtensionSpec `json:",inline"`
}
TFServingSpec defines arguments for configuring Tensorflow model serving.
func (*TFServingSpec) DeepCopy ¶
func (in *TFServingSpec) DeepCopy() *TFServingSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new TFServingSpec.
func (*TFServingSpec) DeepCopyInto ¶
func (in *TFServingSpec) DeepCopyInto(out *TFServingSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*TFServingSpec) Default ¶
func (t *TFServingSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*TFServingSpec) GetContainer ¶
func (t *TFServingSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*TFServingSpec) GetProtocol ¶
func (t *TFServingSpec) GetProtocol() constants.InferenceServiceProtocol
func (*TFServingSpec) GetStorageUri ¶
func (t *TFServingSpec) GetStorageUri() *string
func (*TFServingSpec) IsFrameworkSupported ¶ added in v0.6.0
func (t *TFServingSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*TFServingSpec) IsMMS ¶
func (t *TFServingSpec) IsMMS(config *InferenceServicesConfig) bool
func (*TFServingSpec) Validate ¶
func (t *TFServingSpec) Validate() error
Validate returns an error if invalid
type TorchServeSpec ¶
// TorchServeSpec defines arguments for configuring PyTorch model serving.
type TorchServeSpec struct {
// ModelClassName: when this field is specified, KFS chooses the KFServer
// implementation; otherwise KFS uses the TorchServe implementation.
// +optional
ModelClassName string `json:"modelClassName,omitempty"`
// PredictorExtensionSpec contains fields shared across all predictors.
PredictorExtensionSpec `json:",inline"`
}
TorchServeSpec defines arguments for configuring PyTorch model serving.
func (*TorchServeSpec) DeepCopy ¶
func (in *TorchServeSpec) DeepCopy() *TorchServeSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new TorchServeSpec.
func (*TorchServeSpec) DeepCopyInto ¶
func (in *TorchServeSpec) DeepCopyInto(out *TorchServeSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*TorchServeSpec) Default ¶
func (t *TorchServeSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*TorchServeSpec) GetContainer ¶
func (t *TorchServeSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*TorchServeSpec) GetContainerV1 ¶
func (t *TorchServeSpec) GetContainerV1(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
func (*TorchServeSpec) GetContainerV2 ¶
func (t *TorchServeSpec) GetContainerV2(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
func (*TorchServeSpec) GetProtocol ¶
func (t *TorchServeSpec) GetProtocol() constants.InferenceServiceProtocol
func (*TorchServeSpec) GetStorageUri ¶
func (t *TorchServeSpec) GetStorageUri() *string
func (*TorchServeSpec) IsFrameworkSupported ¶ added in v0.6.0
func (t *TorchServeSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*TorchServeSpec) IsMMS ¶
func (t *TorchServeSpec) IsMMS(config *InferenceServicesConfig) bool
func (*TorchServeSpec) Validate ¶
func (t *TorchServeSpec) Validate() error
Validate returns an error if invalid
type TransformerConfig ¶
// TransformerConfig holds the image settings for a transformer server.
type TransformerConfig struct {
// ContainerImage is the transformer docker image name (repository, without tag).
ContainerImage string `json:"image"`
// DefaultImageVersion is the default transformer docker image version (tag).
DefaultImageVersion string `json:"defaultImageVersion"`
}
+kubebuilder:object:generate=false
+kubebuilder:object:generate=false
type TransformerSpec ¶
// TransformerSpec defines the transformer service for pre/post processing.
type TransformerSpec struct {
// This embedded spec is dual purpose. <br />
// 1) Provide a full PodSpec for a custom transformer.
// The field PodSpec.Containers is mutually exclusive with other transformers. <br />
// 2) Provide a transformer and specify PodSpec
// overrides; you must not provide PodSpec.Containers in this case. <br />
PodSpec `json:",inline"`
// ComponentExtensionSpec defines the deployment configurations for a transformer
// (e.g. scaling, timeout, logger, batcher).
ComponentExtensionSpec `json:",inline"`
}
TransformerSpec defines transformer service for pre/post processing
func (*TransformerSpec) DeepCopy ¶
func (in *TransformerSpec) DeepCopy() *TransformerSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new TransformerSpec.
func (*TransformerSpec) DeepCopyInto ¶
func (in *TransformerSpec) DeepCopyInto(out *TransformerSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*TransformerSpec) GetExtensions ¶
func (s *TransformerSpec) GetExtensions() *ComponentExtensionSpec
GetExtensions returns the extensions for the component
func (*TransformerSpec) GetImplementation ¶
func (s *TransformerSpec) GetImplementation() ComponentImplementation
GetImplementation returns the implementation for the component
func (*TransformerSpec) GetImplementations ¶
func (s *TransformerSpec) GetImplementations() []ComponentImplementation
GetImplementations returns the implementations for the component
type TransformersConfig ¶
// TransformersConfig holds the configuration for the supported transformers.
type TransformersConfig struct {
// Feast is the configuration for the Feast feature-store transformer.
Feast TransformerConfig `json:"feast,omitempty"`
}
+kubebuilder:object:generate=false
type TritonSpec ¶
// TritonSpec defines arguments for configuring Triton model serving.
type TritonSpec struct {
// PredictorExtensionSpec contains fields shared across all predictors.
PredictorExtensionSpec `json:",inline"`
}
TritonSpec defines arguments for configuring Triton model serving.
func (*TritonSpec) DeepCopy ¶
func (in *TritonSpec) DeepCopy() *TritonSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new TritonSpec.
func (*TritonSpec) DeepCopyInto ¶
func (in *TritonSpec) DeepCopyInto(out *TritonSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*TritonSpec) Default ¶
func (t *TritonSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*TritonSpec) GetContainer ¶
func (t *TritonSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*TritonSpec) GetProtocol ¶
func (t *TritonSpec) GetProtocol() constants.InferenceServiceProtocol
func (*TritonSpec) GetStorageUri ¶
func (t *TritonSpec) GetStorageUri() *string
func (*TritonSpec) IsFrameworkSupported ¶ added in v0.6.0
func (t *TritonSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*TritonSpec) IsMMS ¶
func (t *TritonSpec) IsMMS(config *InferenceServicesConfig) bool
func (*TritonSpec) Validate ¶
func (t *TritonSpec) Validate() error
Validate returns an error if invalid
type XGBoostSpec ¶
// XGBoostSpec defines arguments for configuring XGBoost model serving.
type XGBoostSpec struct {
// PredictorExtensionSpec contains fields shared across all predictors.
PredictorExtensionSpec `json:",inline"`
}
XGBoostSpec defines arguments for configuring XGBoost model serving.
func (*XGBoostSpec) DeepCopy ¶
func (in *XGBoostSpec) DeepCopy() *XGBoostSpec
DeepCopy is an autogenerated deepcopy function, copying the receiver, creating a new XGBoostSpec.
func (*XGBoostSpec) DeepCopyInto ¶
func (in *XGBoostSpec) DeepCopyInto(out *XGBoostSpec)
DeepCopyInto is an autogenerated deepcopy function, copying the receiver, writing into out. in must be non-nil.
func (*XGBoostSpec) Default ¶
func (x *XGBoostSpec) Default(config *InferenceServicesConfig)
Default sets defaults on the resource
func (*XGBoostSpec) GetContainer ¶
func (x *XGBoostSpec) GetContainer(metadata metav1.ObjectMeta, extensions *ComponentExtensionSpec, config *InferenceServicesConfig) *v1.Container
GetContainer transforms the resource into a container spec
func (*XGBoostSpec) GetProtocol ¶
func (x *XGBoostSpec) GetProtocol() constants.InferenceServiceProtocol
func (*XGBoostSpec) GetStorageUri ¶
func (x *XGBoostSpec) GetStorageUri() *string
func (*XGBoostSpec) IsFrameworkSupported ¶ added in v0.6.0
func (x *XGBoostSpec) IsFrameworkSupported(framework string, config *InferenceServicesConfig) bool
func (*XGBoostSpec) IsMMS ¶
func (x *XGBoostSpec) IsMMS(config *InferenceServicesConfig) bool
func (*XGBoostSpec) Validate ¶
func (x *XGBoostSpec) Validate() error
Validate returns an error if invalid
Source Files
¶
- component.go
- configmap.go
- doc.go
- explainer.go
- explainer_aix360.go
- explainer_alibi.go
- explainer_art.go
- explainer_custom.go
- inference_service.go
- inference_service_conversion.go
- inference_service_defaults.go
- inference_service_status.go
- inference_service_validation.go
- openapi_generated.go
- podspec.go
- predictor.go
- predictor_custom.go
- predictor_lightgbm.go
- predictor_onnxruntime.go
- predictor_paddle.go
- predictor_pmml.go
- predictor_sklearn.go
- predictor_tfserving.go
- predictor_torchserve.go
- predictor_triton.go
- predictor_xgboost.go
- transformer.go
- transformer_custom.go
- v1beta1.go
- zz_generated.deepcopy.go