Documentation ¶
Overview ¶
Create a Llama inference endpoint.
Create an inference endpoint to perform an inference task with the `llama` service.
Index ¶
- Variables
- type NewPutLlama
- type PutLlama
- func (r *PutLlama) ChunkingSettings(chunkingsettings types.InferenceChunkingSettingsVariant) *PutLlama
- func (r PutLlama) Do(providedCtx context.Context) (*Response, error)
- func (r *PutLlama) ErrorTrace(errortrace bool) *PutLlama
- func (r *PutLlama) FilterPath(filterpaths ...string) *PutLlama
- func (r *PutLlama) Header(key, value string) *PutLlama
- func (r *PutLlama) HttpRequest(ctx context.Context) (*http.Request, error)
- func (r *PutLlama) Human(human bool) *PutLlama
- func (r PutLlama) Perform(providedCtx context.Context) (*http.Response, error)
- func (r *PutLlama) Pretty(pretty bool) *PutLlama
- func (r *PutLlama) Raw(raw io.Reader) *PutLlama
- func (r *PutLlama) Request(req *Request) *PutLlama
- func (r *PutLlama) Service(service llamaservicetype.LlamaServiceType) *PutLlama
- func (r *PutLlama) ServiceSettings(servicesettings types.LlamaServiceSettingsVariant) *PutLlama
- func (r *PutLlama) Timeout(duration string) *PutLlama
- type Request
- type Response
Constants ¶
This section is empty.
Variables ¶
var ErrBuildPath = errors.New("cannot build path, check for missing path parameters")
ErrBuildPath is returned when required path parameters are missing while building the request.
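Callers can distinguish this failure mode with errors.Is. A minimal sketch, assuming `req` is an already-configured *putllama.PutLlama (see the example under New below):

res, err := req.Do(context.Background())
if errors.Is(err, putllama.ErrBuildPath) {
	// A required path parameter (task type or inference ID) was never set.
	log.Fatalf("request misconfigured: %v", err)
}
if err != nil {
	log.Fatalf("request failed: %v", err)
}
_ = res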
Functions ¶
This section is empty.
Types ¶
type NewPutLlama ¶
NewPutLlama is a type alias for the constructor function exposed by the client index.
func NewPutLlamaFunc ¶
func NewPutLlamaFunc(tp elastictransport.Interface) NewPutLlama
NewPutLlamaFunc returns a new instance of PutLlama with the provided transport. Used in the index of the library, this allows every API to be retrieved in one place.
type PutLlama ¶
type PutLlama struct {
// contains filtered or unexported fields
}
func New ¶
func New(tp elastictransport.Interface) *PutLlama
Create a Llama inference endpoint.
Create an inference endpoint to perform an inference task with the `llama` service.
https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-inference-put-llama
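A minimal end-to-end sketch. The import paths assume the v9 typed-API layout of the Go client, the constructor returned by NewPutLlamaFunc is assumed to take the task type and inference ID as its path parameters, and the LlamaServiceSettings field names (Url, ModelId) and their values are illustrative assumptions; verify them against the types package and the linked API reference.

package main

import (
	"context"
	"log"
	"net/url"

	"github.com/elastic/elastic-transport-go/v8/elastictransport"
	"github.com/elastic/go-elasticsearch/v9/typedapi/inference/putllama"
	"github.com/elastic/go-elasticsearch/v9/typedapi/types"
	"github.com/elastic/go-elasticsearch/v9/typedapi/types/enums/llamaservicetype"
)

func main() {
	u, _ := url.Parse("http://localhost:9200") // error ignored for brevity
	tp, err := elastictransport.New(elastictransport.Config{URLs: []*url.URL{u}})
	if err != nil {
		log.Fatal(err)
	}

	// NewPutLlamaFunc returns the constructor used by the client index; the
	// (task type, inference ID) argument order is an assumption here.
	req := putllama.NewPutLlamaFunc(tp)("completion", "my-llama-endpoint")

	res, err := req.
		Service(llamaservicetype.Llama). // assumed enum value for the `llama` service
		ServiceSettings(&types.LlamaServiceSettings{
			// Placeholder settings; consult the API reference for the real schema.
			Url:     "http://localhost:8321/v1/openai/v1",
			ModelId: "llama3.2:3b",
		}).
		Do(context.Background())
	if err != nil {
		log.Fatal(err)
	}
	log.Printf("created inference endpoint %q (service %s)", res.InferenceId, res.Service)
}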
func (*PutLlama) ChunkingSettings ¶
func (r *PutLlama) ChunkingSettings(chunkingsettings types.InferenceChunkingSettingsVariant) *PutLlama
The chunking configuration object. API name: chunking_settings
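A sketch of attaching chunking settings, reusing `req` from the example above; the InferenceChunkingSettings field names used here (MaxChunkSize, Overlap, Strategy) are assumptions drawn from the generic inference chunking options, not confirmed for this package.

maxChunk := 250
overlap := 100
strategy := "word"
req = req.ChunkingSettings(&types.InferenceChunkingSettings{
	MaxChunkSize: &maxChunk, // assumed field: maximum words per chunk
	Overlap:      &overlap,  // assumed field: words of overlap between chunks
	Strategy:     &strategy, // assumed field: "word" or "sentence"
})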
func (PutLlama) Do ¶
Do runs the request through the transport, handles the response and returns a putllama.Response.
func (*PutLlama) ErrorTrace ¶
ErrorTrace When set to `true`, Elasticsearch will include the full stack trace of errors when they occur. API name: error_trace
func (*PutLlama) FilterPath ¶
FilterPath Comma-separated list of filters in dot notation which reduce the response returned by Elasticsearch. API name: filter_path
func (*PutLlama) HttpRequest ¶
HttpRequest returns the http.Request object built from the given parameters.
func (*PutLlama) Human ¶
Human When set to `true`, statistics are returned in a format suitable for humans. For example, `"exists_time": "1h"` for humans and `"exists_time_in_millis": 3600000` for computers. When disabled, the human-readable values are omitted, which makes sense for responses consumed only by machines. API name: human
func (PutLlama) Perform ¶
Perform runs the http.Request through the provided transport and returns an http.Response.
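Perform is the lower-level counterpart to Do: it returns the raw *http.Response and leaves status handling and body closing to the caller. A sketch, assuming the imports and `req` from the example above plus "io":

httpRes, err := req.Perform(context.Background())
if err != nil {
	log.Fatal(err)
}
defer httpRes.Body.Close()

body, err := io.ReadAll(httpRes.Body)
if err != nil {
	log.Fatal(err)
}
log.Printf("status=%d body=%s", httpRes.StatusCode, body)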
func (*PutLlama) Pretty ¶
Pretty If set to `true`, the returned JSON will be "pretty-formatted". Use this option only for debugging. API name: pretty
func (*PutLlama) Raw ¶
Raw takes a JSON payload as input, which is then passed to the http.Request. If specified, Raw takes precedence over the Request method.
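A sketch of supplying the body as raw JSON instead of a typed Request, assuming `req` from the example above plus the "strings" import; the JSON keys and values are placeholders, not a confirmed schema.

payload := `{
  "service": "llama",
  "service_settings": {
    "url": "http://localhost:8321/v1/openai/v1",
    "model_id": "llama3.2:3b"
  }
}`
res, err := req.Raw(strings.NewReader(payload)).Do(context.Background())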
func (*PutLlama) Service ¶
func (r *PutLlama) Service(service llamaservicetype.LlamaServiceType) *PutLlama
The type of service supported for the specified task type. In this case, `llama`. API name: service
func (*PutLlama) ServiceSettings ¶
func (r *PutLlama) ServiceSettings(servicesettings types.LlamaServiceSettingsVariant) *PutLlama
Settings used to install the inference model. These settings are specific to the `llama` service. API name: service_settings
type Request ¶
type Request struct {
// ChunkingSettings The chunking configuration object.
ChunkingSettings *types.InferenceChunkingSettings `json:"chunking_settings,omitempty"`
// Service The type of service supported for the specified task type. In this case,
// `llama`.
Service llamaservicetype.LlamaServiceType `json:"service"`
// ServiceSettings Settings used to install the inference model. These settings are specific to
// the `llama` service.
ServiceSettings types.LlamaServiceSettings `json:"service_settings"`
}
Request holds the request body struct for the package putllama
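As an alternative to the builder methods, a caller can populate the Request struct directly and hand it to the Request method. A sketch, reusing `req` and the imports from the example above; the field values are placeholders.

body := &putllama.Request{
	Service: llamaservicetype.Llama, // assumed enum value for the `llama` service
	ServiceSettings: types.LlamaServiceSettings{
		// Placeholder settings; see the API reference for the real schema.
		Url:     "http://localhost:8321/v1/openai/v1",
		ModelId: "llama3.2:3b",
	},
}
res, err := req.Request(body).Do(context.Background())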
type Response ¶
type Response struct {
// ChunkingSettings Chunking configuration object
ChunkingSettings *types.InferenceChunkingSettings `json:"chunking_settings,omitempty"`
// InferenceId The inference Id
InferenceId string `json:"inference_id"`
// Service The service type
Service string `json:"service"`
// ServiceSettings Settings specific to the service
ServiceSettings json.RawMessage `json:"service_settings"`
// TaskSettings Task settings specific to the service and task type
TaskSettings json.RawMessage `json:"task_settings,omitempty"`
// TaskType The task type
TaskType tasktypellama.TaskTypeLlama `json:"task_type"`
}
Response holds the response body struct for the package putllama
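ServiceSettings and TaskSettings arrive as raw JSON, so inspecting them requires an explicit unmarshal. A sketch, assuming `req` from the example above plus the "encoding/json" import:

res, err := req.Do(context.Background())
if err != nil {
	log.Fatal(err)
}

var settings map[string]any
if err := json.Unmarshal(res.ServiceSettings, &settings); err != nil {
	log.Fatal(err)
}
log.Printf("endpoint %s (task type %s) settings: %v", res.InferenceId, res.TaskType, settings)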