Documentation
Overview
Package models defines commonly used LLM model identifier constants, grouped by vendor.
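The declarations below are untyped string constants, and the package declares no types of its own, so an identifier can be assigned to a plain string or to a caller-defined string type without conversion. A minimal sketch, assuming a local copy of one constant (the package's import path is not shown on this page) and a hypothetical ModelID type and Config struct:

```go
package main

import "fmt"

// Local copy of one identifier from this package, redeclared so the sketch
// compiles on its own; real code would import the package instead.
const OpenAIGPT4o = "gpt-4o"

// ModelID is a hypothetical caller-defined type, not part of this package.
type ModelID string

// Config is a hypothetical settings struct, not part of this package.
type Config struct {
	Model ModelID
}

// describe formats the configured model for logging.
func describe(c Config) string {
	return fmt.Sprintf("model=%s", c.Model)
}

func main() {
	var plain string = OpenAIGPT4o    // usable as an ordinary string
	cfg := Config{Model: OpenAIGPT4o} // untyped constant converts implicitly
	fmt.Println(plain, describe(cfg)) // gpt-4o model=gpt-4o
}
```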
Constants
const (
AnthropicClaudeOpus46 = "claude-opus-4-6"
AnthropicClaudeSonnet46 = "claude-sonnet-4-6"
AnthropicClaudeHaiku45 = "claude-haiku-4-5"
AnthropicClaudeOpus41 = "claude-opus-4-1"
AnthropicClaudeOpus41_20250805 = "claude-opus-4-1-20250805"
AnthropicClaudeOpus4 = "claude-opus-4-0"
AnthropicClaudeOpus4_20250514 = "claude-opus-4-20250514"
AnthropicClaudeSonnet4 = "claude-sonnet-4-0"
AnthropicClaudeSonnet4_20250514 = "claude-sonnet-4-20250514"
AnthropicClaudeHaiku4 = "claude-haiku-4-0"
AnthropicClaudeHaiku4_20251001 = "claude-haiku-4-20251001"
)
Anthropic Claude 4.x models.
const (
AnthropicClaude35Sonnet = "claude-3-5-sonnet-20241022"
AnthropicClaude35Sonnet20240620 = "claude-3-5-sonnet-20240620"
AnthropicClaude35Haiku = "claude-3-5-haiku-20241022"
)
Anthropic Claude 3.5 models.
const (
AnthropicClaude3Opus = "claude-3-opus-20240229"
AnthropicClaude3Sonnet = "claude-3-sonnet-20240229"
AnthropicClaude3Haiku = "claude-3-haiku-20240307"
)
Anthropic Claude 3 models.
const (
AnthropicClaude21 = "claude-2.1"
AnthropicClaude2 = "claude-2.0"
)
Anthropic Claude 2 legacy models.
const (
CohereCommandA032025 = "command-a-03-2025"
CohereCommandRPlus082024 = "command-r-plus-08-2024"
CohereCommandR082024 = "command-r-08-2024"
CohereCommandR7B122024 = "command-r7b-12-2024"
CohereCommandLight32024 = "command-light-03-2024"
)
Cohere command models.
const (
CohereEmbedV4 = "embed-v4.0"
CohereEmbedEnglishV3 = "embed-english-v3.0"
CohereEmbedMultilingualV3 = "embed-multilingual-v3.0"
CohereEmbedEnglishLightV3 = "embed-english-light-v3.0"
)
Cohere embed models.
const (
CohereRerankV35 = "rerank-v3.5"
CohereRerankV3 = "rerank-v3.0"
)
Cohere rerank models.
const (
// OpenAI vision models
VisionOpenAIGPT4o = OpenAIGPT4o
VisionOpenAIGPT4oMini = OpenAIGPT4oMini
VisionOpenAIGPT4Turbo = OpenAIGPT4Turbo
VisionOpenAIGPT4Vision = "gpt-4-vision-preview"
VisionOpenAIGPT4VisionHD = "gpt-4-turbo-with-vision"

// Anthropic vision models
VisionAnthropicClaude3Opus = AnthropicClaude3Opus
VisionAnthropicClaude3Sonnet = AnthropicClaude3Sonnet
VisionAnthropicClaude35Sonnet = AnthropicClaude35Sonnet

// Google vision models
VisionGoogleGemini15Pro = Gemini15Pro
VisionGoogleGemini15Flash = Gemini15Flash
VisionGoogleGemini20Flash = Gemini20Flash

// Mistral vision models
VisionMistralPixtralLarge = MistralPixtralLargeLatest
VisionMistralPixtral12B = MistralPixtral12BLatest

// Meta vision models
VisionMetaLlama32Vision = "llama-3.2-vision"

// xAI vision models
VisionXAIGrok2Vision = XAIGrok2Vision
)
Vision-capable models across providers.
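Most capability aliases are declared in terms of a vendor constant, so an alias and its target are the same string and compare equal. A minimal sketch, assuming local copies of a few declarations and a hypothetical supportsVision helper built on a lookup set:

```go
package main

import "fmt"

// Local copies mirroring declarations on this page, so the sketch compiles
// on its own; real code would import the package instead.
const (
	OpenAIGPT4o          = "gpt-4o"
	AnthropicClaude3Opus = "claude-3-opus-20240229"

	VisionOpenAIGPT4o          = OpenAIGPT4o
	VisionAnthropicClaude3Opus = AnthropicClaude3Opus
)

// visionModels is a hypothetical lookup set built from the Vision* aliases.
var visionModels = map[string]bool{
	VisionOpenAIGPT4o:          true,
	VisionAnthropicClaude3Opus: true,
}

// supportsVision reports whether model appears in the hypothetical set above.
func supportsVision(model string) bool {
	return visionModels[model]
}

func main() {
	fmt.Println(VisionOpenAIGPT4o == OpenAIGPT4o) // true
	fmt.Println(supportsVision(OpenAIGPT4o))      // true
	fmt.Println(supportsVision("gpt-3.5-turbo"))  // false
}
```

Because an alias and its target are the same constant value, listing both as keys in one map literal is a compile-time duplicate-key error, so a set like this should be built from one naming family only.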
const (
// OpenAI reasoning models
ReasoningOpenAIO1 = OpenAIO1
ReasoningOpenAIO1Mini = OpenAIO1Mini
ReasoningOpenAIO1Pro = OpenAIO1Pro
ReasoningOpenAIO3 = OpenAIO3
ReasoningOpenAIO3Mini = OpenAIO3Mini
ReasoningOpenAIO3Pro = OpenAIO3Pro

// DeepSeek reasoning models
ReasoningDeepSeekReasoner = DeepSeekReasoner

// Google reasoning models (Gemini 2.0 Pro Exp)
ReasoningGoogleGemini20ProExp = Gemini20ProExp
)
Reasoning-capable models (for complex problem-solving).
const (
// OpenAI lightweight models
FastOpenAIGPT4oMini = OpenAIGPT4oMini
FastOpenAIGPT35Turbo = OpenAIGPT35Turbo
FastOpenAIGPT41Mini = OpenAIGPT41Mini
FastOpenAIGPT41Nano = OpenAIGPT41Nano

// Anthropic lightweight models
FastAnthropicClaude35Haiku = AnthropicClaude35Haiku
FastAnthropicClaudeHaiku45 = AnthropicClaudeHaiku45

// Google lightweight models
FastGoogleGemini25FlashLite = Gemini25FlashLite
FastGoogleGemini20FlashLite = Gemini20FlashLite
FastGoogleGemini15Flash8B = Gemini15Flash8B

// Mistral lightweight models
FastMistralSmall = MistralSmallLatest
FastMistralMinistral3B = MistralMinistral3BLatest

// Qwen lightweight models
FastQwenTurbo = QwenTurbo

// Meta lightweight models
FastMetaLlama32_1B = MetaLlama32_1BInstruct
FastMetaLlama32_3B = MetaLlama32_3BInstruct
)
Fast/lightweight models for cost-effective inference.
const (
// OpenAI flagship models
PremiumOpenAIGPT4o = OpenAIGPT4o
PremiumOpenAIGPT4Turbo = OpenAIGPT4Turbo
PremiumOpenAIGPT5 = OpenAIGPT5
PremiumOpenAIGPT52Pro = OpenAIGPT52Pro

// Anthropic flagship models
PremiumAnthropicClaudeOpus46 = AnthropicClaudeOpus46
PremiumAnthropicClaudeSonnet46 = AnthropicClaudeSonnet46
PremiumAnthropicClaudeOpus41 = AnthropicClaudeOpus41

// Google flagship models
PremiumGoogleGemini25Pro = Gemini25Pro
PremiumGoogleGemini20ProExp = Gemini20ProExp

// Mistral flagship models
PremiumMistralLarge = MistralLargeLatest

// DeepSeek flagship models
PremiumDeepSeekV3 = DeepSeekV3

// Qwen flagship models
PremiumQwenMax = QwenMax

// Meta flagship models
PremiumMetaLlama31_405B = MetaLlama31_405B
)
High-performance models for complex tasks.
const (
// OpenAI code models
CodeOpenAIGPT4o = OpenAIGPT4o
CodeOpenAIGPT4Turbo = OpenAIGPT4Turbo
CodeOpenAIGPT35Turbo = OpenAIGPT35Turbo

// Mistral code models
CodeMistralCodestral = MistralCodestralLatest
CodeMistralCodestral2501 = MistralCodestral2501

// DeepSeek code models
CodeDeepSeekCoder = DeepSeekCoder
CodeDeepSeekCoderInstruct = DeepSeekCoderInstruct

// Meta code models
CodeMetaLlama31_70B = MetaLlama31_70B
)
Code generation specialized models.
const (
// OpenAI embedding models
EmbeddingOpenAITextEmbedding3Large = OpenAITextEmbedding3Large
EmbeddingOpenAITextEmbedding3Small = OpenAITextEmbedding3Small
EmbeddingOpenAITextEmbeddingAda002 = OpenAITextEmbeddingAda002

// Mistral embedding models
EmbeddingMistralEmbed = MistralEmbed
EmbeddingMistralCodestralEmbed = MistralCodestralEmbed

// Cohere embedding models
EmbeddingCohereEmbedV4 = CohereEmbedV4
EmbeddingCohereEmbedEnglishV3 = CohereEmbedEnglishV3
EmbeddingCohereEmbedMultilingualV3 = CohereEmbedMultilingualV3
)
Embedding models for semantic search and similarity.
const (
// Google multilingual models
MultilingualGoogleGemini25Pro = Gemini25Pro
MultilingualGoogleGemini15Pro = Gemini15Pro

// Mistral multilingual models
MultilingualMistralLarge = MistralLargeLatest

// Qwen multilingual models
MultilingualQwenMax = QwenMax

// Meta multilingual models
MultilingualMetaLlama31_70B = MetaLlama31_70B

// Cohere multilingual models
MultilingualCohereCommandRPlus = CohereCommandRPlus082024
)
Multilingual models with strong language support.
const (
// OpenAI long context models
LongContextOpenAIGPT4Turbo = OpenAIGPT4Turbo
LongContextOpenAIGPT4o = OpenAIGPT4o

// Anthropic long context models
LongContextAnthropicClaude35Sonnet = AnthropicClaude35Sonnet
LongContextAnthropicClaudeOpus41 = AnthropicClaudeOpus41

// Google long context models
LongContextGoogleGemini15Pro = Gemini15Pro

// Zhipu long context models
LongContextZhipuGLM4Long = ZhipuGLM4Long
)
Long context window models (100k+ tokens).
const (
// OpenAI function calling models
FunctionCallingOpenAIGPT4o = OpenAIGPT4o
FunctionCallingOpenAIGPT4Turbo = OpenAIGPT4Turbo
FunctionCallingOpenAIGPT35Turbo = OpenAIGPT35Turbo

// Anthropic function calling models
FunctionCallingAnthropicClaude35Sonnet = AnthropicClaude35Sonnet
FunctionCallingAnthropicClaudeOpus41 = AnthropicClaudeOpus41

// Google function calling models
FunctionCallingGoogleGemini25Pro = Gemini25Pro
FunctionCallingGoogleGemini15Pro = Gemini15Pro

// Mistral function calling models
FunctionCallingMistralLarge = MistralLargeLatest

// Qwen function calling models
FunctionCallingQwenMax = QwenMax
)
Models with function calling support.
const (
// Default general-purpose model
DefaultGeneralPurpose = OpenAIGPT4o

// Default fast model
DefaultFast = OpenAIGPT4oMini

// Default reasoning model
DefaultReasoning = OpenAIO1

// Default vision model
DefaultVision = OpenAIGPT4o

// Default code model
DefaultCode = OpenAIGPT4o

// Default embedding model
DefaultEmbedding = OpenAITextEmbedding3Small
)
Recommended default models by use case.
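One way to consume the Default* constants is a small lookup that falls back to the general-purpose model. This is a sketch only: the defaultFor helper and its use-case labels are hypothetical, and the constants are local copies of the declarations on this page.

```go
package main

import "fmt"

// Local copies of the vendor constants and Default* aliases listed above.
const (
	OpenAIGPT4o               = "gpt-4o"
	OpenAIGPT4oMini           = "gpt-4o-mini"
	OpenAIO1                  = "o1"
	OpenAITextEmbedding3Small = "text-embedding-3-small"

	DefaultGeneralPurpose = OpenAIGPT4o
	DefaultFast           = OpenAIGPT4oMini
	DefaultReasoning      = OpenAIO1
	DefaultVision         = OpenAIGPT4o
	DefaultCode           = OpenAIGPT4o
	DefaultEmbedding      = OpenAITextEmbedding3Small
)

// defaultFor is a hypothetical helper mapping a use-case label to a default
// model. Unknown labels fall back to the general-purpose model.
func defaultFor(useCase string) string {
	switch useCase {
	case "fast":
		return DefaultFast
	case "reasoning":
		return DefaultReasoning
	case "vision":
		return DefaultVision
	case "code":
		return DefaultCode
	case "embedding":
		return DefaultEmbedding
	default:
		return DefaultGeneralPurpose
	}
}

func main() {
	fmt.Println(defaultFor("fast"))      // gpt-4o-mini
	fmt.Println(defaultFor("embedding")) // text-embedding-3-small
	fmt.Println(defaultFor("unknown"))   // gpt-4o
}
```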
const (
DeepSeekChat = "deepseek-chat"
DeepSeekReasoner = "deepseek-reasoner"
)
DeepSeek API chat and reasoning models.
const (
DeepSeekCoder = "deepseek-coder"
DeepSeekCoderInstruct = "deepseek-coder-instruct"
)
DeepSeek API code models.
const (
DeepSeekV3 = "deepseek-v3"
DeepSeekV25 = "deepseek-v2.5"
DeepSeekV2 = "deepseek-v2"
DeepSeekLLM7BChat = "deepseek-llm-7b-chat"
DeepSeekLLM67BChat = "deepseek-llm-67b-chat"
)
DeepSeek API legacy / versioned models.
const (
// OpenAI family
GatewayGPT4o = OpenAIGPT4o
GatewayGPT4oMini = OpenAIGPT4oMini
GatewayGPT41 = OpenAIGPT41
GatewayGPT41Mini = OpenAIGPT41Mini
GatewayGPT41Nano = OpenAIGPT41Nano
GatewayGPT4Turbo = OpenAIGPT4Turbo
GatewayGPT4 = OpenAIGPT4
GatewayGPT35Turbo = OpenAIGPT35Turbo
GatewayGPT5 = OpenAIGPT5
GatewayGPT52 = OpenAIGPT52
GatewayO1 = OpenAIO1
GatewayO1Mini = OpenAIO1Mini
GatewayO3 = OpenAIO3
GatewayO3Mini = OpenAIO3Mini
GatewayO4Mini = OpenAIO4Mini

// Anthropic family
GatewayClaudeOpus46 = AnthropicClaudeOpus46
GatewayClaudeSonnet46 = AnthropicClaudeSonnet46
GatewayClaudeHaiku45 = AnthropicClaudeHaiku45
GatewayClaudeOpus41 = AnthropicClaudeOpus41
GatewayClaudeSonnet4 = AnthropicClaudeSonnet4
GatewayClaude37Sonnet = AnthropicClaude37Sonnet
GatewayClaude35Sonnet = AnthropicClaude35Sonnet
GatewayClaude35Haiku = AnthropicClaude35Haiku
GatewayClaude3Opus = AnthropicClaude3Opus
GatewayClaude3Sonnet = AnthropicClaude3Sonnet
GatewayClaude3Haiku = AnthropicClaude3Haiku

// DeepSeek family
GatewayDeepSeekChat = DeepSeekChat
GatewayDeepSeekReasoner = DeepSeekReasoner
GatewayDeepSeekCoder = DeepSeekCoder
GatewayDeepSeekV3 = DeepSeekV3

// Google Gemini family
GatewayGemini25Pro = Gemini25Pro
GatewayGemini25Flash = Gemini25Flash
GatewayGemini25FlashLite = Gemini25FlashLite
GatewayGemini20Flash = Gemini20Flash
GatewayGemini20FlashLite = Gemini20FlashLite
GatewayGemini15Pro = Gemini15Pro
GatewayGemini15Flash = Gemini15Flash
GatewayGemini15Flash8B = Gemini15Flash8B

// Mistral family
GatewayMistralLarge = MistralLargeLatest
GatewayMistralSmall = MistralSmallLatest
GatewayMistralMedium = MistralMediumLatest
GatewayMistralCodestral = MistralCodestralLatest
GatewayMistralNemo = MistralOpenNemo

// Qwen family
GatewayQwenMax = QwenMax
GatewayQwenPlus = QwenPlus
GatewayQwenTurbo = QwenTurbo
GatewayQwen3Max = Qwen3Max

// xAI Grok family
GatewayGrok3 = XAIGrok3
GatewayGrok3Mini = XAIGrok3Mini
GatewayGrok4 = XAIGrok4

// Zhipu / Moonshot
GatewayGLM4Plus = ZhipuGLM4Plus
GatewayGLM4Flash = ZhipuGLM4Flash
GatewayKimiK2 = MoonshotKimiK2
)
OpenAI-compatible gateway aliases (aggregators, proxies, private deployments).
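With an OpenAI-compatible gateway, the model identifier is typically the only vendor-specific part of a chat request. The sketch below builds such a request without sending it. The base URL and the newGatewayRequest helper are hypothetical, the body follows the common chat-completions shape (a model plus a messages array), and the constants are local copies; a real gateway may expect different paths, headers, or model names.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// Local copies of declarations from this page.
const (
	AnthropicClaudeSonnet46 = "claude-sonnet-4-6"
	GatewayClaudeSonnet46   = AnthropicClaudeSonnet46
)

// message and chatBody mirror the common chat-completions request shape.
type message struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatBody struct {
	Model    string    `json:"model"`
	Messages []message `json:"messages"`
}

// newGatewayRequest builds, but does not send, a chat request.
// baseURL is a hypothetical gateway address supplied by the caller.
func newGatewayRequest(baseURL, model, prompt string) (*http.Request, error) {
	payload, err := json.Marshal(chatBody{
		Model:    model,
		Messages: []message{{Role: "user", Content: prompt}},
	})
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost,
		baseURL+"/v1/chat/completions", bytes.NewReader(payload))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	req, err := newGatewayRequest("https://gateway.example.com",
		GatewayClaudeSonnet46, "hello")
	if err != nil {
		panic(err)
	}
	fmt.Println(req.Method, req.URL.String())
}
```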
const (
Gemini25Pro = "gemini-2.5-pro"
Gemini25ProPreview = "gemini-2.5-pro-preview-05-06"
Gemini25Flash = "gemini-2.5-flash"
Gemini25FlashPreview = "gemini-2.5-flash-preview-05-20"
Gemini25FlashLite = "gemini-2.5-flash-lite"
Gemini25FlashLitePreview = "gemini-2.5-flash-lite-preview-06-17"
)
Google Gemini 2.5 models.
const (
Gemini20Flash = "gemini-2.0-flash"
Gemini20FlashLite = "gemini-2.0-flash-lite"
Gemini20Flash001 = "gemini-2.0-flash-001"
Gemini20FlashLite001 = "gemini-2.0-flash-lite-001"
Gemini20ProExp = "gemini-2.0-pro-exp"
Gemini20ProExp0205 = "gemini-2.0-pro-exp-02-05"
)
Google Gemini 2.0 models.
const (
Gemini15Pro = "gemini-1.5-pro"
Gemini15Pro002 = "gemini-1.5-pro-002"
Gemini15Pro001 = "gemini-1.5-pro-001"
Gemini15Flash = "gemini-1.5-flash"
Gemini15Flash002 = "gemini-1.5-flash-002"
Gemini15Flash001 = "gemini-1.5-flash-001"
Gemini15Flash8B = "gemini-1.5-flash-8b"
Gemini15Flash8B001 = "gemini-1.5-flash-8b-001"
)
Google Gemini 1.5 models.
const (
Gemini10Pro = "gemini-1.0-pro"
Gemini10Pro001 = "gemini-1.0-pro-001"
Gemini10ProLatest = "gemini-1.0-pro-latest"
)
Google Gemini 1.0 legacy models.
const (
Gemma31BIt = "gemma-3-1b-it"
Gemma34BIt = "gemma-3-4b-it"
Gemma312BIt = "gemma-3-12b-it"
Gemma327BIt = "gemma-3-27b-it"
Gemma227BIt = "gemma-2-27b-it"
Gemma29BIt = "gemma-2-9b-it"
Gemma22BIt = "gemma-2-2b-it"
)
Google Gemma API models.
const (
GeminiEmbedding001 = "gemini-embedding-001"
GeminiTextEmbedding004 = "text-embedding-004"
)
Google embedding models.
const (
MetaLlama4Maverick17B = "Llama-4-Maverick-17B-128E-Instruct"
MetaLlama4Scout17B = "Llama-4-Scout-17B-16E-Instruct"
MetaLlama32_90BVision = "Llama-3.2-90B-Vision-Instruct"
MetaLlama32_11BVision = "Llama-3.2-11B-Vision-Instruct"
MetaLlama32_3BInstruct = "Llama-3.2-3B-Instruct"
MetaLlama32_1BInstruct = "Llama-3.2-1B-Instruct"
MetaLlama31_405B = "Llama-3.1-405B-Instruct"
MetaLlama31_70B = "Llama-3.1-70B-Instruct"
MetaLlama31_8B = "Llama-3.1-8B-Instruct"
MetaLlama3_70B = "Llama-3-70B-Instruct"
MetaLlama3_8B = "Llama-3-8B-Instruct"
MetaLlama2_70B = "Llama-2-70b-chat-hf"
MetaLlama2_13B = "Llama-2-13b-chat-hf"
MetaLlama2_7B = "Llama-2-7b-chat-hf"
)
Meta Llama API instruct models.
const (
MetaLlamaGuard3_8B = "Llama-Guard-3-8B"
MetaLlamaGuard3_11B = "Llama-Guard-3-11B-Vision"
MetaLlamaGuard2_8B = "Llama-Guard-2-8B"
)
Meta Llama Guard safety models.
const (
MistralLargeLatest = "mistral-large-latest"
MistralLarge2411 = "mistral-large-2411"
MistralLarge2407 = "mistral-large-2407"
MistralSmallLatest = "mistral-small-latest"
MistralSmall2501 = "mistral-small-2501"
MistralSmall2409 = "mistral-small-2409"
MistralMediumLatest = "mistral-medium-latest"
MistralMedium2312 = "mistral-medium-2312"
)
Mistral AI flagship models.
const (
MistralOpen7B = "open-mistral-7b"
MistralOpenNemo = "open-mistral-nemo"
MistralOpenMixtral8x7B = "open-mixtral-8x7b"
MistralOpenMixtral8x22B = "open-mixtral-8x22b"
)
Mistral AI open-weight models.
const (
MistralCodestralLatest = "codestral-latest"
MistralCodestral2501 = "codestral-2501"
MistralCodestral2405 = "codestral-2405"
MistralDevstralSmall = "devstral-small-2505"
MistralDevstralMedium = "devstral-medium-2507"
)
Mistral AI code models.
const (
MistralMinistral3BLatest = "ministral-3b-latest"
MistralMinistral8BLatest = "ministral-8b-latest"
)
Mistral AI compact models.
const (
MistralPixtralLargeLatest = "pixtral-large-latest"
MistralPixtral12BLatest = "pixtral-12b-latest"
MistralPixtral12B2409 = "pixtral-12b-2409"
)
Mistral AI vision models.
const (
MistralEmbed = "mistral-embed"
MistralCodestralEmbed = "codestral-embed"
)
Mistral AI embedding models.
const (
MistralLarge = MistralLargeLatest
MistralSmall = MistralSmallLatest
MistralMedium = MistralMediumLatest
MistralNemo = MistralOpenNemo
Mistral7B = MistralOpen7B
MistralCodestral = MistralCodestralLatest
)
Backward-compatible aliases.
const (
MoonshotKimiK2 = "kimi-k2"
MoonshotKimiThinking = "kimi-thinking"
MoonshotMoonshotV1_8K = "moonshot-v1-8k"
MoonshotMoonshotV1_32K = "moonshot-v1-32k"
MoonshotMoonshotV1_128K = "moonshot-v1-128k"
)
Moonshot Kimi models.
const (
OllamaDeepSeekR1 = "deepseek-r1"
OllamaDeepSeekR1Latest = "deepseek-r1:latest"
OllamaDeepSeekR1_1_5B = "deepseek-r1:1.5b"
OllamaDeepSeekR1_7B = "deepseek-r1:7b"
OllamaDeepSeekR1_8B = "deepseek-r1:8b"
OllamaDeepSeekR1_14B = "deepseek-r1:14b"
OllamaDeepSeekR1_32B = "deepseek-r1:32b"
OllamaDeepSeekR1_70B = "deepseek-r1:70b"
OllamaDeepSeekR1_671B = "deepseek-r1:671b"
OllamaDeepSeekV3 = "deepseek-v3"
OllamaDeepSeekV3Latest = "deepseek-v3:latest"
OllamaDeepSeekV2 = "deepseek-v2"
OllamaDeepSeekV2Latest = "deepseek-v2:latest"
OllamaDeepSeekV2_16B = "deepseek-v2:16b"
OllamaDeepSeekV2_236B = "deepseek-v2:236b"
OllamaDeepSeekCoder = "deepseek-coder"
OllamaDeepSeekCoderLatest = "deepseek-coder:latest"
OllamaDeepSeekCoder_6_7B = "deepseek-coder:6.7b"
OllamaDeepSeekCoder_33B = "deepseek-coder:33b"
OllamaDeepSeekLLM7B = "deepseek-llm:7b"
OllamaDeepSeekLLM67B = "deepseek-llm:67b"
)
Ollama DeepSeek models.
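The Ollama identifiers on this page come in two shapes: a bare model name such as "deepseek-r1" and a name:tag pair such as "deepseek-r1:7b". A minimal sketch of separating the two parts, using a hypothetical splitOllamaTag helper and local copies of two constants:

```go
package main

import (
	"fmt"
	"strings"
)

// Local copies of two identifiers from this page.
const (
	OllamaDeepSeekR1    = "deepseek-r1"
	OllamaDeepSeekR1_7B = "deepseek-r1:7b"
)

// splitOllamaTag is a hypothetical helper that splits an identifier at the
// first colon. hasTag is false, and tag is empty, when there is no colon.
func splitOllamaTag(id string) (name, tag string, hasTag bool) {
	return strings.Cut(id, ":")
}

func main() {
	name, tag, ok := splitOllamaTag(OllamaDeepSeekR1_7B)
	fmt.Println(name, tag, ok) // deepseek-r1 7b true

	name, tag, ok = splitOllamaTag(OllamaDeepSeekR1)
	fmt.Printf("%q %q %v\n", name, tag, ok) // "deepseek-r1" "" false
}
```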
const (
OllamaMistral = "mistral"
OllamaMistralLatest = "mistral:latest"
OllamaMistral_7B = "mistral:7b"
OllamaMistralNemo = "mistral-nemo"
OllamaMistralNemo12B = "mistral-nemo:12b"
OllamaMistralSmall = "mistral-small"
OllamaMistralSmall22B = "mistral-small:22b"
OllamaMistralLarge = "mistral-large"
OllamaMistralLarge123B = "mistral-large:123b"
OllamaMixtral = "mixtral"
OllamaMixtralLatest = "mixtral:latest"
OllamaMixtral8x7B = "mixtral:8x7b"
OllamaMixtral8x22B = "mixtral:8x22b"
OllamaMinistral3B = "ministral-3b"
OllamaMinistral8B = "ministral-8b"
)
Ollama Mistral / Mixtral models.
const (
OllamaGemma = "gemma"
OllamaGemmaLatest = "gemma:latest"
OllamaGemma2 = "gemma2"
OllamaGemma2Latest = "gemma2:latest"
OllamaGemma2_2B = "gemma2:2b"
OllamaGemma2_9B = "gemma2:9b"
OllamaGemma2_27B = "gemma2:27b"
OllamaGemma3 = "gemma3"
OllamaGemma3Latest = "gemma3:latest"
OllamaGemma3_1B = "gemma3:1b"
OllamaGemma3_4B = "gemma3:4b"
OllamaGemma3_12B = "gemma3:12b"
OllamaGemma3_27B = "gemma3:27b"
)
Ollama Google Gemma models.
const (
OllamaPhi3 = "phi3"
OllamaPhi3Latest = "phi3:latest"
OllamaPhi3Mini = "phi3:mini"
OllamaPhi3Medium = "phi3:medium"
OllamaPhi3_3_8B = "phi3:3.8b"
OllamaPhi3_14B = "phi3:14b"
OllamaPhi4 = "phi4"
OllamaPhi4Latest = "phi4:latest"
OllamaPhi4Mini = "phi4-mini"
)
Ollama Microsoft Phi models.
const (
OllamaCodellama = "codellama"
OllamaCodellamaLatest = "codellama:latest"
OllamaCodellama_7B = "codellama:7b"
OllamaCodellama_13B = "codellama:13b"
OllamaCodellama_34B = "codellama:34b"
OllamaCodellama_70B = "codellama:70b"
OllamaCodeLlamaInstruct = "codellama:instruct"
OllamaCodeLlamaPython = "codellama:python"
OllamaStarCoder2 = "starcoder2"
OllamaStarCoder2_3B = "starcoder2:3b"
OllamaStarCoder2_7B = "starcoder2:7b"
OllamaStarCoder2_15B = "starcoder2:15b"
OllamaWizardCoder = "wizardcoder"
OllamaWizardCoder33B = "wizardcoder:33b"
)
Ollama code models.
const (
OllamaLlama32 = "llama3.2"
OllamaLlama32Latest = "llama3.2:latest"
OllamaLlama32_1B = "llama3.2:1b"
OllamaLlama32_3B = "llama3.2:3b"
OllamaLlama32Vision = "llama3.2-vision"
OllamaLlama32Vision11B = "llama3.2-vision:11b"
OllamaLlama32Vision90B = "llama3.2-vision:90b"
OllamaLlama31 = "llama3.1"
OllamaLlama31Latest = "llama3.1:latest"
OllamaLlama31_8B = "llama3.1:8b"
OllamaLlama31_70B = "llama3.1:70b"
OllamaLlama31_405B = "llama3.1:405b"
OllamaLlama3 = "llama3"
OllamaLlama3Latest = "llama3:latest"
OllamaLlama3_8B = "llama3:8b"
OllamaLlama3_70B = "llama3:70b"
OllamaLlama2 = "llama2"
OllamaLlama2Latest = "llama2:latest"
OllamaLlama2_7B = "llama2:7b"
OllamaLlama2_13B = "llama2:13b"
OllamaLlama2_70B = "llama2:70b"
OllamaLlama4 = "llama4"
OllamaLlama4Latest = "llama4:latest"
OllamaLlama4Scout = "llama4:scout"
OllamaLlama4Maverick = "llama4:maverick"
)
Ollama Meta Llama models.
const (
OllamaLLaVA = "llava"
OllamaLLaVALatest = "llava:latest"
OllamaLLaVA_7B = "llava:7b"
OllamaLLaVA_13B = "llava:13b"
OllamaLLaVA_34B = "llava:34b"
OllamaLLaVAPhi3 = "llava-phi3"
OllamaLLaVALlama3 = "llava-llama3"
OllamaBakLLaVA = "bakllava"
OllamaMoondream = "moondream"
OllamaMiniCPMV = "minicpm-v"
)
Ollama vision / multimodal models.
const (
OllamaNomicEmbedText = "nomic-embed-text"
OllamaMXBAIEmbedLarge = "mxbai-embed-large"
OllamaSnowflakeArcticEmbed = "snowflake-arctic-embed"
)
Ollama embedding models.
const (
OllamaCommandR = "command-r"
OllamaCommandRLatest = "command-r:latest"
OllamaCommandR7B = "command-r7b"
OllamaCommandRPlus = "command-r-plus"
OllamaCommandRPlus104B = "command-r-plus:104b"
OllamaGranite3 = "granite3"
OllamaGranite3Latest = "granite3:latest"
OllamaGranite3_1B = "granite3-dense:1b"
OllamaGranite3_2B = "granite3-dense:2b"
OllamaGranite3_8B = "granite3-dense:8b"
OllamaSolar = "solar"
OllamaSolarLatest = "solar:latest"
OllamaSolar10_7B = "solar:10.7b"
OllamaYi = "yi"
OllamaYiLatest = "yi:latest"
OllamaYi_6B = "yi:6b"
OllamaYi_9B = "yi:9b"
OllamaYi_34B = "yi:34b"
OllamaNousHermes2 = "nous-hermes2"
OllamaNousHermes2Latest = "nous-hermes2:latest"
OllamaNousHermes2Mixtral = "nous-hermes2-mixtral"
OllamaNousHermes2Mixtral8x7B = "nous-hermes2-mixtral:8x7b"
OllamaWizardLM2 = "wizardlm2"
OllamaWizardLM2_7B = "wizardlm2:7b"
OllamaWizardLM2_8x22B = "wizardlm2:8x22b"
OllamaDolphinMixtral = "dolphin-mixtral"
OllamaDolphinMixtral8x7B = "dolphin-mixtral:8x7b"
OllamaDolphinMixtral8x22B = "dolphin-mixtral:8x22b"
OllamaOpenChat = "openchat"
OllamaOpenChatLatest = "openchat:latest"
OllamaOpenChat7B = "openchat:7b"
OllamaVicuna = "vicuna"
OllamaVicunaLatest = "vicuna:latest"
OllamaVicuna_7B = "vicuna:7b"
OllamaVicuna_13B = "vicuna:13b"
OllamaVicuna_33B = "vicuna:33b"
OllamaOrcaMini = "orca-mini"
OllamaOrcaMini3B = "orca-mini:3b"
OllamaOrcaMini7B = "orca-mini:7b"
OllamaTinyLlama = "tinyllama"
OllamaSmolLM2 = "smollm2"
OllamaSmolLM2_135M = "smollm2:135m"
OllamaSmolLM2_360M = "smollm2:360m"
OllamaSmolLM2_1_7B = "smollm2:1.7b"
)
Ollama Cohere, IBM, and other vendor models.
const (
OllamaQwen25 = "qwen2.5"
OllamaQwen25Latest = "qwen2.5:latest"
OllamaQwen25_0_5B = "qwen2.5:0.5b"
OllamaQwen25_1_5B = "qwen2.5:1.5b"
OllamaQwen25_3B = "qwen2.5:3b"
OllamaQwen25_7B = "qwen2.5:7b"
OllamaQwen25_14B = "qwen2.5:14b"
OllamaQwen25_32B = "qwen2.5:32b"
OllamaQwen25_72B = "qwen2.5:72b"
OllamaQwen25Coder = "qwen2.5-coder"
OllamaQwen25CoderLatest = "qwen2.5-coder:latest"
OllamaQwen25Coder_7B = "qwen2.5-coder:7b"
OllamaQwen25Coder_14B = "qwen2.5-coder:14b"
OllamaQwen25Coder_32B = "qwen2.5-coder:32b"
OllamaQwen25VL = "qwen2.5vl"
OllamaQwen25VL_7B = "qwen2.5vl:7b"
OllamaQwen25VL_32B = "qwen2.5vl:32b"
OllamaQwen3 = "qwen3"
OllamaQwen3Latest = "qwen3:latest"
OllamaQwen3_4B = "qwen3:4b"
OllamaQwen3_8B = "qwen3:8b"
OllamaQwen3_14B = "qwen3:14b"
OllamaQwen3_30B = "qwen3:30b"
OllamaQwen3_32B = "qwen3:32b"
OllamaQwen3_235B = "qwen3:235b"
OllamaQwen2 = "qwen2"
OllamaQwen2Latest = "qwen2:latest"
OllamaQwen2_7B = "qwen2:7b"
OllamaQwen2_72B = "qwen2:72b"
)
Ollama Qwen models.
const (
OpenAIGPT4o = "gpt-4o"
OpenAIGPT4oMini = "gpt-4o-mini"
OpenAIGPT4o2024_05_13 = "gpt-4o-2024-05-13"
OpenAIGPT4o2024_08_06 = "gpt-4o-2024-08-06"
OpenAIGPT4o2024_11_20 = "gpt-4o-2024-11-20"
OpenAIGPT4oMini2024_07_18 = "gpt-4o-mini-2024-07-18"
OpenAIChatGPT4oLatest = "chatgpt-4o-latest"
)
OpenAI GPT-4o family.
const (
OpenAIGPT41 = "gpt-4.1"
OpenAIGPT41Mini = "gpt-4.1-mini"
OpenAIGPT41Nano = "gpt-4.1-nano"
)
OpenAI GPT-4.1 family.
const (
OpenAIGPT4 = "gpt-4"
OpenAIGPT4Turbo = "gpt-4-turbo"
OpenAIGPT4TurboPreview = "gpt-4-turbo-preview"
OpenAIGPT4Turbo2024_04_09 = "gpt-4-turbo-2024-04-09"
OpenAIGPT4_0613 = "gpt-4-0613"
OpenAIGPT4_32k = "gpt-4-32k"
OpenAIGPT4_32k0613 = "gpt-4-32k-0613"
)
OpenAI GPT-4 legacy family.
const (
OpenAIGPT35Turbo = "gpt-3.5-turbo"
OpenAIGPT35Turbo16k = "gpt-3.5-turbo-16k"
OpenAIGPT35Turbo0125 = "gpt-3.5-turbo-0125"
OpenAIGPT35Turbo1106 = "gpt-3.5-turbo-1106"
OpenAIGPT35TurboInstruct = "gpt-3.5-turbo-instruct"
)
OpenAI GPT-3.5 family.
const (
OpenAIGPT5 = "gpt-5"
OpenAIGPT5Mini = "gpt-5-mini"
OpenAIGPT5Nano = "gpt-5-nano"
OpenAIGPT52 = "gpt-5.2"
OpenAIGPT52Pro = "gpt-5.2-pro"
OpenAIGPT52Mini = "gpt-5.2-mini"
)
OpenAI GPT-5 family.
const (
OpenAIO1 = "o1"
OpenAIO1Mini = "o1-mini"
OpenAIO1Preview = "o1-preview"
OpenAIO1Pro = "o1-pro"
OpenAIO3 = "o3"
OpenAIO3Mini = "o3-mini"
OpenAIO3Pro = "o3-pro"
OpenAIO4Mini = "o4-mini"
OpenAIO4MiniHigh = "o4-mini-high"
)
OpenAI o-series reasoning models.
const (
OpenAIGPT4oAudioPreview = "gpt-4o-audio-preview"
OpenAIGPT4oMiniAudioPreview = "gpt-4o-mini-audio-preview"
OpenAIGPT4oTranscribe = "gpt-4o-transcribe"
OpenAIGPT4oMiniTranscribe = "gpt-4o-mini-transcribe"
)
OpenAI audio models.
const (
OpenAIGPT4oRealtime = "gpt-4o-realtime-preview"
OpenAIGPT4oMiniRealtime = "gpt-4o-mini-realtime-preview"
OpenAIGPT4oRealtime2024_10_01 = "gpt-4o-realtime-preview-2024-10-01"
)
OpenAI realtime models.
const (
OpenAITextEmbedding3Large = "text-embedding-3-large"
OpenAITextEmbedding3Small = "text-embedding-3-small"
OpenAITextEmbeddingAda002 = "text-embedding-ada-002"
)
OpenAI embedding models.
const (
OpenAIDALLE3 = "dall-e-3"
OpenAIDALLE2 = "dall-e-2"
OpenAIGPTImage1 = "gpt-image-1"
)
OpenAI image models.
const (
OpenAIWhisper1 = "whisper-1"
OpenAITTS1 = "tts-1"
OpenAITTS1HD = "tts-1-hd"
)
OpenAI speech models.
const (
OpenAIModerationLatest = "omni-moderation-latest"
OpenAIModerationStable = "text-moderation-stable"
)
OpenAI moderation models.
const (
QwenMax = "qwen-max"
QwenMaxLatest = "qwen-max-latest"
QwenPlus = "qwen-plus"
QwenPlusLatest = "qwen-plus-latest"
QwenTurbo = "qwen-turbo"
QwenTurboLatest = "qwen-turbo-latest"
QwenLong = "qwen-long"
QwenQwQ32B = "qwq-32b"
QwenQwQPlus = "qwq-plus"
Qwen25Max = "qwen2.5-max"
Qwen25Plus = "qwen2.5-plus"
Qwen25Turbo = "qwen2.5-turbo"
Qwen25_72BInstruct = "qwen2.5-72b-instruct"
Qwen25_32BInstruct = "qwen2.5-32b-instruct"
Qwen25_14BInstruct = "qwen2.5-14b-instruct"
Qwen25_7BInstruct = "qwen2.5-7b-instruct"
Qwen25Coder32BInstruct = "qwen2.5-coder-32b-instruct"
Qwen25Coder7BInstruct = "qwen2.5-coder-7b-instruct"
Qwen3Max = "qwen3-max"
Qwen3Plus = "qwen3-plus"
Qwen3235BInstruct = "qwen3-235b-a22b-instruct"
Qwen332BInstruct = "qwen3-32b-instruct"
Qwen314BInstruct = "qwen3-14b-instruct"
Qwen38BInstruct = "qwen3-8b-instruct"
)
Alibaba Qwen DashScope API models.
const (
XAIGrok4 = "grok-4"
XAIGrok3 = "grok-3"
XAIGrok3Mini = "grok-3-mini"
XAIGrok3MiniFast = "grok-3-mini-fast"
XAIGrok3Fast = "grok-3-fast"
XAIGrok2 = "grok-2"
XAIGrok2Mini = "grok-2-mini"
XAIGrok2Vision = "grok-2-vision"
XAIGrokBeta = "grok-beta"
)
xAI Grok models.
const (
ZhipuGLM4Plus = "glm-4-plus"
ZhipuGLM4Air = "glm-4-air"
ZhipuGLM4AirX = "glm-4-airx"
ZhipuGLM4Flash = "glm-4-flash"
ZhipuGLM4Long = "glm-4-long"
ZhipuGLM4 = "glm-4"
ZhipuGLM3Turbo = "glm-3-turbo"
ZhipuGLMZeroPreview = "glm-zero-preview"
)
Zhipu GLM models.
const (
AnthropicClaude37Sonnet = "claude-3-7-sonnet-20250219"
)
Anthropic Claude 3.7 models.
const (
AnthropicClaudeInstant12 = "claude-instant-1.2"
)
Anthropic Claude Instant legacy.
Variables
This section is empty.
Functions
This section is empty.
Types
This section is empty.