llm-adapters

Supported Models

This table provides an overview of the supported models, including their vendor, provider, cost details, and capabilities.

Model Vendor Provider Prompt $ Completion $ Request $ Context Completion User Repeating Roles Streaming Vision Tools Tools Streaming Supports N System Multiple Systems System Last Empty Content Tool Choice Tool Choice Required JSON Output JSON Content Last Assistant First Assistant Temperature Only System Only Assistant Stop Max Tokens Max Completion Tokens Vision Multiple Presence Penalty Repetition Penalty Top P Top K Min P Developer
claude-3-opus-latest anthropic anthropic $1.5e-05 $7.5e-05 $0.0 200000 4096
claude-3-opus-20240229 anthropic anthropic $1.5e-05 $7.5e-05 $0.0 200000 4096
claude-3-5-haiku-latest anthropic anthropic $8e-07 $4e-06 $0.0 200000 8192
claude-3-5-haiku-20241022 anthropic anthropic $8e-07 $4e-06 $0.0 200000 8192
claude-3-5-sonnet-latest anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 8192
claude-3-5-sonnet-20241022 anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 8192
claude-3-5-sonnet-20240620 anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 8192
claude-3-7-sonnet-20250219 anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 64000
claude-3-7-sonnet-latest anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 64000
claude-sonnet-4-20250514 anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 64000
claude-sonnet-4-0 anthropic anthropic $3e-06 $1.5e-05 $0.0 200000 64000
claude-opus-4-20250514 anthropic anthropic $1.5e-05 $7.5e-05 $0.0 200000 32000
claude-opus-4-0 anthropic anthropic $1.5e-05 $7.5e-05 $0.0 200000 32000
llama3.1-8b meta-llama cerebras $1e-07 $1e-07 $0.0 32768 N/A
llama3.3-70b meta-llama cerebras $8.5e-07 $1.2e-06 $0.0 32768 N/A
command-r-plus-04-2024 cohere cohere $3e-06 $1.5e-05 $0.0 128000 4000
command-r-plus-08-2024 cohere cohere $2.5e-06 $1e-05 $0.0 128000 4000
command-r-plus cohere cohere $2.5e-06 $1e-05 $0.0 128000 4000
command-r-03-2024 cohere cohere $5e-07 $1.5e-06 $0.0 128000 4000
command-r-08-2024 cohere cohere $1.5e-07 $6e-07 $0.0 128000 4000
command-r cohere cohere $1.5e-07 $6e-07 $0.0 128000 4000
command cohere cohere $1e-06 $2e-06 $0.0 4000 4000
command-nightly cohere cohere $1e-06 $2e-06 $0.0 128000 128000
command-light cohere cohere $3e-07 $6e-07 $0.0 4000 4000
command-light-nightly cohere cohere $3e-07 $6e-07 $0.0 4000 4000
c4ai-aya-expanse-8b cohere cohere $5e-07 $1.5e-06 $0.0 8000 4000
c4ai-aya-expanse-32b cohere cohere $5e-07 $1.5e-06 $0.0 128000 4000
Llama-3.3-70B-Instruct meta-llama deepinfra $2.3e-07 $4e-07 $0.0 131072 N/A
Llama-3.3-70B-Instruct-Turbo meta-llama deepinfra $7e-08 $2.5e-07 $0.0 131072 N/A
phi-4 microsoft deepinfra $7e-08 $1.4e-07 $0.0 16384 N/A
DeepSeek-V3 deepseek-ai deepinfra $3.8e-07 $8.9e-07 $0.0 16000 N/A
DeepSeek-R1 deepseek-ai deepinfra $4.5e-07 $2.18e-06 $0.0 16000 N/A
DeepSeek-R1-Distill-Llama-70B deepseek-ai deepinfra $1e-07 $4e-07 $0.0 131072 N/A
Qwen2.5-Coder-32B-Instruct Qwen deepinfra $6e-08 $1.5e-07 $0.0 32768 N/A
Qwen2.5-72B-Instruct Qwen deepinfra $1.2e-07 $3.9e-07 $0.0 32768 N/A
WizardLM-2-8x22B microsoft deepinfra $5e-07 $5e-07 $0.0 65536 N/A
Llama-4-Maverick-17B-128E-Instruct-FP8 meta-llama deepinfra $1.6e-07 $6e-07 $0.0 131072 N/A
Llama-4-Scout-17B-16E-Instruct meta-llama deepinfra $8e-08 $3e-07 $0.0 131072 N/A
DeepSeek-R1-Turbo deepseek-ai deepinfra $1e-06 $3e-06 $0.0 32768 N/A
QwQ-32B Qwen deepinfra $1.5e-07 $2e-07 $0.0 131072 N/A
DeepSeek-V3-0324 deepseek-ai deepinfra $3e-07 $8.8e-07 $0.0 163840 N/A
gemma-3-27b-it google deepinfra $1e-07 $2e-07 $0.0 131072 N/A
gemma-3-12b-it google deepinfra $5e-08 $1e-07 $0.0 131072 N/A
gemma-3-4b-it google deepinfra $2e-08 $4e-08 $0.0 131072 N/A
Phi-4-multimodal-instruct microsoft deepinfra $5e-08 $1e-07 $0.0 131072 N/A
Meta-Llama-3.1-405B-Instruct meta-llama deepinfra $8e-07 $8e-07 $0.0 32768 N/A
Meta-Llama-3.1-70B-Instruct meta-llama deepinfra $2.3e-07 $4e-07 $0.0 131072 N/A
Meta-Llama-3.1-8B-Instruct meta-llama deepinfra $3e-08 $5e-08 $0.0 131072 N/A
Meta-Llama-3.1-70B-Instruct-Turbo meta-llama deepinfra $1e-07 $2.8e-07 $0.0 131072 N/A
Meta-Llama-3.1-8B-Instruct-Turbo meta-llama deepinfra $2e-08 $5e-08 $0.0 131072 N/A
Mistral-Small-24B-Instruct-2501 mistralai deepinfra $6e-08 $1.2e-07 $0.0 32768 N/A
Mixtral-8x7B-Instruct-v0.1 mistralai deepinfra $8e-08 $2.4e-07 $0.0 32768 N/A
Llama-3.2-90B-Vision-Instruct meta-llama deepinfra $3.5e-07 $4e-07 $0.0 32768 N/A
Llama-3.2-11B-Vision-Instruct meta-llama deepinfra $4.9e-08 $4.9e-08 $0.0 131072 N/A
Llama-3.2-1B-Instruct meta-llama deepinfra $5e-09 $1e-08 $0.0 131072 N/A
Llama-3.2-3B-Instruct meta-llama deepinfra $1e-08 $2e-08 $0.0 131072 N/A
deepseek-chat deepseek deepseek $2.7e-07 $1.1e-06 $0.0 64000 8000
deepseek-reasoner deepseek deepseek $5.5e-07 $2.19e-06 $0.0 64000 N/A
gemini-1.5-pro-latest gemini gemini $1.25e-06 $5e-06 $0.0 2097152 8192
gemini-1.5-pro gemini gemini $1.25e-06 $5e-06 $0.0 2097152 8192
gemini-1.5-flash-latest gemini gemini $7.5e-08 $3e-07 $0.0 1048576 8192
gemini-1.5-flash gemini gemini $7.5e-08 $3e-07 $0.0 1048576 8192
gemini-1.5-flash-8b-latest gemini gemini $3.75e-08 $1.5e-07 $0.0 1048576 8192
gemini-1.5-flash-8b gemini gemini $3.75e-08 $1.5e-07 $0.0 1048576 8192
gemini-2.0-flash gemini gemini $1e-07 $4e-07 $0.0 1048576 8192
gemini-2.5-pro-preview-03-25 gemini gemini $1.5e-07 $6e-07 $0.0 1048576 65536
gemini-2.5-flash-preview-04-17 gemini gemini $1.25e-06 $1e-05 $0.0 1048576 65536
gemini-2.5-pro-preview-05-06 gemini gemini $1.25e-06 $1e-05 $0.0 1048576 65536
llama-3.1-8b-instant meta-llama groq $5e-08 $8e-08 $0.0 131072 N/A
llama3-70b-8192 meta-llama groq $5.9e-07 $7.9e-07 $0.0 8192 N/A
llama3-8b-8192 meta-llama groq $5e-08 $8e-08 $0.0 8192 N/A
gemma2-9b-it google groq $2e-07 $2e-07 $0.0 8192 N/A
llama-guard-3-8b meta-llama groq $2e-07 $2e-07 $0.0 8192 N/A
llama3.1-405b-instruct-fp8 meta-llama lambdalabs $8e-07 $8e-07 $0.0 131000 N/A
lfm-40b liquid lambdalabs $1.5e-07 $1.5e-07 $0.0 66000 N/A
qwen25-coder-32b-instruct qwen lambdalabs $7e-08 $1.6e-07 $0.0 33000 N/A
hermes3-405b nous-hermes lambdalabs $9e-07 $9e-07 $0.0 131000 N/A
deepseek-llama3.3-70b deepseek lambdalabs $2e-07 $2e-07 $0.0 131000 N/A
llama3.3-70b-instruct-fp8 meta-llama lambdalabs $2e-07 $2e-07 $0.0 131000 N/A
llama3.2-3b-instruct meta-llama lambdalabs $2e-08 $2e-08 $0.0 131000 N/A
hermes3-8b nous-hermes lambdalabs $3e-08 $3e-08 $0.0 131000 N/A
llama3.1-nemotron-70b-instruct-fp8 meta-llama lambdalabs $2e-07 $2e-07 $0.0 131000 N/A
hermes3-70b nous-hermes lambdalabs $2e-07 $2e-07 $0.0 131000 N/A
llama3.1-8b-instruct meta-llama lambdalabs $2.5e-08 $4e-08 $0.0 131000 N/A
DeepSeek-V3-0324 deepseek lambdalabs $3.4e-07 $8.8e-07 $0.0 164000 N/A
llama3.1-70b-instruct-fp8 meta-llama lambdalabs $1.2e-07 $3e-07 $0.0 131000 N/A
Llama-4-maverick-17b-128e-instruct-fp8 meta-llama lambdalabs $1.8e-07 $6e-07 $0.0 1000000 N/A
Llama-4-scout-17b-16e-instruct meta-llama lambdalabs $8e-08 $3e-07 $0.0 1000000 N/A
gpt-3.5-turbo-instruct openai openai $1.5e-06 $2e-06 $0.0 16385 16385
gpt-4o openai openai $2.5e-06 $1e-05 $0.0 128000 16384
gpt-4o-2024-11-20 openai openai $2.5e-06 $1e-05 $0.0 128000 16384
gpt-4o-2024-08-06 openai openai $2.5e-06 $1e-05 $0.0 128000 16384
gpt-4o-2024-11-20 openai openai $2.5e-06 $1e-05 $0.0 128000 16384
gpt-4o-mini openai openai $6e-07 $2.4e-06 $0.0 128000 16385
gpt-4o-mini-2024-07-18 openai openai $1.5e-07 $6e-07 $0.0 128000 16385
o1-mini openai openai $1.1e-06 $4.4e-06 $0.0 128000 65536
o1-mini-2024-09-12 openai openai $1.1e-06 $4.4e-06 $0.0 128000 65536
o1 openai openai $1.5e-05 $6e-05 $0.0 200000 100000
o1-2024-12-17 openai openai $1.5e-05 $6e-05 $0.0 200000 100000
o3-mini openai openai $1.1e-06 $4.4e-06 $0.0 200000 100000
o3-mini-2025-01-31 openai openai $1.1e-06 $4.4e-06 $0.0 200000 100000
gpt-4.5-preview-2025-02-27 openai openai $7.5e-05 $0.00015 $0.0 128000 16384
gpt-4.5-preview openai openai $7.5e-05 $0.00015 $0.0 128000 16384
gpt-4.1 openai openai $2e-06 $8e-06 $0.0 1047576 32768
gpt-4.1-2025-04-14 openai openai $2e-06 $8e-06 $0.0 1047576 32768
gpt-4.1-mini openai openai $4e-07 $1.6e-06 $0.0 1047576 32768
gpt-4.1-mini-2025-04-14 openai openai $4e-07 $1.6e-06 $0.0 1047576 32768
gpt-4.1-nano openai openai $1e-07 $4e-07 $0.0 1047576 32768
gpt-4.1-nano-2025-04-14 openai openai $1e-07 $4e-07 $0.0 1047576 32768
o4-mini openai openai $1.1e-06 $4.4e-06 $0.0 200000 100000
o4-mini-2025-04-16 openai openai $1.1e-06 $4.4e-06 $0.0 200000 100000
sonar perplexity perplexity $1e-06 $1e-06 $0.01 127000 N/A
sonar-pro perplexity perplexity $3e-06 $1.5e-05 $0.01 127000 8000
sonar-reasoning perplexity perplexity $1e-06 $5e-06 $0.01 127000 N/A
sonar-reasoning-pro perplexity perplexity $2e-06 $8e-06 $0.01 127000 N/A
DeepSeek-R1 deepseek-ai together $3e-06 $7e-06 $0.0 164000 N/A
DeepSeek-V3 deepseek-ai together $1.25e-06 $1.25e-06 $0.0 131000 N/A
Qwen2.5-72B-Instruct-Turbo Qwen together $1.2e-06 $1.2e-06 $0.0 32768 N/A
Mistral-Small-24B-Instruct-2501 mistralai together $8e-07 $8e-07 $0.0 32768 N/A
Llama-3.1-Nemotron-70B-Instruct-HF nvidia together $8.8e-07 $8.8e-07 $0.0 128000 N/A
Llama-3.3-70B-Instruct-Turbo meta-llama together $8.8e-07 $8.8e-07 $0.0 128000 N/A
Meta-Llama-3.1-8B-Instruct-Turbo meta-llama together $1.8e-07 $1.8e-07 $0.0 131072 N/A
Meta-Llama-3.1-70B-Instruct-Turbo meta-llama together $8.8e-07 $8.8e-07 $0.0 131072 N/A
Meta-Llama-3.1-405B-Instruct-Turbo meta-llama together $3.5e-06 $3.5e-06 $0.0 131000 N/A
Qwen2.5-Coder-32B-Instruct Qwen together $8e-07 $8e-07 $0.0 32768 N/A
gemma-2-27b-it google together $8e-07 $8e-07 $0.0 8000 N/A
DeepSeek-R1-Distill-Qwen-1.5B deepseek-ai together $1.8e-07 $1.8e-07 $0.0 128000 N/A
Llama-4-Maverick-17B-128E-Instruct-FP8 meta-llama together $2.7e-07 $8.5e-07 $0.0 500000 N/A
Llama-4-Scout-17B-16E-Instruct meta-llama together $1.8e-07 $5.9e-07 $0.0 300000 N/A
Llama-3.2-11B-Vision-Instruct-Turbo meta-llama together $1.8e-07 $1.8e-07 $0.0 128000 N/A
Llama-3.2-90B-Vision-Instruct-Turbo meta-llama together $1.2e-06 $1.2e-06 $0.0 128000 N/A