LLM Model Prices (per million tokens)

Thanks to the LiteLLM project, the model providers' websites and APIs, and assorted online sources for the source data. Most benchmark data comes from the Epoch AI benchmarking dashboard, provided by Epoch AI under a Creative Commons license; some comes from other sources. The Aider Polyglot data is from the Aider website. Always check the original source for the most up-to-date information: the data may contain errors (some sources are a bit rough), and matching benchmark scores to models is a bit fuzzy.
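Because every price here is quoted per million tokens, turning a row into the cost of a single request takes a small calculation. The sketch below is illustrative only: `Price` and `request_cost` are hypothetical names, not part of LiteLLM or any provider SDK. The example rates are copied from the gpt-4.1 and claude-sonnet-4 rows of the table. It also assumes that a ">200k" tier applies the higher rate to the whole request once the input exceeds the threshold; billing rules differ between providers, so check the provider's own pricing page before relying on that.

```python
from dataclasses import dataclass
from typing import Optional

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Price:
    """Per-million-token prices in USD (hypothetical helper for illustration)."""
    input_per_m: float
    output_per_m: float
    # Optional long-context tier, such as the ">200k" rows in the table.
    tier_threshold: Optional[int] = None
    input_per_m_above: Optional[float] = None
    output_per_m_above: Optional[float] = None


def request_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request."""
    in_rate, out_rate = price.input_per_m, price.output_per_m
    # ASSUMPTION: once the input exceeds the threshold, the higher rates
    # apply to the entire request, not only to the tokens above it.
    if price.tier_threshold is not None and input_tokens > price.tier_threshold:
        if price.input_per_m_above is not None:
            in_rate = price.input_per_m_above
        if price.output_per_m_above is not None:
            out_rate = price.output_per_m_above
    return (input_tokens * in_rate + output_tokens * out_rate) / TOKENS_PER_MILLION


# Rates taken from the table rows for gpt-4.1 and claude-sonnet-4.
gpt_4_1 = Price(input_per_m=2.00, output_per_m=8.00)
claude_sonnet_4 = Price(
    input_per_m=3.00,
    output_per_m=15.00,
    tier_threshold=200_000,
    input_per_m_above=6.00,
    output_per_m_above=22.50,
)

print(f"{request_cost(gpt_4_1, 10_000, 1_000):.4f}")          # 0.0280
print(f"{request_cost(claude_sonnet_4, 100_000, 2_000):.4f}")  # 0.3300
print(f"{request_cost(claude_sonnet_4, 250_000, 2_000):.4f}")  # 1.5450
```

Rows that list two prices in one column (for example a low and a high price across endpoints) would need one `Price` per endpoint; the sketch does not try to model that.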

Showing 1395 models (589 rows displayed)
Provider | Model | Max Input Tokens | Input Token Price | Output Token Price | GPQA (Diamond) | MATH (Level 5) | OTIS Mock AIME 24-25 | Aider Polyglot
openai, openrouter chatgpt-4o-latest (2 endpoints)128k$5.00$15.0045.3%
vercel_ai_gateway, vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3.5-sonnet (15 endpoints)200k$3.00$3.60$15.00$18.0051.6%
vercel_ai_gateway, openrouter, anthropic, vertex_ai-anthropic_models, bedrock_converse, bedrock, deepinfra claude-3.7-sonnet (10 endpoints)200k$3.00$3.30$15.00$16.5064.9%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-opus-4 (7 endpoints)200k$15.00$75.0072%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-sonnet-4 (8 endpoints)1000k$3.00
>200k: $6.00
$15.00
>200k: $22.50
61.3%
openrouter, vercel_ai_gateway, vertex_ai-mistral_models, codestral, mistral codestral (9 endpoints)32k262k$0.00$1.00$0.00$3.0011.1%
lambda, openrouter, deepinfra, wandb, fireworks_ai, lambda_ai, hyperbolic, vercel_ai_gateway, azure_ai, bedrock_converse, together_ai, deepseek, sambanova deepseek-r1 (20 endpoints)32k164k$0.00$135,000.00$0.00$540,000.0071.4%
openrouter, deepinfra, fireworks_ai, wandb, lambda_ai, vercel_ai_gateway, azure_ai, deepseek, together_ai, hyperbolic, sambanova deepseek-v3 (18 endpoints)32k163k$0.00$114,000.00$0.00$275,000.0055.1%
openrouter deepseek/​deepseek-​v3.2-​exp
deepseek-​v3.2-​exp
163k$0.27$0.4074.2%
vertex_ai-language-models, gemini, vercel_ai_gateway gemini-2.0-flash (3 endpoints)1048k$0.10$0.15$0.40$0.60
vertex_ai-language-models, gemini, vercel_ai_gateway gemini-2.0-flash-lite (3 endpoints)1048k$0.075$0.30
vertex_ai-language-models, gemini, openrouter gemini-2.5-flash-preview-04-17 (7 endpoints)1048k$0.15$0.30$0.60$2.5055.1%
vertex_ai-language-models, gemini gemini-2.5-pro-exp-03-25 (3 endpoints)1048k$0.00$1.25
>200k: $0.00$2.50
$0.00$10.00
>200k: $0.00$15.00
openrouter, vertex_ai-language-models, gemini gemini-2.5-pro-preview (7 endpoints)1048k$1.25
>200k: $2.50
$10.00
>200k: $15.00
83.1%
azure, openai, vercel_ai_gateway, openrouter gpt-4.1 (6 endpoints)1047k$2.00$8.0052.4%
azure azure/​gpt-​4.5-​preview
gpt-​4.5-​preview
128k$75.00$150.0044.9%
azure, openai, vercel_ai_gateway, openrouter gpt-4o (21 endpoints)128k$2.50$5.00$10.00$15.0023.1%
azure, openai, vercel_ai_gateway, openrouter gpt-4o-mini (10 endpoints)128k$0.15$0.165$0.60$0.663.6%
openrouter, azure, openai gpt-5 (5 endpoints)272k400k$1.25$10.0088%
xai, openrouter grok-3-beta (3 endpoints)131k$3.00$15.0053.3%
xai, openrouter grok-3-mini-beta (3 endpoints)131k$0.30$0.5049.3%
vercel_ai_gateway, xai, openrouter, azure_ai, oci grok-4 (7 endpoints)128k256k$3.00$5.50$0.15$27.5079.6%
openrouter, vercel_ai_gateway kimi-k2 (4 endpoints)32k262k$0.00$0.55$0.00$2.4959.1%
openrouter, lambda, vercel_ai_gateway llama-4-maverick (4 endpoints)128k1048k$0.00$0.20$0.00$0.6015.6%
lambda, openrouter, vercel_ai_gateway llama-4-scout (4 endpoints)128k1000k$0.00$0.10$0.00$0.30
azure_ai, vertex_ai-mistral_models, openrouter, mistral, vercel_ai_gateway, bedrock mistral-small (8 endpoints)32k128k$0.10$1.00$0.30$3.00
azure, openai, vercel_ai_gateway, openrouter o1 (8 endpoints)200k$15.00$16.50$60.00$66.0061.7%
openai, openrouter, azure o1-mini (8 endpoints)128k$1.10$3.00$4.40$12.0032.9%
openrouter openai/​o1-​pro
o1-​pro
200k$150.00$600.00
azure, openai, vercel_ai_gateway, openrouter o3 (6 endpoints)200k$2.00$10.00$8.00$40.0081.3%
azure, openai, vercel_ai_gateway, openrouter o3-mini (9 endpoints)200k$1.10$1.21$4.40$4.8460.4%
openrouter openai/​o3-​pro
o3-​pro
200k$20.00$80.0084.9%
azure, openai, vercel_ai_gateway, openrouter o4-mini (7 endpoints)200k$1.10$4.4072%
openrouter arcee-​ai/​afm-​4.5b
afm-​4.5b
65k$0.10$0.40
openrouter aion-​labs/​aion-​1.0
aion-​1.0
131k$4.00$8.00
openrouter aion-​labs/​aion-​1.0-​mini
aion-​1.0-​mini
131k$0.70$1.40
openrouter aion-​labs/​aion-​rp-​llama-​3.1-​8b
aion-​rp-​llama-​3.1-​8b
32k$0.20$0.20
gradient_ai gradient_ai/anthropic-claude-3.5-haiku
anthropic-​claude-​3.5-​haiku
$0.80$4.00
gradient_ai gradient_ai/anthropic-claude-3.5-sonnet
anthropic-​claude-​3.5-​sonnet
$3.00$15.00
gradient_ai gradient_ai/anthropic-claude-3.7-sonnet
anthropic-​claude-​3.7-​sonnet
$3.00$15.00
gradient_ai gradient_ai/anthropic-claude-3-opus
anthropic-​claude-​3-​opus
$15.00$75.00
openrouter thedrummer/​anubis-​70b-​v1.1
anubis-​70b-​v1.1
131k$0.65$1.00
vertex_ai-chat-models, palm chat-bison (2 endpoints)8k$0.125$0.125
vertex_ai-chat-models, palm chat-bison@001 (2 endpoints)8k$0.125$0.125
vertex_ai-chat-models chat-​bison@002ℹ️8k$0.125$0.125
vertex_ai-chat-models chat-​bison-​32kℹ️32k$0.125$0.125
vertex_ai-chat-models chat-​bison-​32k@002ℹ️32k$0.125$0.125
nlp_cloud chatdolphin 16k$0.50$0.50
bedrock, vercel_ai_gateway, openrouter, anthropic, vertex_ai-anthropic_models claude-3.5-haiku (11 endpoints)200k$0.25$1.00$1.25$5.0028%
bedrock claude-3-5-sonnet-20241022-v2.0 (4 endpoints)200k$3.00$15.00
vertex_ai-anthropic_models claude-3-5-sonnet-v2 (2 endpoints)200k$3.00$15.00
vercel_ai_gateway, vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-haiku (11 endpoints)200k$0.25$0.30$1.25$1.50
vercel_ai_gateway, vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-opus (9 endpoints)200k$15.00$75.00
vertex_ai-anthropic_models, bedrock claude-3-sonnet (6 endpoints)200k$3.00$15.00
vercel_ai_gateway, deepinfra claude-4-opus (2 endpoints)200k$15.00$16.50$75.00$82.50
vercel_ai_gateway, deepinfra claude-4-sonnet (2 endpoints)200k$3.00$3.30$15.00$16.50
bedrock claude-instant-v1 (5 endpoints)100k$0.80$2.48$2.40$8.38
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-opus-4-1 (7 endpoints)200k$15.00$75.00
openrouter, vertex_ai-anthropic_models, anthropic, bedrock_converse claude-sonnet-4.5 (7 endpoints)200k1000k$3.00
>200k: $6.00
$15.00
>200k: $22.50
bedrock claude-v1 (5 endpoints)100k$8.00$24.00
bedrock claude-v2.1 (5 endpoints)100k$8.00$24.00
vertex_ai-code-text-models code-​bisonℹ️6k$0.125$0.125
vertex_ai-code-chat-models codechat-​bisonℹ️6k$0.125$0.125
vertex_ai-code-chat-models codechat-​bison@001ℹ️6k$0.125$0.125
vertex_ai-code-chat-models codechat-​bison@002ℹ️6k$0.125$0.125
vertex_ai-code-chat-models codechat-​bison-​32kℹ️32k$0.125$0.125
vertex_ai-code-chat-models codechat-​bison-​32k@002ℹ️32k$0.125$0.125
vertex_ai-code-chat-models codechat-​bison@latestℹ️6k$0.125$0.125
perplexity, anyscale codellama-34b-instruct (2 endpoints)4k16k$0.35$1.00$1.00$1.40
perplexity, anyscale codellama-70b-instruct (2 endpoints)4k16k$0.70$1.00$1.00$2.80
cloudflare cloudflare/​@hf/​thebloke/​codellama-​7b-​instruct-​awq
codellama-​7b-​instruct-​awq
4k$1.923$1.923
openrouter alfredpros/​codellama-​7b-​instruct-​solidity
codellama-​7b-​instruct-​solidity
4k$0.80$1.20
openrouter arcee-​ai/​coder-​large
coder-​large
32k$0.50$0.80
vercel_ai_gateway vercel_ai_gateway/mistral/codestral-embed
codestral-​embed
0$0.15$0.00
vertex_ai-mistral_models vertex_ai/​codestral@latest
codestral@latest
128k$0.20$0.60
mistral mistral/​codestral-​mamba-​latestℹ️
codestral-​mamba-​latest
256k$0.25$0.25
openrouter openai/​codex-​mini
codex-​mini
200k$1.50$6.00
openrouter deepcogito/​cogito-​v2-​preview-​deepseek-​671b
cogito-​v2-​preview-​deepseek-​671b
163k$1.25$1.25
openrouter deepcogito/​cogito-​v2-​preview-​llama-​109b-​moe
cogito-​v2-​preview-​llama-​109b-​moe
32k$0.18$0.59
vercel_ai_gateway, openrouter, cohere_chat command-a (3 endpoints)256k$2.50$10.00
cohere_chat command-light 4k$0.30$0.60
bedrock cohere.​command-​light-​text-​v14
command-​light-​text-​v14
4k$0.30$0.60
cohere_chat, vercel_ai_gateway, openrouter, bedrock command-r (5 endpoints)128k$0.15$0.50$0.60$1.50
openrouter, cohere_chat command-r7b-12-2024 (2 endpoints)128k$0.0375$0.15$0.0375$0.15
cohere_chat, vercel_ai_gateway, openrouter, azure, bedrock command-r-plus (6 endpoints)128k$2.50$3.00$10.00$15.00
bedrock cohere.​command-​text-​v14
command-​text-​v14
4k$1.50$2.00
azure computer-use-preview (2 endpoints)8k$3.00$12.00
openrouter thedrummer/​cydonia-​24b-​v4.1
cydonia-​24b-​v4.1
131k$0.15$0.50
databricks databricks/​databricks-​claude-​3-​7-​sonnetℹ️
databricks-​claude-​3-​7-​sonnet
200k$2.50$17.857
databricks databricks/​databricks-​llama-​2-​70b-​chatℹ️
databricks-​llama-​2-​70b-​chat
4k$0.50$1.50
databricks databricks/​databricks-​llama-​4-​maverickℹ️
databricks-​llama-​4-​maverick
128k$5.00$15.00
databricks databricks/​databricks-​meta-​llama-​3-​1-​405b-​instructℹ️
databricks-​meta-​llama-​3-​1-​405b-​instruct
128k$5.00$15.00
databricks databricks/​databricks-​meta-​llama-​3-​3-​70b-​instructℹ️
databricks-​meta-​llama-​3-​3-​70b-​instruct
128k$1.00$3.00
databricks databricks/​databricks-​meta-​llama-​3-​70b-​instructℹ️
databricks-​meta-​llama-​3-​70b-​instruct
128k$1.00$3.00
databricks databricks/​databricks-​mixtral-​8x7b-​instructℹ️
databricks-​mixtral-​8x7b-​instruct
4k$0.50$0.999
databricks databricks/​databricks-​mpt-​30b-​instructℹ️
databricks-​mpt-​30b-​instruct
8k$0.999$0.999
databricks databricks/​databricks-​mpt-​7b-​instructℹ️
databricks-​mpt-​7b-​instruct
8k$0.50$0.00
openrouter deepcoder-14b-preview (2 endpoints)96k$0.00$0.015$0.00$0.015
openrouter deephermes-3-llama-3-8b-preview (2 endpoints)131k$0.00$0.03$0.00$0.11
openrouter nousresearch/​deephermes-​3-​mistral-​24b-​preview
deephermes-​3-​mistral-​24b-​preview
32k$0.15$0.59
deepseek deepseek-​chatℹ️131k$0.60$1.70
openrouter deepseek-chat-v3.1 (2 endpoints)163k163k$0.00$0.20$0.00$0.80
deepseek deepseek/​deepseek-​coder
deepseek-​coder
128k$0.14$0.28
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​deepseek-​coder-​v2-​instructℹ️
deepseek-​coder-​v2-​instruct
65k$1.20$1.20
lambda_ai, lambda deepseek-llama3.3-70b (2 endpoints)131k131k$0.20$0.60
openrouter deepseek/​deepseek-​prover-​v2
deepseek-​prover-​v2
163k$0.50$2.18
vertex_ai-deepseek_models vertex_ai/deepseek-ai/deepseek-r1-0528-maasℹ️
deepseek-​r1-​0528-​maas
65k$1.35$5.40
openrouter deepseek-r1-0528-qwen3-8b (2 endpoints)32k131k$0.00$0.03$0.00$0.11
together_ai together_ai/​deepseek-​ai/​DeepSeek-​R1-​0528-​tputℹ️
deepseek-​r1-​0528-​tput
128k$0.55$2.19
deepinfra deepinfra/​deepseek-​ai/​DeepSeek-​R1-​0528-​Turbo
deepseek-​r1-​0528-​turbo
32k$1.00$3.00
lambda_ai lambda_ai/deepseek-r1-671b
deepseek-​r1-​671b
131k$0.80$0.80
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​deepseek-​r1-​basicℹ️
deepseek-​r1-​basic
128k$0.55$2.19
openrouter, deepinfra, sambanova, vercel_ai_gateway, ovhcloud, groq, nscale, gradient_ai deepseek-r1-distill-llama-70b (9 endpoints)8k131k$0.00$0.99$0.00$1.40
openrouter, nscale deepseek-r1-distill-llama-8b (2 endpoints)32k$0.025$0.04$0.025$0.04
openrouter, nscale deepseek-r1-distill-qwen-14b (2 endpoints)32k$0.07$0.15$0.07$0.15
nscale nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5Bℹ️
deepseek-​r1-​distill-​qwen-​1.5b
$0.09$0.09
deepinfra, openrouter, nscale deepseek-r1-distill-qwen-32b (3 endpoints)131k$0.15$0.27$0.15$0.27
nscale nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7Bℹ️
deepseek-​r1-​distill-​qwen-​7b
$0.20$0.20
openrouter deepseek-r1t2-chimera (2 endpoints)163k$0.00$0.30$0.00$1.20
openrouter deepseek-r1t-chimera (2 endpoints)163k$0.00$0.30$0.00$1.20
deepinfra deepinfra/​deepseek-​ai/​DeepSeek-​R1-​Turbo
deepseek-​r1-​turbo
40k$1.00$3.00
deepseek deepseek-​reasonerℹ️131k$0.60$1.70
deepinfra, wandb, sambanova, together_ai deepseek-v3.1 (4 endpoints)32k163k$0.27$55,000.00$1.00$165,000.00
vertex_ai-deepseek_models vertex_ai/deepseek-ai/deepseek-v3.1-maasℹ️
deepseek-​v3.1-​maas
163k$1.35$5.40
openrouter, deepinfra deepseek-v3.1-terminus (2 endpoints)163k$0.23$0.27$0.90$1.00
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​deepseek-​v3p1ℹ️
deepseek-​v3p1
128k$0.56$1.68
openrouter, mistral devstral-medium (2 endpoints)128k131k$0.40$2.00
openrouter, vercel_ai_gateway, mistral devstral-small (6 endpoints)32k131k$0.00$0.10$0.00$0.30
openrouter dolphin3.0-mistral-24b (2 endpoints)32k$0.00$0.04$0.00$0.17
openrouter cognitivecomputations/​dolphin3.0-​r1-​mistral-​24b:free
dolphin3.0-​r1-​mistral-​24b
32k$0.00$0.00
openrouter cognitivecomputations/​dolphin-​mistral-​24b-​venice-​edition:free
dolphin-​mistral-​24b-​venice-​edition
32k$0.00$0.00
vercel_ai_gateway vercel_ai_gateway/cohere/embed-v4.0
embed-​v4.0
0$0.12$0.00
openrouter baidu/​ernie-​4.5-​21b-​a3b
ernie-​4.5-​21b-​a3b
120k$0.07$0.28
openrouter baidu/​ernie-​4.5-​300b-​a47b
ernie-​4.5-​300b-​a47b
123k$0.28$1.10
openrouter baidu/​ernie-​4.5-​vl-​28b-​a3b
ernie-​4.5-​vl-​28b-​a3b
30k$0.14$0.56
openrouter baidu/​ernie-​4.5-​vl-​424b-​a47b
ernie-​4.5-​vl-​424b-​a47b
123k$0.42$1.25
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​firefunction-​v2ℹ️
firefunction-​v2
8k$0.90$0.90
vertex_ai-language-models gemini-​1.0-​proℹ️32k$0.50$1.50
vertex_ai-language-models gemini-​1.0-​pro-​001ℹ️32k$0.50$1.50
vertex_ai-language-models gemini-​1.0-​pro-​002ℹ️32k$0.50$1.50
vertex_ai-vision-models gemini-​1.0-​pro-​visionℹ️16k$0.50$1.50
vertex_ai-vision-models gemini-​1.0-​pro-​vision-​001ℹ️16k$0.50$1.50
vertex_ai-language-models gemini-​1.0-​ultraℹ️8k$0.50$1.50
vertex_ai-language-models gemini-​1.0-​ultra-​001ℹ️8k$0.50$1.50
gemini, vertex_ai-language-models gemini-1.5-flash (3 endpoints)1000k1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
gemini, vertex_ai-language-models gemini-1.5-flash-001 (2 endpoints)1000k1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
vertex_ai-language-models, gemini gemini-1.5-flash-002 (2 endpoints)1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
gemini gemini/​gemini-​1.5-​flash-​8bℹ️
gemini-​1.5-​flash-​8b
1048k$0.00
>128k: $0.00
$0.00
>128k: $0.00
gemini, vertex_ai-language-models gemini-1.5-flash-exp-0827 (2 endpoints)1000k1048k$0.00$0.0047
>128k: $0.00$1.00
$0.00$0.0047
>128k: $0.00$0.0094
vertex_ai-language-models gemini-​1.5-​flash-​preview-​0514ℹ️1000k$0.075
>128k: $1.00
$0.0047
>128k: $0.0094
vertex_ai-language-models, gemini gemini-1.5-pro (3 endpoints)1048k2097k$1.25$3.50
>128k: $2.50$7.00
$1.05$10.50
>128k: $10.00$21.00
gemini, vertex_ai-language-models gemini-1.5-pro-001 (2 endpoints)1000k2097k$1.25$3.50
>128k: $2.50$7.00
$5.00$10.50
>128k: $10.00$21.00
vertex_ai-language-models, gemini gemini-1.5-pro-002 (2 endpoints)2097k$1.25$3.50
>128k: $2.50$7.00
$5.00$10.50
>128k: $10.00$21.00
vertex_ai-language-models gemini-1.5-pro-preview-0215 (3 endpoints)1000k$0.0781
>128k: $0.1563
$0.3125
>128k: $0.625
gemini, openrouter, vertex_ai-language-models, deepinfra gemini-2.0-flash-001 (4 endpoints)1000k1048k$0.10$0.15$0.40$0.60
openrouter google/​gemini-​2.0-​flash-​exp:free
gemini-​2.0-​flash-​exp
1048k$0.00$0.0022.2%
vertex_ai-language-models, openrouter gemini-2.0-flash-lite-001 (2 endpoints)1048k$0.075$0.30
gemini gemini/​gemini-​2.0-​flash-​lite-​preview-​02-​05ℹ️
gemini-​2.0-​flash-​lite-​preview-​02-​05
1048k$0.075$0.30
gemini gemini/​gemini-​2.0-​flash-​live-​001ℹ️
gemini-​2.0-​flash-​live-​001
1048k$0.35$1.50
vertex_ai-language-models gemini-​2.0-​flash-​live-​preview-​04-​09ℹ️1048k$0.50$2.00
vertex_ai-language-models, gemini gemini-2.0-flash-preview-image-generation (2 endpoints)1048k$0.10$0.40
vertex_ai-language-models, gemini gemini-2.0-flash-thinking-exp (4 endpoints)1048k$0.00
>128k: $0.00
$0.00
>128k: $0.00
18.2%
vertex_ai-language-models, gemini, openrouter, deepinfra, vercel_ai_gateway gemini-2.5-flash (5 endpoints)1000k1048k$0.30$2.50
openrouter google/​gemini-​2.5-​flash-​image-​preview
gemini-​2.5-​flash-​image-​preview
32k$0.30$2.50
vertex_ai-language-models, gemini, openrouter gemini-2.5-flash-lite (3 endpoints)1048k$0.10$0.40
vertex_ai-language-models, gemini, openrouter gemini-2.5-flash-lite-preview-06-17 (6 endpoints)1048k$0.10$0.40
gemini gemini/​gemini-​2.5-​flash-​preview-​ttsℹ️
gemini-​2.5-​flash-​preview-​tts
1048k$0.15$0.60
vertex_ai-language-models, gemini, openrouter, vercel_ai_gateway, deepinfra gemini-2.5-pro (5 endpoints)1000k1048k$1.25$2.50
>200k: $2.50
$10.00
>200k: $15.00
vertex_ai-language-models, gemini gemini-2.5-pro-preview-tts (2 endpoints)1048k$1.25
>200k: $2.50
$10.00
>200k: $15.00
vertex_ai-language-models gemini-​flash-​experimentalℹ️1000k$0.00$0.00
gemini gemini/​gemini-​flash-​latestℹ️
gemini-​flash-​latest
1048k$0.30$2.50
gemini gemini/​gemini-​flash-​lite-​latestℹ️
gemini-​flash-​lite-​latest
1048k$0.10$0.40
gemini gemini/​gemini-​gemma-​2-​27b-​itℹ️
gemini-​gemma-​2-​27b-​it
$0.35$1.05
gemini gemini/​gemini-​gemma-​2-​9b-​itℹ️
gemini-​gemma-​2-​9b-​it
$0.35$1.05
gemini, vertex_ai-language-models gemini-pro (2 endpoints)32k$0.35$0.50
>128k: $0.70
$1.05$1.50
>128k: $2.10
vertex_ai-language-models gemini-​pro-​experimentalℹ️1000k$0.00$0.00
gemini, vertex_ai-vision-models gemini-pro-vision (2 endpoints)16k30k$0.35$0.50
>128k: $0.70
$1.05$1.50
>128k: $2.10
openrouter google/​gemma-​2-​27b-​it
gemma-​2-​27b-​it
8k$0.65$0.65
vercel_ai_gateway vercel_ai_gateway/google/gemma-2-9b
gemma-​2-​9b
8k$0.20$0.20
openrouter, groq gemma-2-9b-it (3 endpoints)8k$0.00$0.20$0.00$0.20
openrouter, deepinfra gemma-3-12b-it (3 endpoints)32k131k$0.00$0.05$0.00$0.13
gemini, deepinfra, openrouter gemma-3-27b-it (4 endpoints)96k131k$0.00$0.09
>128k: $0.00
$0.00$0.16
>128k: $0.00
4.9%
deepinfra, openrouter gemma-3-4b-it (3 endpoints)32k131k$0.00$0.04$0.00$0.08
openrouter google/​gemma-​3n-​e2b-​it:free
gemma-​3n-​e2b-​it
8k$0.00$0.00
openrouter gemma-3n-e4b-it (2 endpoints)8k32k$0.00$0.02$0.00$0.04
groq, anyscale gemma-7b-it (2 endpoints)8k$0.07$0.15$0.07$0.15
openrouter thudm/​glm-​4.1v-​9b-​thinking
glm-​4.1v-​9b-​thinking
65k$0.035$0.138
openrouter z-​ai/​glm-​4-​32b
glm-​4-​32b
128k$0.10$0.10
openrouter, deepinfra, vercel_ai_gateway, wandb glm-4.5 (4 endpoints)131k$0.35$55,000.00$1.55$200,000.00
openrouter, vercel_ai_gateway glm-4.5-air (3 endpoints)128k131k$0.00$0.20$0.00$1.10
together_ai together_ai/​zai-​org/​GLM-​4.5-​Air-​FP8ℹ️
glm-​4.5-​air-​fp8
128k$0.20$1.10
openrouter z-​ai/​glm-​4.5v
glm-​4.5v
65k$0.60$1.80
openrouter z-​ai/​glm-​4.6
glm-​4.6
202k$0.50$1.75
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​glm-​4p5ℹ️
glm-​4p5
128k$0.55$2.19
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​glm-​4p5-​airℹ️
glm-​4p5-​air
128k$0.22$0.88
openrouter thudm/​glm-​z1-​32b
glm-​z1-​32b
32k$0.05$0.22
openrouter alpindale/​goliath-​120b
goliath-​120b
6k$4.00$5.50
openai, vercel_ai_gateway, openrouter, azure gpt-3.5-turbo (13 endpoints)4k16k$0.20$1.50$1.50$2.00
azure, openai, openrouter gpt-35-turbo-16k (4 endpoints)16k$3.00$4.00
vercel_ai_gateway, openrouter gpt-3.5-turbo-instruct (2 endpoints)4k8k$1.50$2.00
azure, openai, openrouter gpt-4 (6 endpoints)8k8k$30.00$60.00
azure, openai gpt-4-0125-preview (2 endpoints)128k$10.00$30.00
azure, openai, openrouter gpt-4-1106-preview (3 endpoints)128k$10.00$30.00
azure, openai, vercel_ai_gateway, openrouter gpt-4.1-mini (6 endpoints)1047k$0.40$1.6032.4%
azure, openai, vercel_ai_gateway, openrouter gpt-4.1-nano (6 endpoints)1047k$0.10$0.408.9%
azure gpt-4-32k (2 endpoints)32k$60.00$120.00
openai, openrouter, azure gpt-4o-audio-preview (6 endpoints)128k$2.50$10.00
openrouter openai/​gpt-​4o:extended
gpt-​4o:extended
128k$6.00$18.00
openai, azure gpt-4o-mini-audio-preview (3 endpoints)128k$0.15$2.50$0.60$10.00
openai, azure gpt-4o-mini-realtime-preview (5 endpoints)128k$0.60$0.66$2.40$2.64
openai, openrouter gpt-4o-mini-search-preview (3 endpoints)128k$0.15$0.60
openai, azure gpt-4o-realtime-preview (10 endpoints)128k$5.00$5.50$20.00$22.00
openai, openrouter gpt-4o-search-preview (3 endpoints)128k$2.50$10.00
azure, openai, vercel_ai_gateway, openrouter gpt-4-turbo (6 endpoints)128k$10.00$30.00
openai, openrouter gpt-4-turbo-preview (2 endpoints)128k$10.00$30.00
azure azure/​gpt-​4-​turbo-​vision-​preview
gpt-​4-​turbo-​vision-​preview
128k$10.00$30.00
azure, openrouter, openai gpt-5-chat (4 endpoints)128k272k$1.25$10.00
openrouter openai/​gpt-​5-​codex
gpt-​5-​codex
400k$1.25$10.00
openrouter, azure, openai gpt-5-mini (5 endpoints)272k400k$0.25$2.00
openrouter, azure, openai gpt-5-nano (5 endpoints)272k400k$0.05$0.40
openrouter, deepinfra, fireworks_ai, groq, cerebras, sambanova, wandb, ovhcloud, together_ai gpt-oss-120b (9 endpoints)128k131k$0.04$15,000.00$0.40$60,000.0041.8%
bedrock_converse openai.​gpt-​oss-​120b-​1:0
gpt-​oss-​120b-​1:0
128k$0.15$0.60
vertex_ai-openai_models vertex_ai/openai/gpt-oss-120b-maasℹ️
gpt-​oss-​120b-​maas
131k$0.15$0.60
openrouter, deepinfra, fireworks_ai, groq, wandb, ovhcloud, together_ai gpt-oss-20b (8 endpoints)128k131k$0.00$5,000.00$0.00$20,000.00
bedrock_converse openai.​gpt-​oss-​20b-​1:0
gpt-​oss-​20b-​1:0
128k$0.07$0.30
vertex_ai-openai_models vertex_ai/openai/gpt-oss-20b-maasℹ️
gpt-​oss-​20b-​maas
131k$0.075$0.30
openai gpt-realtime (2 endpoints)32k$4.00$16.00
watsonx watsonx/​ibm/​granite-​3-​8b-​instruct
granite-​3-​8b-​instruct
8k$200.00$200.00
vercel_ai_gateway, xai grok-2 (4 endpoints)131k$2.00$10.00
vercel_ai_gateway, xai grok-2-vision (4 endpoints)32k$2.00$10.00
oci, azure_ai, vercel_ai_gateway, xai, openrouter grok-3 (7 endpoints)131k$3.00$3.30$0.15$16.50
oci, vercel_ai_gateway, xai grok-3-fast (3 endpoints)131k$5.00$25.00
xai xai/​grok-​3-​fast-​betaℹ️
grok-​3-​fast-​beta
131k$5.00$25.00
azure_ai, oci, vercel_ai_gateway, xai, openrouter grok-3-mini (7 endpoints)131k$0.25$0.30$0.50$1.38
oci, vercel_ai_gateway, xai grok-3-mini-fast (4 endpoints)131k$0.60$4.00
xai xai/​grok-​3-​mini-​fast-​betaℹ️
grok-​3-​mini-​fast-​beta
131k$0.60$4.00
openrouter x-​ai/​grok-​4-​fast
grok-​4-​fast
2000k$0.20$0.50
xai, azure_ai grok-4-fast-non-reasoning (2 endpoints)131k2000k$0.20$0.43$0.50$1.73
xai, azure_ai grok-4-fast-reasoning (2 endpoints)131k2000k$0.20$0.43$0.50$1.73
xai xai/​grok-​beta
grok-​beta
131k$5.00$15.00
xai xai/​grok-​code-​fastℹ️
grok-​code-​fast
256k$0.20$1.50
xai, openrouter, azure_ai grok-code-fast-1 (4 endpoints)131k256k$0.20$3.50$1.50$17.50
xai xai/​grok-​vision-​beta
grok-​vision-​beta
8k$5.00$15.00
openrouter nousresearch/​hermes-​2-​pro-​llama-​3-​8b
hermes-​2-​pro-​llama-​3-​8b
32k$0.025$0.08
lambda_ai, lambda hermes3-405b (2 endpoints)131k131k$0.80$0.80
lambda_ai, lambda hermes3-70b (2 endpoints)131k131k$0.12$0.30
lambda_ai, lambda hermes3-8b (2 endpoints)131k131k$0.025$0.04
deepinfra, openrouter hermes-3-llama-3.1-405b (2 endpoints)131k$1.00$1.00
deepinfra, openrouter, hyperbolic hermes-3-llama-3.1-70b (3 endpoints)32k131k$0.12$0.30$0.30
openrouter nousresearch/​hermes-​4-​405b
hermes-​4-​405b
131k$0.30$1.20
openrouter nousresearch/​hermes-​4-​70b
hermes-​4-​70b
131k$0.11$0.38
openrouter hunyuan-a13b-instruct (2 endpoints)32k$0.00$0.03$0.00$0.03
openrouter inflection/​inflection-​3-​pi
inflection-​3-​pi
8k$2.50$10.00
openrouter inflection/​inflection-​3-​productivity
inflection-​3-​productivity
8k$2.50$10.00
openrouter opengvlab/​internvl3-​78b
internvl3-​78b
32k$0.07$0.26
bedrock ai21.​j2-​mid-​v1
j2-​mid-​v1
8k$12.50$12.50
bedrock ai21.​j2-​ultra-​v1
j2-​ultra-​v1
8k$18.80$18.80
azure_ai azure_ai/​jais-​30b-​chatℹ️
jais-​30b-​chat
8k$3,200.00$9,710.00
ai21, vertex_ai-ai21_models jamba-1.5 (2 endpoints)256k$0.20$0.40
ai21, vertex_ai-ai21_models, bedrock jamba-1.5-large (3 endpoints)256k$2.00$8.00
ai21, vertex_ai-ai21_models jamba-1.5-large@001 (2 endpoints)256k$2.00$8.00
ai21, vertex_ai-ai21_models, bedrock jamba-1.5-mini (3 endpoints)256k$0.20$0.40
ai21, vertex_ai-ai21_models jamba-1.5-mini@001 (2 endpoints)256k$0.20$0.40
azure_ai, bedrock jamba-instruct (2 endpoints)70k$0.50$0.70
ai21 jamba-large-1.6 256k$2.00$8.00
ai21, openrouter jamba-large-1.7 (2 endpoints)256k$2.00$8.00
ai21 jamba-mini-1.6 256k$0.20$0.40
ai21, openrouter jamba-mini-1.7 (2 endpoints)256k$0.20$0.40
openrouter kimi-dev-72b (2 endpoints)131k$0.00$0.29$0.00$1.15
moonshot moonshot/kimi-k2-0711-previewℹ️
kimi-​k2-​0711-​preview
131k$0.60$2.50
deepinfra, groq, fireworks_ai, hyperbolic, wandb, together_ai kimi-k2-instruct (8 endpoints)128k262k$0.50$135,000.00$2.00$400,000.00
moonshot moonshot/kimi-latestℹ️
kimi-​latest
131k$2.00$5.00
moonshot moonshot/kimi-latest-128kℹ️
kimi-​latest-​128k
131k$2.00$5.00
moonshot moonshot/kimi-latest-32kℹ️
kimi-​latest-​32k
32k$1.00$3.00
moonshot moonshot/kimi-latest-8kℹ️
kimi-​latest-​8k
8k$0.20$2.00
moonshot moonshot/kimi-thinking-previewℹ️
kimi-​thinking-​preview
131k$30.00$30.00
openrouter moonshotai/​kimi-​vl-​a3b-​thinking:free
kimi-​vl-​a3b-​thinking
131k$0.00$0.00
deepinfra deepinfra/​Sao10K/​L3.1-​70B-​Euryale-​v2.2
l3.1-​70b-​euryale-​v2.2
131k$0.65$0.75
openrouter sao10k/​l3.1-​euryale-​70b
l3.1-​euryale-​70b
32k$0.65$0.75
deepinfra deepinfra/​Sao10K/​L3.3-​70B-​Euryale-​v2.3
l3.3-​70b-​euryale-​v2.3
131k$0.65$0.75
openrouter sao10k/​l3.3-​euryale-​70b
l3.3-​euryale-​70b
131k$0.65$0.75
deepinfra deepinfra/​Sao10K/​L3-​8B-​Lunaris-​v1-​Turbo
l3-​8b-​lunaris-​v1-​turbo
8k$0.04$0.05
openrouter sao10k/​l3-​euryale-​70b
l3-​euryale-​70b
8k$1.48$1.48
openrouter sao10k/​l3-​lunaris-​8b
l3-​lunaris-​8b
8k$0.04$0.05
gemini gemini/​learnlm-​1.5-​pro-​experimentalℹ️
learnlm-​1.5-​pro-​experimental
32k$0.00
>128k: $0.00
$0.00
>128k: $0.00
openrouter liquid/​lfm-​3b
lfm-​3b
32k$0.02$0.02
lambda_ai, lambda lfm-40b (2 endpoints)66k131k$0.10$0.15$0.15$0.20
lambda_ai, openrouter lfm-7b (2 endpoints)32k131k$0.01$0.025$0.01$0.04
replicate replicate/​meta/​llama-​2-​13b
llama-​2-​13b
4k$0.10$0.50
replicate, anyscale llama-2-13b-chat (2 endpoints)4k$0.10$0.25$0.25$0.50
bedrock meta.​llama2-​13b-​chat-​v1
llama2-​13b-​chat-​v1
4k$0.75$1.00
replicate, groq llama-2-70b (2 endpoints)4k$0.65$0.70$0.80$2.75
replicate, perplexity, anyscale llama-2-70b-chat (3 endpoints)4k$0.65$1.00$1.00$2.80
bedrock meta.​llama2-​70b-​chat-​v1
llama2-​70b-​chat-​v1
4k$1.95$2.56
replicate replicate/​meta/​llama-​2-​7b
llama-​2-​7b
4k$0.05$0.25
replicate, anyscale llama-2-7b-chat (2 endpoints)4k$0.05$0.15$0.15$0.25
cloudflare cloudflare/​@cf/​meta/​llama-​2-​7b-​chat-​fp16
llama-​2-​7b-​chat-​fp16
3k$1.923$1.923
cloudflare cloudflare/​@cf/​meta/​llama-​2-​7b-​chat-​int8
llama-​2-​7b-​chat-​int8
2k$1.923$1.923
openrouter meta-​llama/​llama-​3.1-​405b
llama-​3.1-​405b
32k$2.00$2.00
bedrock, oci, openrouter llama-3.1-405b-instruct (4 endpoints)32k128k$0.80$10.68$0.80$16.00
lambda_ai, lambda llama3.1-405b-instruct-fp8 (2 endpoints)131k131k$0.80$0.80
vertex_ai-llama_models vertex_ai/​meta/​llama-​3.1-​405b-​instruct-​maasℹ️
llama-​3.1-​405b-​instruct-​maas
128k$5.00$16.00
groq groq/​llama-​3.1-​405b-​reasoning
llama-​3.1-​405b-​reasoning
8k$0.59$0.79
cerebras, vercel_ai_gateway llama3.1-70b (2 endpoints)128k$0.60$0.72$0.60$0.72
openrouter, perplexity, bedrock llama-3.1-70b-instruct (4 endpoints)128k131k$0.40$1.00$0.40$1.00
lambda_ai, lambda llama3.1-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
vertex_ai-llama_models vertex_ai/​meta/​llama-​3.1-​70b-​instruct-​maasℹ️
llama-​3.1-​70b-​instruct-​maas
128k$0.00$0.00
groq groq/​llama-​3.1-​70b-​versatile
llama-​3.1-​70b-​versatile
8k$0.59$0.79
vercel_ai_gateway, cerebras llama-3.1-8b (2 endpoints)128k131k$0.05$0.10$0.08$0.10
groq groq/​llama-​3.1-​8b-​instant
llama-​3.1-​8b-​instant
128k$0.05$0.08
lambda_ai, perplexity, lambda, ovhcloud, bedrock, wandb, openrouter, nscale llama3.1-8b-instruct (9 endpoints)16k131k$0.02$22,000.00$0.03$22,000.00
vertex_ai-llama_models vertex_ai/​meta/​llama-​3.1-​8b-​instruct-​maasℹ️
llama-​3.1-​8b-​instruct-​maas
128k$0.00$0.00
openrouter neversleep/​llama-​3.1-​lumimaid-​8b
llama-​3.1-​lumimaid-​8b
32k$0.09$0.60
deepinfra, openrouter llama-3.1-nemotron-70b-instruct (2 endpoints)131k$0.60$0.60
lambda_ai, lambda llama3.1-nemotron-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
openrouter nvidia/​llama-​3.1-​nemotron-​ultra-​253b-​v1
llama-​3.1-​nemotron-​ultra-​253b-​v1
131k$0.60$1.80
perplexity perplexity/​llama-​3.1-​sonar-​huge-​128k-​online
llama-​3.1-​sonar-​huge-​128k-​online
127k$5.00$5.00
perplexity perplexity/​llama-​3.1-​sonar-​large-​128k-​chat
llama-​3.1-​sonar-​large-​128k-​chat
131k$1.00$1.00
perplexity perplexity/​llama-​3.1-​sonar-​large-​128k-​online
llama-​3.1-​sonar-​large-​128k-​online
127k$1.00$1.00
perplexity perplexity/​llama-​3.1-​sonar-​small-​128k-​chat
llama-​3.1-​sonar-​small-​128k-​chat
131k$0.20$0.20
perplexity perplexity/​llama-​3.1-​sonar-​small-​128k-​online
llama-​3.1-​sonar-​small-​128k-​online
127k$0.20$0.20
vercel_ai_gateway vercel_ai_gateway/meta/llama-3.2-11b
llama-​3.2-​11b
128k$0.16$0.16
bedrock llama3-2-11b-instruct-v1.0 (2 endpoints)128k$0.35$0.35
groq groq/​llama-​3.2-​11b-​text-​preview
llama-​3.2-​11b-​text-​preview
8k$0.18$0.18
lambda_ai, deepinfra, openrouter, azure_ai llama3.2-11b-vision-instruct (4 endpoints)128k131k$0.015$0.37$0.025$0.37
groq groq/​llama-​3.2-​11b-​vision-​preview
llama-​3.2-​11b-​vision-​preview
8k$0.18$0.18
vercel_ai_gateway vercel_ai_gateway/meta/llama-3.2-1b
llama-​3.2-​1b
128k$0.10$0.10
openrouter, bedrock llama-3.2-1b-instruct (4 endpoints)128k131k$0.005$0.13$0.01$0.13
groq groq/​llama-​3.2-​1b-​preview
llama-​3.2-​1b-​preview
8k$0.04$0.04
vercel_ai_gateway vercel_ai_gateway/meta/llama-3.2-3b
llama-​3.2-​3b
128k$0.15$0.15
openrouter, lambda_ai, deepinfra, lambda, bedrock, hyperbolic llama-3.2-3b-instruct (9 endpoints)16k131k$0.00$0.19$0.00$0.30
groq groq/​llama-​3.2-​3b-​preview
llama-​3.2-​3b-​preview
8k$0.06$0.06
vercel_ai_gateway vercel_ai_gateway/meta/llama-3.2-90b
llama-​3.2-​90b
128k$0.72$0.72
bedrock llama3-2-90b-instruct-v1.0 (2 endpoints)128k$2.00$2.00
groq groq/​llama-​3.2-​90b-​text-​preview
llama-​3.2-​90b-​text-​preview
8k$0.90$0.90
oci, azure_ai, openrouter llama-3.2-90b-vision-instruct (3 endpoints)32k128k$0.35$2.04$0.40$2.04
vertex_ai-llama_models vertex_ai/​meta/​llama-​3.2-​90b-​vision-​instruct-​maasℹ️
llama-​3.2-​90b-​vision-​instruct-​maas
128k$0.00$0.00
groq groq/​llama-​3.2-​90b-​vision-​preview
llama-​3.2-​90b-​vision-​preview
8k$0.90$0.90
vercel_ai_gateway, cerebras llama-3.3-70b (2 endpoints)128k$0.72$0.85$0.72$1.20
openrouter, hyperbolic, deepinfra, azure_ai, oci, bedrock_converse, wandb, nscale, gradient_ai llama-3.3-70b-instruct (11 endpoints)65k131k$0.00$71,000.00$0.00$71,000.00
lambda_ai, lambda llama3.3-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
deepinfra, together_ai llama-3.3-70b-instruct-turbo (2 endpoints)131k$0.13$0.88$0.39$0.88
together_ai together_ai/​meta-​llama/​Llama-​3.3-​70B-​Instruct-​Turbo-​Free
llama-​3.3-​70b-​instruct-​turbo-​free
$0.00$0.00
groq groq/​llama-​3.3-​70b-​specdec
llama-​3.3-​70b-​specdec
8k$0.59$0.99
groq groq/​llama-​3.3-​70b-​versatile
llama-​3.3-​70b-​versatile
128k$0.59$0.79
openrouter meta-​llama/​llama-​3.3-​8b-​instruct:free
llama-​3.3-​8b-​instruct
128k$0.00$0.00
deepinfra deepinfra/​nvidia/​Llama-​3.3-​Nemotron-​Super-​49B-​v1.5
llama-​3.3-​nemotron-​super-​49b-​v1.5
131k$0.10$0.40
vertex_ai-llama_models vertex_ai/​meta/​llama3-​405b-​instruct-​maasℹ️
llama3-​405b-​instruct-​maas
32k$0.00$0.00
vercel_ai_gateway, replicate llama-3-70b (2 endpoints)8k$0.59$0.65$0.79$2.75
openrouter, replicate, bedrock llama-3-70b-instruct (12 endpoints)8k8k$0.30$4.45$0.40$5.88
vertex_ai-llama_models vertex_ai/​meta/​llama3-​70b-​instruct-​maasℹ️
llama3-​70b-​instruct-​maas
32k$0.00$0.00
vercel_ai_gateway, replicate llama-3-8b (2 endpoints)8k8k$0.05$0.08$0.25
openrouter, bedrock, replicate, gradient_ai llama-3-8b-instruct (13 endpoints)8k8k$0.03$0.50$0.06$2.65
vertex_ai-llama_models vertex_ai/​meta/​llama3-​8b-​instruct-​maasℹ️
llama3-​8b-​instruct-​maas
32k$0.00$0.00
groq groq/​llama3-​groq-​70b-​8192-​tool-​use-​preview
llama3-​groq-​70b-​8192-​tool-​use-​preview
8k$0.89$0.89
groq groq/​llama3-​groq-​8b-​8192-​tool-​use-​preview
llama3-​groq-​8b-​8192-​tool-​use-​preview
8k$0.19$0.19
groq, sambanova llama-4-maverick-17b-128e-instruct (2 endpoints)131k$0.20$0.63$0.60$1.80
deepinfra, azure_ai, oci, lambda_ai, together_ai llama-4-maverick-17b-128e-instruct-fp8 (5 endpoints)131k1048k$0.05$1.41$0.10$0.85
vertex_ai-llama_models vertex_ai/​meta/​llama-​4-​maverick-​17b-​128e-​instruct-​maasℹ️
llama-​4-​maverick-​17b-​128e-​instruct-​maas
1000k$0.35$1.15
vertex_ai-llama_models vertex_ai/​meta/​llama-​4-​maverick-​17b-​16e-​instruct-​maasℹ️
llama-​4-​maverick-​17b-​16e-​instruct-​maas
1000k$0.35$1.15
bedrock_converse llama4-maverick-17b-instruct-v1.0 (2 endpoints)128k$0.24$0.97
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama4-​maverick-​instruct-​basicℹ️
llama4-​maverick-​instruct-​basic
131k$0.22$0.88
vertex_ai-llama_models vertex_ai/​meta/​llama-​4-​scout-​17b-​128e-​instruct-​maasℹ️
llama-​4-​scout-​17b-​128e-​instruct-​maas
10000k$0.25$0.70
azure_ai, deepinfra, oci, groq, wandb, lambda_ai, sambanova, nscale, together_ai llama-4-scout-17b-16e-instruct (9 endpoints)8k10000k$0.05$17,000.00$0.10$66,000.00
vertex_ai-llama_models vertex_ai/​meta/​llama-​4-​scout-​17b-​16e-​instruct-​maasℹ️
llama-​4-​scout-​17b-​16e-​instruct-​maas
10000k$0.25$0.70
bedrock_converse llama4-scout-17b-instruct-v1.0 (2 endpoints)128k$0.17$0.66
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama4-​scout-​instruct-​basicℹ️
llama4-​scout-​instruct-​basic
131k$0.15$0.60
openrouter meta-​llama/​llama-​guard-​2-​8b
llama-​guard-​2-​8b
8k$0.20$0.20
openrouter, deepinfra, groq llama-guard-3-8b (3 endpoints)8k131k$0.02$0.20$0.055$0.20
deepinfra, openrouter llama-guard-4-12b (2 endpoints)163k$0.18$0.18
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p1-​405b-​instructℹ️
llama-​v3p1-​405b-​instruct
128k$3.00$3.00
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p1-​8b-​instructℹ️
llama-​v3p1-​8b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p2-​11b-​vision-​instructℹ️
llama-​v3p2-​11b-​vision-​instruct
16k$0.20$0.20
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p2-​1b-​instructℹ️
llama-​v3p2-​1b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p2-​3b-​instructℹ️
llama-​v3p2-​3b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​llama-​v3p2-​90b-​vision-​instructℹ️
llama-​v3p2-​90b-​vision-​instruct
16k$0.90$0.90
ovhcloud ovhcloud/​llava-​v1.6-​mistral-​7b-​hfℹ️
llava-​v1.6-​mistral-​7b
32k$0.29$0.29
openrouter eleutherai/​llemma_7b
llemma_7b
4k$0.80$1.20
openrouter longcat-flash-chat (2 endpoints)131k$0.00$0.15$0.00$0.75
aleph_alpha luminous-​base-​control$37.50$41.25
aleph_alpha luminous-​extended-​control$56.25$61.875
aleph_alpha luminous-​supreme-​control$218.75$240.625
openrouter arcee-​ai/​maestro-​reasoning
maestro-​reasoning
131k$0.90$3.30
vercel_ai_gateway, openrouter, mistral magistral-medium (4 endpoints)40k128k$2.00$5.00
openrouter mistralai/​magistral-​medium-​2506:thinking
magistral-​medium-​2506:thinking
40k$2.00$5.00
vercel_ai_gateway, mistral, openrouter magistral-small (4 endpoints)40k128k$0.50$1.50
openrouter anthracite-​org/​magnum-​v2-​72b
magnum-​v2-​72b
32k$3.00$3.00
openrouter anthracite-​org/​magnum-​v4-​72b
magnum-​v4-​72b
16k$2.50$5.00
openrouter mai-ds-r1 (2 endpoints)163k$0.00$0.30$0.00$1.20
ovhcloud ovhcloud/​mamba-​codestral-​7B-​v0.1ℹ️
mamba-​codestral-​7b-​v0.1
256k$0.19$0.19
openrouter inception/​mercury
mercury
128k$0.25$1.00
openrouter inception/​mercury-​coder
mercury-​coder
128k$0.25$1.00
vercel_ai_gateway vercel_ai_gateway/​inception/​mercury-​coder-​small
mercury-​coder-​small
32k$0.25$1.00
ovhcloud ovhcloud/​Meta-​Llama-​3_1-​70B-​Instructℹ️
meta-​llama-​3_1-​70b-​instruct
131k$0.67$0.67
ovhcloud ovhcloud/​Meta-​Llama-​3_3-​70B-​Instructℹ️
meta-​llama-​3_3-​70b-​instruct
131k$0.67$0.67
azure_ai, hyperbolic, sambanova meta-llama-3.1-405b-instruct (3 endpoints)16k128k$0.12$5.33$0.30$16.00
together_ai together_ai/​meta-​llama/​Meta-​Llama-​3.1-​405B-​Instruct-​Turbo
meta-​llama-​3.1-​405b-​instruct-​turbo
$3.50$3.50
deepinfra, azure_ai, hyperbolic, friendliai meta-llama-3.1-70b-instruct (4 endpoints)8k131k$0.12$2.68$0.30$3.54
deepinfra, together_ai meta-llama-3.1-70b-instruct-turbo (2 endpoints)131k$0.10$0.88$0.28$0.88
deepinfra, azure_ai, hyperbolic, sambanova, friendliai meta-llama-3.1-8b-instruct (5 endpoints)8k131k$0.03$0.30$0.05$0.61
deepinfra, together_ai meta-llama-3.1-8b-instruct-turbo (2 endpoints)131k$0.02$0.18$0.03$0.18
sambanova sambanova/​Meta-​Llama-​3.2-​1B-​Instructℹ️
meta-​llama-​3.2-​1b-​instruct
16k$0.04$0.08
sambanova sambanova/​Meta-​Llama-​3.2-​3B-​Instructℹ️
meta-​llama-​3.2-​3b-​instruct
4k$0.08$0.16
sambanova sambanova/​Meta-​Llama-​3.3-​70B-​Instructℹ️
meta-​llama-​3.3-​70b-​instruct
131k$0.60$1.20
hyperbolic, anyscale, azure_ai meta-llama-3-70b-instruct (3 endpoints)8k131k$0.12$1.10$0.30$1.00
deepinfra, anyscale meta-llama-3-8b-instruct (2 endpoints)8k$0.03$0.15$0.06$0.15
sambanova sambanova/​Meta-​Llama-​Guard-​3-​8Bℹ️
meta-​llama-​guard-​3-​8b
16k$0.30$0.30
sagemaker sagemaker/​meta-​textgeneration-​llama-​2-​13b-​f
meta-​textgeneration-​llama-​2-​13b-​f
4k$0.00$0.00
sagemaker sagemaker/​meta-​textgeneration-​llama-​2-​70b-​b-​f
meta-​textgeneration-​llama-​2-​70b-​b-​f
4k$0.00$0.00
sagemaker sagemaker/​meta-​textgeneration-​llama-​2-​7b-​f
meta-​textgeneration-​llama-​2-​7b-​f
4k$0.00$0.00
openrouter minimax/​minimax-​01
minimax-​01
1000k$0.20$1.10
openrouter minimax/​minimax-​m1
minimax-​m1
1000k$0.30$1.65
azure_ai, vercel_ai_gateway, openrouter ministral-3b (3 endpoints)32k128k$0.04$0.04
vercel_ai_gateway, openrouter ministral-8b (2 endpoints)128k$0.10$0.10
openrouter, perplexity mistral-7b-instruct (3 endpoints)4k32k$0.00$0.07$0.00$0.28
anyscale, cloudflare, openrouter mistral-7b-instruct-v0.1 (3 endpoints)2k16k$0.11$1.923$0.15$1.923
bedrock, replicate mistral-7b-instruct-v0.2 (5 endpoints)4k32k$0.05$0.20$0.20$0.26
ovhcloud, openrouter mistral-7b-instruct-v0.3 (2 endpoints)32k127k$0.028$0.10$0.054$0.10
replicate replicate/​mistralai/​mistral-​7b-​v0.1
mistral-​7b-​v0.1
4k$0.05$0.25
vercel_ai_gateway vercel_ai_gateway/​mistral/​mistral-​embed
mistral-​embed
0$0.10$0.00
openrouter, watsonx, azure_ai, vertex_ai-mistral_models, mistral, bedrock, vercel_ai_gateway, azure mistral-large (21 endpoints)32k131k$2.00$10.40$6.00$31.20
vertex_ai-mistral_models vertex_ai/​mistral-​large@2411-​001
mistral-​large@2411-​001
128k$2.00$6.00
vertex_ai-mistral_models vertex_ai/​mistral-​large@latest
mistral-​large@latest
128k$2.00$6.00
azure_ai, mistral mistral-medium-latest (5 endpoints)32k131k$0.40$2.70$2.00$8.10
openrouter mistralai/​mistral-​medium-​3
mistral-​medium-​3
131k$0.40$2.00
openrouter mistralai/​mistral-​medium-​3.1
mistral-​medium-​3.1
131k$0.40$2.00
openrouter, azure_ai, vertex_ai-mistral_models mistral-nemo (4 endpoints)128k131k$0.00$3.00$0.00$3.00
deepinfra, ovhcloud, gradient_ai mistral-nemo-instruct-2407 (3 endpoints)118k131k$0.02$0.30$0.04$0.30
vertex_ai-mistral_models vertex_ai/​mistral-​nemo@latest
mistral-​nemo@latest
128k$0.15$0.15
openrouter mistralai/​mistral-​saba
mistral-​saba
32k$0.20$0.60
vercel_ai_gateway, groq mistral-saba-24b (2 endpoints)32k32k$0.79$0.79
openrouter, deepinfra mistral-small-24b-instruct-2501 (3 endpoints)32k$0.00$0.05$0.00$0.08
vertex_ai-mistral_models vertex_ai/​mistral-​small-​2503@001
mistral-​small-​2503@001
32k$1.00$3.00
openrouter mistral-small-3.1-24b-instruct (2 endpoints)128k$0.00$0.05$0.00$0.10
openrouter, deepinfra, ovhcloud mistral-small-3.2-24b-instruct (4 endpoints)128k131k$0.00$0.09$0.00$0.28
openrouter, mistral mistral-tiny (2 endpoints)32k32k$0.25$0.25
openrouter, fireworks_ai, vercel_ai_gateway mixtral-8x22b-instruct (3 endpoints)65k$0.90$1.20$0.90$1.20
anyscale, nscale mixtral-8x22b-instruct-v0.1 (2 endpoints)65k$0.60$0.90$0.60$0.90
groq groq/​mixtral-​8x7b-​32768
mixtral-​8x7b-​32768
32k$0.24$0.24
openrouter, perplexity mixtral-8x7b-instruct (2 endpoints)4k32k$0.07$0.54$0.28$0.54
deepinfra, bedrock, ovhcloud, anyscale, replicate, together_ai mixtral-8x7b-instruct-v0.1 (9 endpoints)4k32k$0.15$0.63$0.15$1.00
openrouter allenai/​molmo-​7b-​d
molmo-​7b-​d
4k$0.10$0.20
moonshot moonshot-v1-128k (2 endpoints)131k$2.00$5.00
moonshot moonshot/​moonshot-​v1-​128k-​vision-​previewℹ️
moonshot-​v1-​128k-​vision-​preview
131k$2.00$5.00
moonshot moonshot-v1-32k (2 endpoints)32k$1.00$3.00
moonshot moonshot/​moonshot-​v1-​32k-​vision-​previewℹ️
moonshot-​v1-​32k-​vision-​preview
32k$1.00$3.00
moonshot moonshot-v1-8k (2 endpoints)8k$0.20$2.00
moonshot moonshot/​moonshot-​v1-​8k-​vision-​previewℹ️
moonshot-​v1-​8k-​vision-​preview
8k$0.20$2.00
moonshot moonshot/​moonshot-​v1-​autoℹ️
moonshot-​v1-​auto
131k$2.00$5.00
openrouter, vercel_ai_gateway, morph morph-v3-fast (3 endpoints)16k81k$0.80$1.20
openrouter, vercel_ai_gateway, morph morph-v3-large (3 endpoints)16k81k$0.90$1.90
openrouter, deepinfra mythomax-l2-13b (2 endpoints)4k$0.05$0.08$0.09
openrouter nemotron-nano-9b-v2 (2 endpoints)128k131k$0.00$0.04$0.00$0.16
openrouter neversleep/​noromaid-​20b
noromaid-​20b
4k$1.00$1.75
vercel_ai_gateway, bedrock_converse nova-lite (5 endpoints)300k$0.06$0.078$0.24$0.312
openrouter amazon/​nova-​lite-​v1
nova-​lite-​v1
300k$0.06$0.24
vercel_ai_gateway, bedrock_converse nova-micro (5 endpoints)128k$0.035$0.046$0.14$0.184
openrouter amazon/​nova-​micro-​v1
nova-​micro-​v1
128k$0.035$0.14
bedrock_converse us.​amazon.​nova-​premier-​v1:0
nova-​premier-​v1.0
1000k$2.50$12.50
vercel_ai_gateway, bedrock_converse, bedrock nova-pro (7 endpoints)300k$0.80$1.05$3.20$4.20
openrouter amazon/​nova-​pro-​v1
nova-​pro-​v1
300k$0.80$3.20
deepinfra deepinfra/​nvidia/​NVIDIA-​Nemotron-​Nano-​9B-​v2
nvidia-​nemotron-​nano-​9b-​v2
131k$0.04$0.16
azure o1-preview (4 endpoints)128k$15.00$16.50$60.00$66.00
openrouter allenai/​olmo-​2-​0325-​32b-​instruct
olmo-​2-​0325-​32b-​instruct
4k$0.20$0.35
deepinfra deepinfra/​allenai/​olmOCR-​7B-​0725-​FP8
olmocr-​7b-​0725-​fp8
16k$0.27$1.50
gradient_ai gradient_ai/​openai-​o3
openai-​o3
$2.00$8.00
gradient_ai gradient_ai/​openai-​o3-​mini
openai-​o3-​mini
$1.10$4.40
mistral mistral/​open-​codestral-​mambaℹ️
open-​codestral-​mamba
256k$0.25$0.25
mistral mistral/​open-​mistral-​7b
open-​mistral-​7b
32k$0.25$0.25
mistral open-mistral-nemo (2 endpoints)128k$0.30$0.30
mistral mistral/​open-​mixtral-​8x22b
open-​mixtral-​8x22b
65k$2.00$6.00
mistral mistral/​open-​mixtral-​8x7b
open-​mixtral-​8x7b
32k$0.70$0.70
bedrock pegasus-1-2-v1.0 (3 endpoints)N/A$7.50
openrouter microsoft/​phi-​3.5-​mini-​128k-​instruct
phi-​3.5-​mini-​128k-​instruct
128k$0.10$0.10
azure_ai azure_ai/​Phi-​3.5-​mini-​instructℹ️
phi-​3.5-​mini-​instruct
128k$0.13$0.52
azure_ai azure_ai/​Phi-​3.5-​MoE-​instructℹ️
phi-​3.5-​moe-​instruct
128k$0.16$0.64
azure_ai azure_ai/​Phi-​3.5-​vision-​instructℹ️
phi-​3.5-​vision-​instruct
128k$0.13$0.52
azure_ai, openrouter phi-3-medium-128k-instruct (2 endpoints)128k$0.17$1.00$0.68$1.00
azure_ai azure_ai/​Phi-​3-​medium-​4k-​instructℹ️
phi-​3-​medium-​4k-​instruct
4k$0.17$0.68
openrouter, azure_ai phi-3-mini-128k-instruct (2 endpoints)128k$0.10$0.13$0.10$0.52
azure_ai azure_ai/​Phi-​3-​mini-​4k-​instructℹ️
phi-​3-​mini-​4k-​instruct
4k$0.13$0.52
azure_ai azure_ai/​Phi-​3-​small-​128k-​instructℹ️
phi-​3-​small-​128k-​instruct
128k$0.15$0.60
azure_ai azure_ai/​Phi-​3-​small-​8k-​instructℹ️
phi-​3-​small-​8k-​instruct
8k$0.15$0.60
openrouter, deepinfra, azure_ai phi-4 (3 endpoints)16k$0.06$0.125$0.14$0.50
azure_ai, wandb phi-4-mini-instruct (2 endpoints)128k131k$0.075$8,000.00$0.30$35,000.00
openrouter, azure_ai phi-4-multimodal-instruct (2 endpoints)131k$0.05$0.08$0.10$0.32
openrouter microsoft/​phi-​4-​reasoning-​plus
phi-​4-​reasoning-​plus
32k$0.07$0.35
vercel_ai_gateway, mistral, openrouter pixtral-12b (3 endpoints)32k128k$0.10$0.15$0.10$0.15
openrouter, vercel_ai_gateway, mistral, bedrock_converse pixtral-large (6 endpoints)128k131k$2.00$6.00
perplexity perplexity/​pplx-​70b-​chat
pplx-​70b-​chat
4k$0.70$2.80
perplexity perplexity/​pplx-​70b-​online
pplx-​70b-​online
4k$0.00$2.80
perplexity perplexity/​pplx-​7b-​chat
pplx-​7b-​chat
8k$0.07$0.28
perplexity perplexity/​pplx-​7b-​online
pplx-​7b-​online
4k$0.00$0.28
hyperbolic, openrouter, deepinfra qwen2.5-72b-instruct (4 endpoints)32k131k$0.00$0.12$0.00$0.39
openrouter, deepinfra qwen-2.5-7b-instruct (2 endpoints)32k65k$0.04$0.10
lambda_ai, lambda, openrouter, hyperbolic, ovhcloud, nscale qwen25-coder-32b-instruct (7 endpoints)32k131k$0.00$0.87$0.00$0.8716.4%
nscale nscale/​Qwen/​Qwen2.5-​Coder-​3B-​Instructℹ️
qwen2.5-​coder-​3b-​instruct
$0.01$0.03
nscale nscale/​Qwen/​Qwen2.5-​Coder-​7B-​Instructℹ️
qwen2.5-​coder-​7b-​instruct
$0.01$0.03
deepinfra, openrouter qwen2.5-vl-32b-instruct (3 endpoints)8k128k$0.00$0.20$0.00$0.60
openrouter, ovhcloud qwen2.5-vl-72b-instruct (3 endpoints)32k131k$0.00$0.91$0.00$0.91
openrouter qwen/​qwen-​2.5-​vl-​7b-​instruct
qwen-​2.5-​vl-​7b-​instruct
32k$0.20$0.20
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​qwen2-​72b-​instructℹ️
qwen2-​72b-​instruct
32k$0.90$0.90
sambanova sambanova/​Qwen2-​Audio-​7B-​Instructℹ️
qwen2-​audio-​7b-​instruct
4k$0.50$100.00
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​qwen2p5-​coder-​32b-​instructℹ️
qwen2p5-​coder-​32b-​instruct
4k$0.90$0.90
openrouter, deepinfra, vercel_ai_gateway qwen3-14b (4 endpoints)40k$0.00$0.08$0.00$0.24
vercel_ai_gateway vercel_ai_gateway/​alibaba/​qwen-​3-​235b
qwen-​3-​235b
40k$0.20$0.60
openrouter, bedrock_converse, hyperbolic, deepinfra qwen3-235b-a22b (6 endpoints)40k262k$0.00$2.00$0.00$2.00
together_ai together_ai/​Qwen/​Qwen3-​235B-​A22B-​fp8-​tputℹ️
qwen3-​235b-​a22b-​fp8-​tput
40k$0.20$0.60
deepinfra, wandb qwen3-235b-a22b-instruct-2507 (2 endpoints)262k$0.09$10,000.00$0.60$10,000.00
vertex_ai-qwen_models vertex_ai/​qwen/​qwen3-​235b-​a22b-​instruct-​2507-​maasℹ️
qwen3-​235b-​a22b-​instruct-​2507-​maas
262k$0.25$1.00
together_ai together_ai/​Qwen/​Qwen3-​235B-​A22B-​Instruct-​2507-​tputℹ️
qwen3-​235b-​a22b-​instruct-​2507-​tput
262k$0.20$6.00
openrouter, deepinfra, wandb, together_ai qwen3-235b-a22b-thinking-2507 (4 endpoints)256k262k$0.11$10,000.00$0.60$10,000.00
vercel_ai_gateway vercel_ai_gateway/​alibaba/​qwen-​3-​30b
qwen-​3-​30b
40k$0.10$0.30
openrouter, deepinfra qwen3-30b-a3b (3 endpoints)40k$0.00$0.08$0.00$0.29
openrouter qwen/​qwen3-​30b-​a3b-​instruct-​2507
qwen3-​30b-​a3b-​instruct-​2507
262k$0.08$0.33
openrouter qwen/​qwen3-​30b-​a3b-​thinking-​2507
qwen3-​30b-​a3b-​thinking-​2507
262k$0.08$0.29
bedrock_converse, groq, cerebras, openrouter, deepinfra, vercel_ai_gateway, ovhcloud, sambanova qwen3-32b (8 endpoints)8k131k$0.05$0.40$0.20$0.8040%
lambda_ai lambda_ai/​qwen3-​32b-​fp8
qwen3-​32b-​fp8
131k$0.05$0.10
openrouter qwen/​qwen3-​4b:free
qwen3-​4b
40k$0.00$0.00
openrouter qwen3-8b (2 endpoints)40k128k$0.00$0.035$0.00$0.138
openrouter, vercel_ai_gateway qwen3-coder (3 endpoints)262k$0.00$0.40$0.00$1.60
openrouter qwen/​qwen3-​coder-​30b-​a3b-​instruct
qwen3-​coder-​30b-​a3b-​instruct
262k$0.06$0.25
lemonade lemonade/​Qwen3-​Coder-​30B-​A3B-​Instruct-​GGUF
qwen3-​coder-​30b-​a3b-​instruct-​gguf
32k$0.00$0.00
bedrock_converse qwen.​qwen3-​coder-​30b-​a3b-​v1:0
qwen3-​coder-​30b-​a3b-​v1.0
262k$0.15$0.60
deepinfra, wandb qwen3-coder-480b-a35b-instruct (2 endpoints)262k$0.40$100,000.00$1.60$150,000.00
together_ai together_ai/​Qwen/​Qwen3-​Coder-​480B-​A35B-​Instruct-​FP8ℹ️
qwen3-​coder-​480b-​a35b-​instruct-​fp8
256k$2.00$2.00
vertex_ai-qwen_models vertex_ai/​qwen/​qwen3-​coder-​480b-​a35b-​instruct-​maasℹ️
qwen3-​coder-​480b-​a35b-​instruct-​maas
262k$1.00$4.00
deepinfra deepinfra/​Qwen/​Qwen3-​Coder-​480B-​A35B-​Instruct-​Turbo
qwen3-​coder-​480b-​a35b-​instruct-​turbo
262k$0.29$1.20
bedrock_converse qwen.​qwen3-​coder-​480b-​a35b-​v1:0
qwen3-​coder-​480b-​a35b-​v1.0
262k$0.22$1.80
openrouter qwen/​qwen3-​coder-​flash
qwen3-​coder-​flash
128k$0.30$1.50
openrouter qwen/​qwen3-​coder-​plus
qwen3-​coder-​plus
128k$1.00$5.00
openrouter qwen/​qwen3-​max
qwen3-​max
256k$1.20$6.00
openrouter, deepinfra qwen3-next-80b-a3b-instruct (2 endpoints)262k$0.10$0.14$0.80$1.40
vertex_ai-qwen_models vertex_ai/​qwen/​qwen3-​next-​80b-​a3b-​instruct-​maasℹ️
qwen3-​next-​80b-​a3b-​instruct-​maas
262k$0.15$1.20
openrouter, deepinfra qwen3-next-80b-a3b-thinking (2 endpoints)262k$0.14$1.20$1.40
vertex_ai-qwen_models vertex_ai/​qwen/​qwen3-​next-​80b-​a3b-​thinking-​maasℹ️
qwen3-​next-​80b-​a3b-​thinking-​maas
262k$0.15$1.20
openrouter qwen/​qwen3-​vl-​235b-​a22b-​instruct
qwen3-​vl-​235b-​a22b-​instruct
131k$0.30$1.50
openrouter qwen/​qwen3-​vl-​235b-​a22b-​thinking
qwen3-​vl-​235b-​a22b-​thinking
65k$0.50$3.50
dashscope dashscope/​qwen-​coderℹ️
qwen-​coder
1000k$0.30$1.50
openrouter, dashscope qwen-max (2 endpoints)30k32k$1.60$6.40
openrouter, dashscope qwen-plus (6 endpoints)129k1000k$0.40$1.20
openrouter qwen/​qwen-​plus-​2025-​07-​28:thinking
qwen-​plus-​2025-​07-​28:thinking
1000k$0.40$4.00
openrouter, dashscope qwen-turbo (5 endpoints)129k1000k$0.05$0.20
openrouter qwen/​qwen-​vl-​max
qwen-​vl-​max
7k$0.80$3.20
openrouter qwen/​qwen-​vl-​plus
qwen-​vl-​plus
7k$0.21$0.63
deepinfra, hyperbolic, openrouter, sambanova, nscale qwq-32b (5 endpoints)16k131k$0.15$0.50$0.20$1.0020.9%
openrouter qwq-32b-arliai-rpr-v1 (2 endpoints)32k$0.00$0.03$0.00$0.11
dashscope dashscope/​qwq-​plusℹ️
qwq-​plus
98k$0.80$2.40
openrouter perplexity/​r1-​1776
r1-​1776
128k$2.00$8.00
openrouter relace/​relace-​apply-​3
relace-​apply-​3
256k$0.85$1.25
openrouter undi95/​remm-​slerp-​l2-​13b
remm-​slerp-​l2-​13b
6k$0.45$0.65
openrouter thedrummer/​rocinante-​12b
rocinante-​12b
32k$0.17$0.43
openrouter switchpoint/​router
router
131k$0.85$3.40
openrouter shisa-v2-llama3.3-70b (2 endpoints)32k$0.00$0.05$0.00$0.22
openrouter thedrummer/​skyfall-​36b-​v2
skyfall-​36b-​v2
32k$0.08$0.33
perplexity, openrouter, vercel_ai_gateway sonar (3 endpoints)127k128k$1.00$1.00
perplexity, openrouter sonar-deep-research (2 endpoints)128k$2.00$8.00
perplexity perplexity/​sonar-​medium-​chat
sonar-​medium-​chat
16k$0.60$1.80
perplexity perplexity/​sonar-​medium-​online
sonar-​medium-​online
12k$0.00$1.80
perplexity, vercel_ai_gateway, openrouter sonar-pro (3 endpoints)200k$3.00$15.00
perplexity, vercel_ai_gateway, openrouter sonar-reasoning (3 endpoints)127k128k$1.00$5.00
perplexity, openrouter, vercel_ai_gateway sonar-reasoning-pro (3 endpoints)127k128k$2.00$8.00
perplexity perplexity/​sonar-​small-​chat
sonar-​small-​chat
16k$0.07$0.28
perplexity perplexity/​sonar-​small-​online
sonar-​small-​online
12k$0.00$0.28
openrouter raifle/​sorcererlm-​8x22b
sorcererlm-​8x22b
16k$4.50$4.50
openrouter arcee-​ai/​spotlight
spotlight
131k$0.18$0.18
openrouter stepfun-​ai/​step3
step3
65k$0.57$1.42
vercel_ai_gateway vercel_ai_gateway/​amazon/​titan-​embed-​text-​v2
titan-​embed-​text-​v2
0$0.02$0.00
bedrock titan-text-express-v1 (3 endpoints)42k$1.30$1.70
bedrock titan-text-lite-v1 (3 endpoints)42k$0.30$0.40
bedrock titan-text-premier-v1.0 (3 endpoints)42k$0.50$1.50
together_ai together-​ai-​21.1b-​41b$0.80$0.80
together_ai together-​ai-​41.1b-​80b$0.90$0.90
together_ai together-​ai-​4.1b-​8b$0.20$0.20
together_ai together-​ai-​81.1b-​110b$1.80$1.80
together_ai together-​ai-​8.1b-​21b$0.30$0.30
together_ai together-​ai-​up-​to-​4b$0.10$0.10
openrouter tongyi-deepresearch-30b-a3b (2 endpoints)131k$0.00$0.09$0.00$0.40
openrouter bytedance/​ui-​tars-​1.5-​7b
ui-​tars-​1.5-​7b
128k$0.10$0.20
openrouter thedrummer/​unslopnemo-​12b
unslopnemo-​12b
32k$0.40$0.40
v0, vercel_ai_gateway v0-1.0-md (2 endpoints)128k$3.00$15.00
v0 v0/​v0-​1.5-​lg
v0-​1.5-​lg
512k$15.00$75.00
v0, vercel_ai_gateway v0-1.5-md (2 endpoints)128k$3.00$15.00
bedrock_converse deepseek.​v3-​v1:0
v3-​v1.0
163k$0.58$1.68
openrouter arcee-​ai/​virtuoso-​large
virtuoso-​large
131k$0.75$1.20
openrouter mancer/​weaver
weaver
8k$1.125$1.125
deepinfra, openrouter wizardlm-2-8x22b (2 endpoints)65k$0.48$0.48
fireworks_ai fireworks_ai/​accounts/​fireworks/​models/​yi-​largeℹ️
yi-​large
32k$3.00$3.00
anyscale anyscale/​HuggingFaceH4/​zephyr-​7b-​beta
zephyr-​7b-​beta
16k$0.15$0.15