LLM Model Prices (per million tokens)

Thanks to the LiteLLM project, the model providers' websites and APIs, and assorted online sources for the source data. Most benchmark data comes from the Epoch AI benchmarking dashboard, provided by Epoch AI under a Creative Commons license; some comes from other sources. The Aider Polyglot data comes from the Aider website. Always check the original source for the most up-to-date information: the data may contain errors (some sources are rough), and matching benchmark scores to models is approximate.
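Because every price in the table is quoted in US dollars per million tokens, the cost of one request is tokens × price ÷ 1,000,000, summed over input and output. The sketch below is illustrative only: the function names are hypothetical, the rates are copied from the gpt-4.1 and gemini-2.5-pro rows of this table, and the tier rule (the `>200k` rate applies to the whole request once the prompt exceeds the threshold) is an assumption that should be checked against the provider's own pricing page.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Estimate the USD cost of one request from per-million-token prices.

    Prices are passed as strings (e.g. "2.00") so Decimal keeps them exact.
    """
    total = (Decimal(input_tokens) * Decimal(input_price)
             + Decimal(output_tokens) * Decimal(output_price))
    return total / TOKENS_PER_MILLION


def tiered_request_cost(input_tokens, output_tokens, base, long_context, threshold):
    """Same estimate for rows that list a second, long-context rate.

    ASSUMPTION: once the prompt exceeds `threshold` tokens, the long-context
    (input_price, output_price) pair applies to the whole request.
    """
    rates = long_context if input_tokens > threshold else base
    return request_cost(input_tokens, output_tokens, *rates)


# gpt-4.1 row: $2.00 input, $8.00 output per million tokens.
gpt41 = request_cost(10_000, 2_000, "2.00", "8.00")

# gemini-2.5-pro row: $1.25 / $10.00, and $2.50 / $15.00 above 200k tokens.
GEMINI_BASE = ("1.25", "10.00")
GEMINI_LONG = ("2.50", "15.00")
short_prompt = tiered_request_cost(100_000, 1_000, GEMINI_BASE, GEMINI_LONG, 200_000)
long_prompt = tiered_request_cost(250_000, 1_000, GEMINI_BASE, GEMINI_LONG, 200_000)

print(gpt41, short_prompt, long_prompt)
```

Where a row lists two prices for the same column, pass whichever one belongs to the endpoint you actually call.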

Showing 1179 models (536 rows displayed)
Provider | Model | Max Input Tokens | Input Token Price | Output Token Price | GPQA (Diamond) | MATH (Level 5) | OTIS Mock AIME 24-25 | Aider Polyglot
openai, openrouter chatgpt-4o-latest (2 endpoints) 128k $5.00 $15.00 45.3%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-5-sonnet (14 endpoints) 200k $3.00 $3.60 $15.00 $18.00 51.6%
openrouter, anthropic, vertex_ai-anthropic_models, bedrock_converse, bedrock claude-3.7-sonnet (8 endpoints) 200k $3.00 $15.00 64.9%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-opus-4 (7 endpoints) 200k $15.00 $75.00 72%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-sonnet-4 (8 endpoints) 200k $3.00 $15.00 61.3%
openrouter, vertex_ai-mistral_models, codestral, mistral codestral-latest (8 endpoints) 32k 262k $0.00 $1.00 $0.00 $3.00 11.1%
lambda, openrouter, fireworks_ai, lambda_ai, hyperbolic, azure_ai, bedrock_converse, together_ai, deepseek, sambanova deepseek-r1 (16 endpoints) 32k 164k $0.00 $5.00 $0.00 $8.00 71.4%
openrouter, fireworks_ai, lambda_ai, azure_ai, deepseek, together_ai, hyperbolic, sambanova deepseek-v3 (14 endpoints) 32k 163k $0.00 $3.00 $0.00 $4.56 55.1%
vertex_ai-language-models, gemini gemini-2.0-flash (2 endpoints)1048k$0.10$0.40
vertex_ai-language-models, gemini gemini-2.0-flash-lite (2 endpoints)1048k$0.075$0.30
gemini, vertex_ai-language-models gemini-2.5-flash-preview-04-17 (4 endpoints) 1048k $0.15 $0.30 $0.60 $2.50 55.1%
vertex_ai-language-models, gemini, openrouter gemini-2.5-pro-exp-03-25 (4 endpoints)1048k$0.00$1.25
>200k: $0.00$2.50
$0.00$10.00
>200k: $0.00$15.00
openrouter, vertex_ai-language-models, gemini gemini-2.5-pro-preview (7 endpoints)1048k$1.25
>200k: $2.50
$10.00
>200k: $15.00
83.1%
openai, azure, openrouter gpt-4.1 (5 endpoints) 1047k $2.00 $8.00 52.4%
azure azure/​gpt-​4.5-​preview
gpt-​4.5-​preview
128k $75.00 $150.00 44.9%
openai, azure, openrouter gpt-4o (20 endpoints) 128k $2.50 $5.00 $10.00 $15.00 23.1%
openai, azure, openrouter gpt-4o-mini (9 endpoints) 128k $0.15 $0.165 $0.60 $0.66 3.6%
xai, openrouter grok-3-beta (3 endpoints) 131k $3.00 $15.00 53.3%
xai, openrouter grok-3-mini-beta (3 endpoints) 131k $0.30 $0.50 49.3%
xai, openrouter, oci grok-4 (5 endpoints) 128k 256k $3.00 $0.15 $15.00 79.6%
openrouter kimi-k2 (2 endpoints) 32k 63k $0.00 $0.14 $0.00 $2.49 59.1%
openrouter, lambda llama-4-maverick (2 endpoints) 1000k 1048k $0.15 $0.20 $0.60 15.6%
openrouter, lambda llama-4-scout (2 endpoints)1000k1048k$0.08$0.10$0.30
azure_ai, vertex_ai-mistral_models, openrouter, mistral, bedrock mistral-small (7 endpoints)32k128k$0.10$1.00$0.30$3.00
openai, azure, openrouter o1 (7 endpoints) 200k $15.00 $16.50 $60.00 $66.00 61.7%
openai, openrouter, azure o1-mini (8 endpoints) 128k $1.10 $3.00 $4.40 $12.00 32.9%
openrouter openai/​o1-​pro
o1-​pro
200k$150.00$600.00
openai, azure, openrouter o3 (5 endpoints) 200k $2.00 $10.00 $8.00 $40.00 81.3%
openai, azure, openrouter o3-mini (8 endpoints) 200k $1.10 $1.21 $4.40 $4.84 60.4%
openrouter openai/​o3-​pro
o3-​pro
200k $20.00 $80.00 84.9%
openai, azure, openrouter o4-mini (6 endpoints) 200k $1.10 $4.40 72%
openrouter aion-​labs/​aion-​1.0
aion-​1.0
131k$4.00$8.00
openrouter aion-​labs/​aion-​1.0-​mini
aion-​1.0-​mini
131k$0.70$1.40
openrouter aion-​labs/​aion-​rp-​llama-​3.1-​8b
aion-​rp-​llama-​3.1-​8b
32k$0.20$0.20
deepinfra deepinfra/​deepinfra/​airoboros-​70b
airoboros-​70b
4k$0.70$0.90
deepinfra deepinfra/​jondurbin/​airoboros-​l2-​70b-​gpt4-​1.4.​1
airoboros-​l2-​70b-​gpt4-​1.4.​1
4k$0.70$0.90
gradient_ai gradient_ai/anthropic-claude-3.5-haiku
anthropic-​claude-​3.5-​haiku
$0.80$4.00
gradient_ai gradient_ai/anthropic-claude-3.5-sonnet
anthropic-​claude-​3.5-​sonnet
$3.00$15.00
gradient_ai gradient_ai/anthropic-claude-3.7-sonnet
anthropic-​claude-​3.7-​sonnet
$3.00$15.00
gradient_ai gradient_ai/anthropic-claude-3-opus
anthropic-​claude-​3-​opus
$15.00$75.00
openrouter thedrummer/​anubis-​70b-​v1.1
anubis-​70b-​v1.1
16k$0.40$0.70
openrouter thedrummer/​anubis-​pro-​105b-​v1
anubis-​pro-​105b-​v1
131k$0.50$1.00
vertex_ai-chat-models, palm chat-bison (2 endpoints)8k$0.125$0.125
vertex_ai-chat-models, palm chat-bison@001 (2 endpoints)8k$0.125$0.125
vertex_ai-chat-models chat-bison@002 8k $0.125 $0.125
vertex_ai-chat-models chat-bison-32k 32k $0.125 $0.125
vertex_ai-chat-models chat-bison-32k@002 32k $0.125 $0.125
nlp_cloud chatdolphin 16k $0.50 $0.50
bedrock, openrouter, anthropic, vertex_ai-anthropic_models claude-3.5-haiku (9 endpoints) 200k $0.25 $1.00 $1.25 $5.00 28%
bedrock claude-3-5-sonnet-20241022-v2.0 (4 endpoints)200k$3.00$15.00
vertex_ai-anthropic_models claude-3-5-sonnet-v2 (2 endpoints)200k$3.00$15.00
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-haiku (10 endpoints)200k$0.25$0.30$1.25$1.50
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-opus (8 endpoints)200k$15.00$75.00
vertex_ai-anthropic_models, bedrock claude-3-sonnet (6 endpoints)200k$3.00$15.00
bedrock claude-instant-v1 (5 endpoints)100k$0.80$2.48$2.40$8.38
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-opus-4-1 (7 endpoints)200k$15.00$75.00
bedrock claude-v1 (5 endpoints)100k$8.00$24.00
bedrock claude-v2 (5 endpoints)100k$8.00$24.00
bedrock claude-v2.1 (5 endpoints)100k$8.00$24.00
vertex_ai-code-text-models code-bison 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@001 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@002 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison-32k 32k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison-32k@002 32k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@latest 6k $0.125 $0.125
perplexity, deepinfra, anyscale codellama-34b-instruct (3 endpoints)4k16k$0.35$1.00$0.60$1.40
perplexity, anyscale codellama-70b-instruct (2 endpoints)4k16k$0.70$1.00$1.00$2.80
cloudflare cloudflare/​@hf/​thebloke/​codellama-​7b-​instruct-​awq
codellama-​7b-​instruct-​awq
4k$1.923$1.923
openrouter alfredpros/​codellama-​7b-​instruct-​solidity
codellama-​7b-​instruct-​solidity
8k$0.70$1.10
openrouter arcee-​ai/​coder-​large
coder-​large
32k$0.50$0.80
vertex_ai-mistral_models vertex_ai/​codestral@latest
codestral@latest
128k$0.20$0.60
mistral mistral/codestral-mamba-latest
codestral-​mamba-​latest
256k$0.25$0.25
openrouter openai/​codex-​mini
codex-​mini
200k$1.50$6.00
openrouter cohere/​command
command
4k$1.00$2.00
cohere_chat, openrouter command-a (2 endpoints)32k256k$2.00$2.50$8.00$10.00
cohere_chat command-light 4k $0.30 $0.60
bedrock cohere.​command-​light-​text-​v14
command-​light-​text-​v14
4k$0.30$0.60
cohere_chat, openrouter, bedrock command-r (6 endpoints)128k$0.15$0.50$0.60$1.50
openrouter, cohere_chat command-r7b-12-2024 (2 endpoints)128k$0.0375$0.15$0.0375$0.15
cohere_chat, openrouter, azure, bedrock command-r-plus (7 endpoints)128k$2.50$3.00$10.00$15.00
bedrock cohere.​command-​text-​v14
command-​text-​v14
4k$1.50$2.00
azure computer-use-preview (2 endpoints)8k$3.00$12.00
databricks databricks/databricks-claude-3-7-sonnet
databricks-​claude-​3-​7-​sonnet
200k$2.50$17.857
databricks databricks/databricks-dbrx-instruct
databricks-​dbrx-​instruct
32k$0.75$2.249
databricks databricks/databricks-llama-2-70b-chat
databricks-​llama-​2-​70b-​chat
4k$0.50$1.50
databricks databricks/databricks-llama-4-maverick
databricks-​llama-​4-​maverick
128k$5.00$15.00
databricks databricks/databricks-meta-llama-3-1-405b-instruct
databricks-​meta-​llama-​3-​1-​405b-​instruct
128k$5.00$15.00
databricks databricks/databricks-meta-llama-3-1-70b-instruct
databricks-​meta-​llama-​3-​1-​70b-​instruct
128k$1.00$3.00
databricks databricks/databricks-meta-llama-3-3-70b-instruct
databricks-​meta-​llama-​3-​3-​70b-​instruct
128k$1.00$3.00
databricks databricks/databricks-meta-llama-3-70b-instruct
databricks-​meta-​llama-​3-​70b-​instruct
128k$1.00$3.00
databricks databricks/databricks-mixtral-8x7b-instruct
databricks-​mixtral-​8x7b-​instruct
4k$0.50$0.999
databricks databricks/databricks-mpt-30b-instruct
databricks-​mpt-​30b-​instruct
8k$0.999$0.999
databricks databricks/databricks-mpt-7b-instruct
databricks-​mpt-​7b-​instruct
8k$0.50$0.00
openrouter deepcoder-14b-preview (2 endpoints)96k$0.00$0.015$0.00$0.015
openrouter nousresearch/​deephermes-​3-​llama-​3-​8b-​preview:free
deephermes-​3-​llama-​3-​8b-​preview
131k$0.00$0.00
openrouter nousresearch/​deephermes-​3-​mistral-​24b-​preview
deephermes-​3-​mistral-​24b-​preview
32k$0.0933$0.3734
openrouter deepseek/​deepseek-​chat-​v3.1
deepseek-​chat-​v3.1
163k$0.20$0.80
deepseek deepseek/​deepseek-​coder
deepseek-​coder
128k$0.14$0.28
fireworks_ai fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct
deepseek-​coder-​v2-​instruct
65k$1.20$1.20
lambda_ai, lambda deepseek-llama3.3-70b (2 endpoints)131k131k$0.20$0.60
openrouter deepseek/​deepseek-​prover-​v2
deepseek-​prover-​v2
163k$0.50$2.18
vertex_ai-deepseek_models vertex_ai/deepseek-ai/deepseek-r1-0528-maas
deepseek-​r1-​0528-​maas
65k$1.35$5.40
openrouter deepseek-r1-0528-qwen3-8b (2 endpoints)32k131k$0.00$0.01$0.00$0.02
together_ai together_ai/deepseek-ai/DeepSeek-R1-0528-tput
deepseek-​r1-​0528-​tput
128k$0.55$2.19
lambda_ai lambda_ai/deepseek-r1-671b
deepseek-​r1-​671b
131k$0.80$0.80
fireworks_ai fireworks_ai/accounts/fireworks/models/deepseek-r1-basic
deepseek-​r1-​basic
128k$0.55$2.19
openrouter, sambanova, groq, nscale, gradient_ai deepseek-r1-distill-llama-70b (6 endpoints)8k131k$0.00$0.99$0.00$1.40
openrouter, nscale deepseek-r1-distill-llama-8b (2 endpoints)32k$0.025$0.04$0.025$0.04
openrouter, nscale deepseek-r1-distill-qwen-14b (3 endpoints)64k$0.00$0.15$0.00$0.15
openrouter, nscale deepseek-r1-distill-qwen-1.5b (2 endpoints)131k$0.09$0.18$0.09$0.18
openrouter, nscale deepseek-r1-distill-qwen-32b (2 endpoints)131k$0.075$0.15$0.15
nscale nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
deepseek-​r1-​distill-​qwen-​7b
$0.20$0.20
openrouter tngtech/​deepseek-​r1t2-​chimera:free
deepseek-​r1t2-​chimera
163k$0.00$0.00
openrouter deepseek-r1t-chimera (2 endpoints)163k$0.00$0.1799$0.00$0.7201
openrouter deepseek/​deepseek-​v3.1-​base
deepseek-​v3.1-​base
163k$0.20$0.80
openrouter deepseek/​deepseek-​v3-​base
deepseek-​v3-​base
163k$0.1999$0.8001
openrouter, mistral devstral-medium (2 endpoints)128k131k$0.40$2.00
openrouter, mistral devstral-small (5 endpoints)32k131k$0.00$0.10$0.00$0.30
deepinfra deepinfra/​cognitivecomputations/​dolphin-​2.6-​mixtral-​8x7b
dolphin-​2.6-​mixtral-​8x7b
32k$0.27$0.27
openrouter dolphin3.0-mistral-24b (2 endpoints)32k$0.00$0.037$0.00$0.1482
openrouter dolphin3.0-r1-mistral-24b (2 endpoints)32k$0.00$0.01$0.00$0.0341
openrouter cognitivecomputations/​dolphin-​mistral-​24b-​venice-​edition:free
dolphin-​mistral-​24b-​venice-​edition
32k$0.00$0.00
openrouter cognitivecomputations/​dolphin-​mixtral-​8x22b
dolphin-​mixtral-​8x22b
16k$0.90$0.90
openrouter baidu/​ernie-​4.5-​21b-​a3b
ernie-​4.5-​21b-​a3b
120k$0.07$0.28
openrouter baidu/​ernie-​4.5-​300b-​a47b
ernie-​4.5-​300b-​a47b
123k$0.28$1.10
openrouter baidu/​ernie-​4.5-​vl-​28b-​a3b
ernie-​4.5-​vl-​28b-​a3b
30k$0.14$0.56
openrouter baidu/​ernie-​4.5-​vl-​424b-​a47b
ernie-​4.5-​vl-​424b-​a47b
123k$0.42$1.25
fireworks_ai fireworks_ai/accounts/fireworks/models/firefunction-v2
firefunction-​v2
8k$0.90$0.90
vertex_ai-language-models gemini-1.0-pro 32k $0.50 $1.50
vertex_ai-language-models gemini-1.0-pro-001 32k $0.50 $1.50
vertex_ai-language-models gemini-1.0-pro-002 32k $0.50 $1.50
vertex_ai-vision-models gemini-1.0-pro-vision 16k $0.50 $1.50
vertex_ai-vision-models gemini-1.0-pro-vision-001 16k $0.50 $1.50
vertex_ai-language-models gemini-1.0-ultra 8k $0.50 $1.50
vertex_ai-language-models gemini-1.0-ultra-001 8k $0.50 $1.50
gemini, vertex_ai-language-models gemini-1.5-flash (3 endpoints)1000k1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
gemini, vertex_ai-language-models gemini-1.5-flash-001 (2 endpoints)1000k1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
vertex_ai-language-models, gemini gemini-1.5-flash-002 (2 endpoints)1048k$0.075
>128k: $0.15$1.00
$0.30
>128k: $0.60
gemini gemini/gemini-1.5-flash-8b
gemini-​1.5-​flash-​8b
1048k$0.00
>128k: $0.00
$0.00
>128k: $0.00
gemini, vertex_ai-language-models gemini-1.5-flash-exp-0827 (2 endpoints)1000k1048k$0.00$0.0047
>128k: $0.00$1.00
$0.00$0.0047
>128k: $0.00$0.0094
vertex_ai-language-models gemini-1.5-flash-preview-0514 1000k $0.075
>128k: $1.00
$0.0047
>128k: $0.0094
vertex_ai-language-models, gemini gemini-1.5-pro (3 endpoints)1048k2097k$1.25$3.50
>128k: $2.50$7.00
$1.05$10.50
>128k: $10.00$21.00
gemini, vertex_ai-language-models gemini-1.5-pro-001 (2 endpoints)1000k2097k$1.25$3.50
>128k: $2.50$7.00
$5.00$10.50
>128k: $10.00$21.00
vertex_ai-language-models, gemini gemini-1.5-pro-002 (2 endpoints)2097k$1.25$3.50
>128k: $2.50$7.00
$5.00$10.50
>128k: $10.00$21.00
vertex_ai-language-models gemini-1.5-pro-preview-0215 (3 endpoints)1000k$0.0781
>128k: $0.1563
$0.3125
>128k: $0.625
gemini, openrouter, vertex_ai-language-models gemini-2.0-flash-001 (3 endpoints)1048k$0.10$0.15$0.40$0.60
openrouter google/​gemini-​2.0-​flash-​exp:free
gemini-​2.0-​flash-​exp
1048k $0.00 $0.00 22.2%
vertex_ai-language-models, openrouter gemini-2.0-flash-lite-001 (2 endpoints)1048k$0.075$0.30
gemini gemini/gemini-2.0-flash-lite-preview-02-05
gemini-​2.0-​flash-​lite-​preview-​02-​05
1048k$0.075$0.30
gemini gemini/gemini-2.0-flash-live-001
gemini-​2.0-​flash-​live-​001
1048k$0.35$1.50
vertex_ai-language-models gemini-2.0-flash-live-preview-04-09 1048k $0.50 $2.00
vertex_ai-language-models, gemini gemini-2.0-flash-preview-image-generation (2 endpoints)1048k$0.10$0.40
vertex_ai-language-models, gemini gemini-2.0-flash-thinking-exp (4 endpoints)1048k$0.00
>128k: $0.00
$0.00
>128k: $0.00
18.2%
gemini, vertex_ai-language-models, openrouter gemini-2.5-flash (3 endpoints)1048k$0.30$2.50
gemini, vertex_ai-language-models, openrouter gemini-2.5-flash-lite (3 endpoints)1048k$0.10$0.40
gemini, vertex_ai-language-models, openrouter gemini-2.5-flash-lite-preview-06-17 (3 endpoints)1048k$0.10$0.40
gemini gemini/gemini-2.5-flash-preview-tts
gemini-​2.5-​flash-​preview-​tts
1048k$0.15$0.60
vertex_ai-language-models, gemini, openrouter gemini-2.5-pro (3 endpoints)1048k$1.25
>200k: $2.50
$10.00
>200k: $15.00
vertex_ai-language-models, gemini gemini-2.5-pro-preview-tts (2 endpoints)1048k$1.25
>200k: $2.50
$10.00
>200k: $15.00
openrouter google/​gemini-​flash-​1.5
gemini-​flash-​1.5
1000k$0.075$0.30
openrouter google/​gemini-​flash-​1.5-​8b
gemini-​flash-​1.5-​8b
1000k$0.0375$0.15
vertex_ai-language-models gemini-flash-experimental 1000k $0.00 $0.00
gemini gemini/gemini-gemma-2-27b-it
gemini-​gemma-​2-​27b-​it
$0.35$1.05
gemini gemini/gemini-gemma-2-9b-it
gemini-​gemma-​2-​9b-​it
$0.35$1.05
gemini, vertex_ai-language-models gemini-pro (2 endpoints)32k$0.35$0.50
>128k: $0.70
$1.05$1.50
>128k: $2.10
openrouter google/​gemini-​pro-​1.5
gemini-​pro-​1.5
2000k$1.25$5.00
vertex_ai-language-models gemini-pro-experimental 1000k $0.00 $0.00
gemini, vertex_ai-vision-models gemini-pro-vision (2 endpoints)16k30k$0.35$0.50
>128k: $0.70
$1.05$1.50
>128k: $2.10
openrouter google/​gemma-​2-​27b-​it
gemma-​2-​27b-​it
8k$0.65$0.65
openrouter, groq gemma-2-9b-it (3 endpoints)8k$0.00$0.20$0.00$0.20
openrouter gemma-3-12b-it (2 endpoints)32k96k$0.00$0.0481$0.00$0.1926
gemini, openrouter gemma-3-27b-it (3 endpoints)96k131k$0.00$0.0666
>128k: $0.00
$0.00$0.2667
>128k: $0.00
4.9%
openrouter gemma-3-4b-it (2 endpoints)32k131k$0.00$0.02$0.00$0.04
openrouter google/​gemma-​3n-​e2b-​it:free
gemma-​3n-​e2b-​it
8k$0.00$0.00
openrouter gemma-3n-e4b-it (2 endpoints)8k32k$0.00$0.02$0.00$0.04
groq, anyscale gemma-7b-it (2 endpoints)8k$0.07$0.15$0.07$0.15
openrouter thudm/​glm-​4.1v-​9b-​thinking
glm-​4.1v-​9b-​thinking
65k$0.035$0.138
openrouter glm-4-32b (2 endpoints)32k128k$0.10$0.24$0.10$0.24
openrouter z-​ai/​glm-​4.5
glm-​4.5
131k$0.1999$0.8001
openrouter glm-4.5-air (2 endpoints)131k$0.00$0.20$0.00$1.10
together_ai together_ai/zai-org/GLM-4.5-Air-FP8
glm-​4.5-​air-​fp8
128k$0.20$1.10
openrouter z-​ai/​glm-​4.5v
glm-​4.5v
65k$0.50$1.70
fireworks_ai fireworks_ai/accounts/fireworks/models/glm-4p5
glm-​4p5
128k$0.55$2.19
fireworks_ai fireworks_ai/accounts/fireworks/models/glm-4p5-air
glm-​4p5-​air
128k$0.22$0.88
openrouter thudm/​glm-​z1-​32b
glm-​z1-​32b
32k$0.02$0.08
openrouter alpindale/​goliath-​120b
goliath-​120b
6k$4.00$5.50
openrouter, openai, azure gpt-3.5-turbo (12 endpoints)4k16k$0.20$1.50$1.50$2.00
openai, azure, openrouter gpt-3.5-turbo-16k (4 endpoints)16k$3.00$4.00
openrouter openai/​gpt-​3.5-​turbo-​instruct
gpt-​3.5-​turbo-​instruct
4k$1.50$2.00
openai, azure, openrouter gpt-4 (6 endpoints)8k8k$30.00$60.00
openai, azure gpt-4-0125-preview (2 endpoints)128k$10.00$30.00
openai, azure, openrouter gpt-4-1106-preview (3 endpoints)128k$10.00$30.00
openai, azure, openrouter gpt-4.1-mini (5 endpoints) 1047k $0.40 $1.60 32.4%
openai, azure, openrouter gpt-4.1-nano (5 endpoints) 1047k $0.10 $0.40 8.9%
azure gpt-4-32k (2 endpoints)32k$60.00$120.00
openai, openrouter, azure gpt-4o-audio-preview (6 endpoints)128k$2.50$10.00
openrouter openai/​gpt-​4o:extended
gpt-​4o:extended
128k$6.00$18.00
openai, azure gpt-4o-mini-audio-preview (3 endpoints)128k$0.15$2.50$0.60$10.00
openai, azure gpt-4o-mini-realtime-preview (5 endpoints)128k$0.60$0.66$2.40$2.64
openai, openrouter gpt-4o-mini-search-preview (3 endpoints)128k$0.15$0.60
openai, azure gpt-4o-realtime-preview (10 endpoints)128k$5.00$5.50$20.00$22.00
openai, openrouter gpt-4o-search-preview (3 endpoints)128k$2.50$10.00
openai, azure, openrouter gpt-4-turbo (5 endpoints)128k$10.00$30.00
openai, openrouter gpt-4-turbo-preview (2 endpoints)128k$10.00$30.00
azure azure/​gpt-​4-​turbo-​vision-​preview
gpt-​4-​turbo-​vision-​preview
128k$10.00$30.00
openai, openrouter, azure gpt-5 (5 endpoints)272k400k$1.25$10.00
openrouter, openai, azure gpt-5-chat (4 endpoints)272k400k$1.25$10.00
openai, openrouter, azure gpt-5-mini (5 endpoints)272k400k$0.25$2.00
openai, openrouter, azure gpt-5-nano (5 endpoints)272k400k$0.05$0.40
fireworks_ai, groq, cerebras, openrouter, together_ai gpt-oss-120b (5 endpoints) 128k 131k $0.072 $0.25 $0.28 $0.75 41.8%
bedrock_converse openai.​gpt-​oss-​120b-​1:0
gpt-​oss-​120b-​1:0
128k$0.15$0.60
openrouter, fireworks_ai, cerebras, groq, together_ai gpt-oss-20b (6 endpoints)128k131k$0.00$0.10$0.00$0.50
bedrock_converse openai.​gpt-​oss-​20b-​1:0
gpt-​oss-​20b-​1:0
128k$0.07$0.30
watsonx watsonx/​ibm/​granite-​3-​8b-​instruct
granite-​3-​8b-​instruct
8k$200.00$200.00
xai, openrouter grok-2 (4 endpoints)131k$2.00$10.00
xai, openrouter grok-2-vision (4 endpoints)32k$2.00$10.00
oci, azure_ai, xai, openrouter grok-3 (6 endpoints)131k$3.00$3.30$0.15$16.50
oci, xai grok-3-fast (2 endpoints)131k$5.00$25.00
xai xai/grok-3-fast-beta
grok-​3-​fast-​beta
131k$5.00$25.00
azure_ai, xai, oci, openrouter grok-3-mini (6 endpoints)131k$0.25$0.30$0.50$1.38
xai, oci grok-3-mini-fast (3 endpoints)131k$0.60$4.00
xai xai/grok-3-mini-fast-beta
grok-​3-​mini-​fast-​beta
131k$0.60$4.00
xai xai/​grok-​beta
grok-​beta
131k$5.00$15.00
xai, openrouter grok-vision-beta (2 endpoints)8k$5.00$15.00
openrouter nousresearch/​hermes-​2-​pro-​llama-​3-​8b
hermes-​2-​pro-​llama-​3-​8b
131k$0.025$0.04
lambda_ai, lambda hermes3-405b (2 endpoints)131k131k$0.80$0.80
lambda_ai, lambda hermes3-70b (2 endpoints)131k131k$0.12$0.30
lambda_ai, lambda hermes3-8b (2 endpoints)131k131k$0.025$0.04
openrouter nousresearch/​hermes-​3-​llama-​3.1-​405b
hermes-​3-​llama-​3.1-​405b
131k$0.70$0.80
openrouter, hyperbolic hermes-3-llama-3.1-70b (2 endpoints)32k131k$0.10$0.12$0.28$0.30
openrouter hunyuan-a13b-instruct (2 endpoints)32k$0.00$0.03$0.00$0.03
openrouter inflection/​inflection-​3-​pi
inflection-​3-​pi
8k$2.50$10.00
openrouter inflection/​inflection-​3-​productivity
inflection-​3-​productivity
8k$2.50$10.00
openrouter opengvlab/​internvl3-​14b
internvl3-​14b
12k$0.20$0.40
bedrock ai21.​j2-​mid-​v1
j2-​mid-​v1
8k$12.50$12.50
bedrock ai21.​j2-​ultra-​v1
j2-​ultra-​v1
8k$18.80$18.80
azure_ai azure_ai/jais-30b-chat
jais-​30b-​chat
8k$3,200.00$9,710.00
vertex_ai-ai21_models, ai21 jamba-1.5 (2 endpoints)256k$0.20$0.40
vertex_ai-ai21_models, ai21, bedrock jamba-1.5-large (3 endpoints)256k$2.00$8.00
vertex_ai-ai21_models, ai21 jamba-1.5-large@001 (2 endpoints)256k$2.00$8.00
vertex_ai-ai21_models, ai21, bedrock jamba-1.5-mini (3 endpoints)256k$0.20$0.40
vertex_ai-ai21_models, ai21 jamba-1.5-mini@001 (2 endpoints)256k$0.20$0.40
azure_ai, bedrock jamba-instruct (2 endpoints)70k$0.50$0.70
ai21 jamba-large-1.6 256k $2.00 $8.00
ai21, openrouter jamba-large-1.7 (2 endpoints)256k$2.00$8.00
ai21 jamba-mini-1.6 256k $0.20 $0.40
ai21, openrouter jamba-mini-1.7 (2 endpoints)256k$0.20$0.40
openrouter moonshotai/​kimi-​dev-​72b:free
kimi-​dev-​72b
131k$0.00$0.00
moonshot moonshot/kimi-k2-0711-preview
kimi-​k2-​0711-​preview
131k$0.60$2.50
fireworks_ai, groq, hyperbolic, together_ai kimi-k2-instruct (4 endpoints)131k$0.60$2.00$2.00$3.00
moonshot moonshot/kimi-latest
kimi-​latest
131k$2.00$5.00
moonshot moonshot/kimi-latest-128k
kimi-​latest-​128k
131k$2.00$5.00
moonshot moonshot/kimi-latest-32k
kimi-​latest-​32k
32k$1.00$3.00
moonshot moonshot/kimi-latest-8k
kimi-​latest-​8k
8k$0.20$2.00
moonshot moonshot/kimi-thinking-preview
kimi-​thinking-​preview
131k$30.00$30.00
openrouter kimi-vl-a3b-thinking (2 endpoints)131k$0.00$0.025$0.00$0.10
openrouter sao10k/​l3.1-​euryale-​70b
l3.1-​euryale-​70b
32k$0.65$0.75
openrouter sao10k/​l3.3-​euryale-​70b
l3.3-​euryale-​70b
131k$0.65$0.75
openrouter sao10k/​l3-​euryale-​70b
l3-​euryale-​70b
8k$1.48$1.48
openrouter sao10k/​l3-​lunaris-​8b
l3-​lunaris-​8b
8k$0.02$0.05
gemini gemini/learnlm-1.5-pro-experimental
learnlm-​1.5-​pro-​experimental
32k$0.00
>128k: $0.00
$0.00
>128k: $0.00
openrouter liquid/​lfm-​3b
lfm-​3b
32k$0.02$0.02
lambda_ai, lambda lfm-40b (2 endpoints)66k131k$0.10$0.15$0.15$0.20
lambda_ai, openrouter lfm-7b (2 endpoints)32k131k$0.01$0.025$0.01$0.04
replicate replicate/​meta/​llama-​2-​13b
llama-​2-​13b
4k$0.10$0.50
replicate, deepinfra, anyscale llama-2-13b-chat (3 endpoints)4k$0.10$0.25$0.22$0.50
bedrock meta.​llama2-​13b-​chat-​v1
llama2-​13b-​chat-​v1
4k$0.75$1.00
replicate, groq llama-2-70b (2 endpoints)4k$0.65$0.70$0.80$2.75
replicate, deepinfra, perplexity, anyscale llama-2-70b-chat (4 endpoints)4k$0.65$1.00$0.90$2.80
bedrock meta.​llama2-​70b-​chat-​v1
llama2-​70b-​chat-​v1
4k$1.95$2.56
replicate replicate/​meta/​llama-​2-​7b
llama-​2-​7b
4k$0.05$0.25
replicate, deepinfra, anyscale llama-2-7b-chat (3 endpoints)4k$0.05$0.15$0.13$0.25
cloudflare cloudflare/​@cf/​meta/​llama-​2-​7b-​chat-​fp16
llama-​2-​7b-​chat-​fp16
3k$1.923$1.923
cloudflare cloudflare/​@cf/​meta/​llama-​2-​7b-​chat-​int8
llama-​2-​7b-​chat-​int8
2k$1.923$1.923
openrouter meta-​llama/​llama-​3.1-​405b
llama-​3.1-​405b
32k$2.00$2.00
bedrock, oci, openrouter llama-3.1-405b-instruct (5 endpoints)32k128k$0.00$10.68$0.00$16.00
lambda_ai, lambda llama3.1-405b-instruct-fp8 (2 endpoints)131k131k$0.80$0.80
vertex_ai-llama_models vertex_ai/meta/llama-3.1-405b-instruct-maas
llama-​3.1-​405b-​instruct-​maas
128k$5.00$16.00
groq groq/​llama-​3.1-​405b-​reasoning
llama-​3.1-​405b-​reasoning
8k$0.59$0.79
cerebras cerebras/​llama3.1-​70b
llama3.1-​70b
128k$0.60$0.60
openrouter, perplexity, bedrock llama-3.1-70b-instruct (4 endpoints)128k131k$0.10$1.00$0.28$1.00
lambda_ai, lambda llama3.1-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
vertex_ai-llama_models vertex_ai/meta/llama-3.1-70b-instruct-maas
llama-​3.1-​70b-​instruct-​maas
128k$0.00$0.00
groq groq/​llama-​3.1-​70b-​versatile
llama-​3.1-​70b-​versatile
8k$0.59$0.79
cerebras cerebras/​llama3.1-​8b
llama3.1-​8b
128k$0.10$0.10
groq groq/​llama-​3.1-​8b-​instant
llama-​3.1-​8b-​instant
128k$0.05$0.08
openrouter, lambda_ai, perplexity, lambda, bedrock, nscale llama-3.1-8b-instruct (7 endpoints)128k131k$0.015$0.22$0.02$0.22
vertex_ai-llama_models vertex_ai/meta/llama-3.1-8b-instruct-maas
llama-​3.1-​8b-​instruct-​maas
128k$0.00$0.00
openrouter neversleep/​llama-​3.1-​lumimaid-​8b
llama-​3.1-​lumimaid-​8b
32k$0.09$0.60
openrouter nvidia/​llama-​3.1-​nemotron-​70b-​instruct
llama-​3.1-​nemotron-​70b-​instruct
131k$0.12$0.30
lambda_ai, lambda llama3.1-nemotron-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
openrouter llama-3.1-nemotron-ultra-253b-v1 (2 endpoints)131k$0.00$0.60$0.00$1.80
perplexity perplexity/​llama-​3.1-​sonar-​huge-​128k-​online
llama-​3.1-​sonar-​huge-​128k-​online
127k$5.00$5.00
perplexity perplexity/​llama-​3.1-​sonar-​large-​128k-​chat
llama-​3.1-​sonar-​large-​128k-​chat
131k$1.00$1.00
perplexity perplexity/​llama-​3.1-​sonar-​large-​128k-​online
llama-​3.1-​sonar-​large-​128k-​online
127k$1.00$1.00
perplexity perplexity/​llama-​3.1-​sonar-​small-​128k-​chat
llama-​3.1-​sonar-​small-​128k-​chat
131k$0.20$0.20
perplexity perplexity/​llama-​3.1-​sonar-​small-​128k-​online
llama-​3.1-​sonar-​small-​128k-​online
127k$0.20$0.20
openrouter scb10x/​llama3.1-​typhoon2-​70b-​instruct
llama3.1-​typhoon2-​70b-​instruct
8k$0.88$0.88
bedrock llama3-2-11b-instruct-v1.0 (2 endpoints)128k$0.35$0.35
groq groq/​llama-​3.2-​11b-​text-​preview
llama-​3.2-​11b-​text-​preview
8k$0.18$0.18
openrouter, lambda_ai, azure_ai llama-3.2-11b-vision-instruct (4 endpoints)128k131k$0.00$0.37$0.00$0.37
groq groq/​llama-​3.2-​11b-​vision-​preview
llama-​3.2-​11b-​vision-​preview
8k$0.18$0.18
openrouter, bedrock llama-3.2-1b-instruct (4 endpoints)128k131k$0.005$0.13$0.01$0.13
groq groq/​llama-​3.2-​1b-​preview
llama-​3.2-​1b-​preview
8k$0.04$0.04
openrouter, lambda_ai, lambda, bedrock, hyperbolic llama-3.2-3b-instruct (8 endpoints)20k131k$0.00$0.19$0.00$0.30
groq groq/​llama-​3.2-​3b-​preview
llama-​3.2-​3b-​preview
8k$0.06$0.06
bedrock llama3-2-90b-instruct-v1.0 (2 endpoints)128k$2.00$2.00
groq groq/​llama-​3.2-​90b-​text-​preview
llama-​3.2-​90b-​text-​preview
8k$0.90$0.90
openrouter, oci, azure_ai llama-3.2-90b-vision-instruct (3 endpoints)128k131k$1.20$2.04$1.20$2.04
vertex_ai-llama_models vertex_ai/meta/llama-3.2-90b-vision-instruct-maas
llama-​3.2-​90b-​vision-​instruct-​maas
128k$0.00$0.00
groq groq/​llama-​3.2-​90b-​vision-​preview
llama-​3.2-​90b-​vision-​preview
8k$0.90$0.90
cerebras cerebras/​llama-​3.3-​70b
llama-​3.3-​70b
128k$0.85$1.20
openrouter, hyperbolic, azure_ai, oci, bedrock_converse, nscale, gradient_ai llama-3.3-70b-instruct (9 endpoints)65k131k$0.00$0.72$0.00$0.72
lambda_ai, lambda llama3.3-70b-instruct-fp8 (2 endpoints)131k131k$0.12$0.30
together_ai together_ai/​meta-​llama/​Llama-​3.3-​70B-​Instruct-​Turbo
llama-​3.3-​70b-​instruct-​turbo
$0.88$0.88
together_ai together_ai/​meta-​llama/​Llama-​3.3-​70B-​Instruct-​Turbo-​Free
llama-​3.3-​70b-​instruct-​turbo-​free
$0.00$0.00
groq groq/​llama-​3.3-​70b-​specdec
llama-​3.3-​70b-​specdec
8k$0.59$0.99
groq groq/​llama-​3.3-​70b-​versatile
llama-​3.3-​70b-​versatile
128k$0.59$0.79
openrouter nvidia/​llama-​3.3-​nemotron-​super-​49b-​v1
llama-​3.3-​nemotron-​super-​49b-​v1
131k$0.13$0.40
vertex_ai-llama_models vertex_ai/meta/llama3-405b-instruct-maas
llama3-​405b-​instruct-​maas
32k$0.00$0.00
groq, replicate llama-3-70b (2 endpoints)8k$0.59$0.65$0.79$2.75
openrouter, replicate, bedrock llama-3-70b-instruct (12 endpoints)8k8k$0.30$4.45$0.40$5.88
vertex_ai-llama_models vertex_ai/meta/llama3-70b-instruct-maas
llama3-​70b-​instruct-​maas
32k$0.00$0.00
groq, replicate llama-3-8b (2 endpoints)8k8k$0.05$0.08$0.25
openrouter, bedrock, replicate, gradient_ai llama-3-8b-instruct (13 endpoints)8k8k$0.03$0.50$0.06$2.65
vertex_ai-llama_models vertex_ai/meta/llama3-8b-instruct-maas
llama3-​8b-​instruct-​maas
32k$0.00$0.00
groq groq/​llama3-​groq-​70b-​8192-​tool-​use-​preview
llama3-​groq-​70b-​8192-​tool-​use-​preview
8k$0.89$0.89
groq groq/​llama3-​groq-​8b-​8192-​tool-​use-​preview
llama3-​groq-​8b-​8192-​tool-​use-​preview
8k$0.19$0.19
openrouter neversleep/​llama-​3-​lumimaid-​70b
llama-​3-​lumimaid-​70b
8k$4.00$6.00
groq, sambanova llama-4-maverick-17b-128e-instruct (2 endpoints)131k$0.20$0.63$0.60$1.80
azure_ai, oci, lambda_ai, together_ai llama-4-maverick-17b-128e-instruct-fp8 (4 endpoints)131k1000k$0.05$1.41$0.10$0.85
vertex_ai-llama_models vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas
llama-​4-​maverick-​17b-​128e-​instruct-​maas
1000k$0.35$1.15
vertex_ai-llama_models vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas
llama-​4-​maverick-​17b-​16e-​instruct-​maas
1000k$0.35$1.15
bedrock_converse llama4-maverick-17b-instruct-v1.0 (2 endpoints)128k$0.24$0.97
fireworks_ai fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic
llama4-​maverick-​instruct-​basic
131k$0.22$0.88
vertex_ai-llama_models vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas
llama-​4-​scout-​17b-​128e-​instruct-​maas
10000k$0.25$0.70
azure_ai, oci, groq, lambda_ai, sambanova, nscale, together_ai llama-4-scout-17b-16e-instruct (7 endpoints)8k10000k$0.05$0.72$0.10$0.78
vertex_ai-llama_models vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas
llama-​4-​scout-​17b-​16e-​instruct-​maas
10000k$0.25$0.70
bedrock_converse llama4-scout-17b-instruct-v1.0 (2 endpoints)128k$0.17$0.66
fireworks_ai fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic
llama4-​scout-​instruct-​basic
131k$0.15$0.60
openrouter meta-​llama/​llama-​guard-​2-​8b
llama-​guard-​2-​8b
8k$0.20$0.20
openrouter, groq llama-guard-3-8b (2 endpoints)8k131k$0.02$0.20$0.06$0.20
openrouter meta-​llama/​llama-​guard-​4-​12b
llama-​guard-​4-​12b
163k$0.18$0.18
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct
llama-​v3p1-​405b-​instruct
128k$3.00$3.00
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
llama-​v3p1-​8b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct
llama-​v3p2-​11b-​vision-​instruct
16k$0.20$0.20
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct
llama-​v3p2-​1b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct
llama-​v3p2-​3b-​instruct
16k$0.10$0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct
llama-​v3p2-​90b-​vision-​instruct
16k$0.90$0.90
openrouter eleutherai/​llemma_7b
llemma_7b
4k$0.80$1.20
aleph_alpha luminous-base-control $37.50 $41.25
aleph_alpha luminous-extended-control $56.25 $61.875
aleph_alpha luminous-supreme-control $218.75 $240.625
deepinfra deepinfra/​lizpreciatior/​lzlv_70b_fp16_hf
lzlv_70b_fp16_hf
4k$0.70$0.90
openrouter arcee-​ai/​maestro-​reasoning
maestro-​reasoning
131k$0.90$3.30
openrouter, mistral magistral-medium-latest (3 endpoints)40k40k$2.00$5.00
openrouter mistralai/​magistral-​medium-​2506:thinking
magistral-​medium-​2506:thinking
40k$2.00$5.00
mistral, openrouter magistral-small-latest (3 endpoints)40k$0.50$1.50
openrouter anthracite-​org/​magnum-​v2-​72b
magnum-​v2-​72b
32k$3.00$3.00
openrouter anthracite-​org/​magnum-​v4-​72b
magnum-​v4-​72b
16k$2.00$5.00
openrouter mai-ds-r1 (2 endpoints)163k$0.00$0.1999$0.00$0.8001
openrouter inception/​mercury
mercury
128k$0.25$1.00
openrouter inception/​mercury-​coder
mercury-​coder
128k$0.25$1.00
azure_ai, hyperbolic, deepinfra, sambanova meta-llama-3.1-405b-instruct (4 endpoints)16k128k$0.12$5.33$0.30$16.00
together_ai together_ai/​meta-​llama/​Meta-​Llama-​3.1-​405B-​Instruct-​Turbo
meta-​llama-​3.1-​405b-​instruct-​turbo
$3.50$3.50
azure_ai, hyperbolic, friendliai meta-llama-3.1-70b-instruct (3 endpoints)8k128k$0.12$2.68$0.30$3.54
together_ai together_ai/​meta-​llama/​Meta-​Llama-​3.1-​70B-​Instruct-​Turbo
meta-​llama-​3.1-​70b-​instruct-​turbo
$0.88$0.88
azure_ai, hyperbolic, sambanova, friendliai meta-llama-3.1-8b-instruct (4 endpoints)8k128k$0.10$0.30$0.10$0.61
together_ai together_ai/​meta-​llama/​Meta-​Llama-​3.1-​8B-​Instruct-​Turbo
meta-​llama-​3.1-​8b-​instruct-​turbo
$0.18$0.18
sambanova sambanova/Meta-Llama-3.2-1B-Instruct
meta-​llama-​3.2-​1b-​instruct
16k$0.04$0.08
sambanova sambanova/Meta-Llama-3.2-3B-Instruct
meta-​llama-​3.2-​3b-​instruct
4k$0.08$0.16
sambanova sambanova/Meta-Llama-3.3-70B-Instruct
meta-​llama-​3.3-​70b-​instruct
131k$0.60$1.20
hyperbolic, anyscale, azure_ai, deepinfra meta-llama-3-70b-instruct (4 endpoints)8k131k$0.12$1.10$0.30$1.00
anyscale, deepinfra meta-llama-3-8b-instruct (2 endpoints)8k8k$0.08$0.15$0.08$0.15
sambanova sambanova/Meta-Llama-Guard-3-8B
meta-​llama-​guard-​3-​8b
16k$0.30$0.30
sagemaker sagemaker/meta-textgeneration-llama-2-13b-f
meta-​textgeneration-​llama-​2-​13b-​f
4k$0.00$0.00
sagemaker sagemaker/meta-textgeneration-llama-2-70b-b-f
meta-​textgeneration-​llama-​2-​70b-​b-​f
4k$0.00$0.00
sagemaker sagemaker/meta-textgeneration-llama-2-7b-f
meta-​textgeneration-​llama-​2-​7b-​f
4k$0.00$0.00
openrouter sophosympatheia/​midnight-​rose-​70b
midnight-​rose-​70b
4k$0.80$0.80
openrouter minimax/​minimax-​01
minimax-​01
1000k$0.20$1.10
openrouter minimax/​minimax-​m1
minimax-​m1
1000k$0.30$1.65
azure_ai, openrouter ministral-3b (2 endpoints)32k128k$0.04$0.04
openrouter mistralai/​ministral-​8b
ministral-​8b
128k$0.10$0.10
openrouter, perplexity mistral-7b-instruct (3 endpoints)4k32k$0.00$0.07$0.00$0.28
deepinfra, anyscale, cloudflare, openrouter mistral-7b-instruct-v0.1 (4 endpoints)2k32k$0.11$1.923$0.13$1.923
bedrock, replicate mistral-7b-instruct-v0.2 (5 endpoints)4k32k$0.05$0.20$0.20$0.26
openrouter mistralai/​mistral-​7b-​instruct-​v0.3
mistral-​7b-​instruct-​v0.3
32k$0.028$0.054
replicate replicate/​mistralai/​mistral-​7b-​v0.1
mistral-​7b-​v0.1
4k$0.05$0.25
openrouter, watsonx, azure_ai, vertex_ai-mistral_models, mistral, bedrock, azure mistral-large (20 endpoints)32k131k$2.00$10.40$6.00$31.20
vertex_ai-mistral_models vertex_ai/​mistral-​large@2411-​001
mistral-​large@2411-​001
128k$2.00$6.00
vertex_ai-mistral_models vertex_ai/​mistral-​large@latest
mistral-​large@latest
128k$2.00$6.00
deepinfra deepinfra/​amazon/​MistralLite
mistrallite
32k$0.20$0.20
azure_ai, mistral mistral-medium-latest (5 endpoints)32k131k$0.40$2.70$2.00$8.10
openrouter mistralai/​mistral-​medium-​3
mistral-​medium-​3
131k$0.40$2.00
openrouter mistralai/​mistral-​medium-​3.1
mistral-​medium-​3.1
262k$0.40$2.00
openrouter, azure_ai, vertex_ai-mistral_models mistral-nemo (4 endpoints)32k131k$0.00$3.00$0.00$3.00
gradient_ai gradient_ai/mistral-nemo-instruct-2407
mistral-​nemo-​instruct-​2407
$0.30$0.30
vertex_ai-mistral_models vertex_ai/​mistral-​nemo@latest
mistral-​nemo@latest
128k$0.15$0.15
openrouter mistralai/​mistral-​saba
mistral-​saba
32k$0.20$0.60
groq groq/​mistral-​saba-​24b
mistral-​saba-​24b
32k$0.79$0.79
openrouter mistral-small-24b-instruct-2501 (2 endpoints)32k$0.00$0.02$0.00$0.08
vertex_ai-mistral_models vertex_ai/​mistral-​small-​2503@001
mistral-​small-​2503@001
32k$1.00$3.00
openrouter mistral-small-3.1-24b-instruct (2 endpoints)128k131k$0.00$0.018$0.00$0.072
openrouter mistral-small-3.2-24b-instruct (2 endpoints)128k131k$0.00$0.05$0.00$0.10
openrouter, mistral mistral-tiny (2 endpoints)32k32k$0.25$0.25
openrouter, fireworks_ai mixtral-8x22b-instruct (2 endpoints)65k$0.90$1.20$0.90$1.20
anyscale, nscale mixtral-8x22b-instruct-v0.1 (2 endpoints)65k$0.60$0.90$0.60$0.90
groq groq/​mixtral-​8x7b-​32768
mixtral-​8x7b-​32768
32k$0.24$0.24
openrouter, perplexity mixtral-8x7b-instruct (2 endpoints)4k32k$0.07$0.08$0.24$0.28
deepinfra, bedrock, anyscale, replicate, together_ai mixtral-8x7b-instruct-v0.1 (8 endpoints)4k32k$0.15$0.60$0.15$1.00
openrouter infermatic/​mn-​inferor-​12b
mn-​inferor-​12b
8k$0.60$1.00
moonshot moonshot-v1-128k (2 endpoints)131k$2.00$5.00
moonshot moonshot/moonshot-v1-128k-vision-preview
moonshot-​v1-​128k-​vision-​preview
131k$2.00$5.00
moonshot moonshot-v1-32k (2 endpoints)32k$1.00$3.00
moonshot moonshot/moonshot-v1-32k-vision-preview
moonshot-​v1-​32k-​vision-​preview
32k$1.00$3.00
moonshot moonshot-v1-8k (2 endpoints)8k$0.20$2.00
moonshot moonshot/moonshot-v1-8k-vision-preview
moonshot-​v1-​8k-​vision-​preview
8k$0.20$2.00
moonshot moonshot/moonshot-v1-auto
moonshot-​v1-​auto
131k$2.00$5.00
openrouter, morph morph-v3-fast (2 endpoints)16k81k$0.80$0.90$1.20$1.90
openrouter, morph morph-v3-large (2 endpoints)16k81k$0.90$1.90
openrouter pygmalionai/​mythalion-​13b
mythalion-​13b
4k$0.70$1.10
openrouter, deepinfra mythomax-l2-13b (2 endpoints)4k$0.06$0.22$0.06$0.22
openrouter neversleep/​noromaid-​20b
noromaid-​20b
4k$1.00$1.75
openrouter nousresearch/​nous-​hermes-​2-​mixtral-​8x7b-​dpo
nous-​hermes-​2-​mixtral-​8x7b-​dpo
32k$0.60$0.60
openrouter amazon/​nova-​lite-​v1
nova-​lite-​v1
300k$0.06$0.24
bedrock_converse nova-lite-v1.0 (4 endpoints)300k$0.06$0.078$0.24$0.312
openrouter amazon/​nova-​micro-​v1
nova-​micro-​v1
128k$0.035$0.14
bedrock_converse nova-micro-v1.0 (4 endpoints)128k$0.035$0.046$0.14$0.184
bedrock_converse us.​amazon.​nova-​premier-​v1:0
nova-​premier-​v1.0
1000k$2.50$12.50
openrouter amazon/​nova-​pro-​v1
nova-​pro-​v1
300k$0.80$3.20
bedrock_converse, bedrock nova-pro-v1.0 (6 endpoints)300k$0.80$1.05$3.20$4.20
azure o1-preview (4 endpoints)128k$15.00$16.50$60.00$66.00
gradient_ai gradient_ai/openai-o3
openai-​o3
$2.00$8.00
gradient_ai gradient_ai/openai-o3-mini
openai-​o3-​mini
$1.10$4.40
deepinfra deepinfra/​openchat/​openchat_3.5
openchat_3.5
4k$0.13$0.13
mistral mistral/open-codestral-mamba
open-​codestral-​mamba
256k$0.25$0.25
mistral mistral/​open-​mistral-​7b
open-​mistral-​7b
32k$0.25$0.25
mistral open-mistral-nemo (2 endpoints)128k$0.30$0.30
mistral mistral/​open-​mixtral-​8x22b
open-​mixtral-​8x22b
65k$2.00$6.00
mistral mistral/​open-​mixtral-​8x7b
open-​mixtral-​8x7b
32k$0.70$0.70
openrouter microsoft/​phi-​3.5-​mini-​128k-​instruct
phi-​3.5-​mini-​128k-​instruct
128k$0.10$0.10
azure_ai azure_ai/Phi-3.5-mini-instruct
phi-​3.5-​mini-​instruct
128k$0.13$0.52
azure_ai azure_ai/Phi-3.5-MoE-instruct
phi-​3.5-​moe-​instruct
128k$0.16$0.64
azure_ai azure_ai/Phi-3.5-vision-instruct
phi-​3.5-​vision-​instruct
128k$0.13$0.52
azure_ai, openrouter phi-3-medium-128k-instruct (2 endpoints)128k$0.17$1.00$0.68$1.00
azure_ai azure_ai/Phi-3-medium-4k-instruct
phi-​3-​medium-​4k-​instruct
4k$0.17$0.68
openrouter, azure_ai phi-3-mini-128k-instruct (2 endpoints)128k$0.10$0.13$0.10$0.52
azure_ai azure_ai/Phi-3-mini-4k-instruct
phi-​3-​mini-​4k-​instruct
4k$0.13$0.52
azure_ai azure_ai/Phi-3-small-128k-instruct
phi-​3-​small-​128k-​instruct
128k$0.15$0.60
azure_ai azure_ai/Phi-3-small-8k-instruct
phi-​3-​small-​8k-​instruct
8k$0.15$0.60
openrouter, azure_ai phi-4 (2 endpoints)16k$0.06$0.125$0.14$0.50
azure_ai azure_ai/Phi-4-mini-instruct
phi-​4-​mini-​instruct
131k$0.075$0.30
openrouter, azure_ai phi-4-multimodal-instruct (2 endpoints)131k$0.05$0.08$0.10$0.32
openrouter microsoft/​phi-​4-​reasoning-​plus
phi-​4-​reasoning-​plus
32k$0.07$0.35
deepinfra deepinfra/​Phind/​Phind-​CodeLlama-​34B-​v2
phind-​codellama-​34b-​v2
16k$0.60$0.60
mistral, openrouter pixtral-12b (2 endpoints)32k128k$0.10$0.15$0.10$0.15
openrouter, mistral, bedrock_converse pixtral-large-latest (5 endpoints)128k131k$2.00$6.00
perplexity perplexity/​pplx-​70b-​chat
pplx-​70b-​chat
4k$0.70$2.80
perplexity perplexity/​pplx-​70b-​online
pplx-​70b-​online
4k$0.00$2.80
perplexity perplexity/​pplx-​7b-​chat
pplx-​7b-​chat
8k$0.07$0.28
perplexity perplexity/​pplx-​7b-​online
pplx-​7b-​online
4k$0.00$0.28
hyperbolic, openrouter qwen2.5-72b-instruct (3 endpoints)32k131k$0.00$0.12$0.00$0.30
openrouter qwen/​qwen-​2.5-​7b-​instruct
qwen-​2.5-​7b-​instruct
65k$0.04$0.10
lambda_ai, lambda, openrouter, hyperbolic, nscale qwen25-coder-32b-instruct (6 endpoints)32k131k$0.00$0.12$0.00$0.30 16.4%
nscale nscale/Qwen/Qwen2.5-Coder-3B-Instruct
qwen2.5-​coder-​3b-​instruct
$0.01$0.03
nscale nscale/Qwen/Qwen2.5-Coder-7B-Instruct
qwen2.5-​coder-​7b-​instruct
$0.01$0.03
openrouter qwen2.5-vl-32b-instruct (2 endpoints)8k16k$0.00$0.02$0.00$0.08
openrouter qwen2.5-vl-72b-instruct (2 endpoints)32k$0.00$0.10$0.00$0.40
openrouter qwen/​qwen-​2.5-​vl-​7b-​instruct
qwen-​2.5-​vl-​7b-​instruct
32k$0.20$0.20
fireworks_ai, openrouter qwen2-72b-instruct (2 endpoints)32k$0.90$0.90
sambanova sambanova/Qwen2-Audio-7B-Instruct
qwen2-​audio-​7b-​instruct
4k$0.50$100.00
fireworks_ai fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct
qwen2p5-​coder-​32b-​instruct
4k$0.90$0.90
openrouter qwen3-14b (2 endpoints)40k$0.00$0.06$0.00$0.24
openrouter, hyperbolic qwen3-235b-a22b (4 endpoints)40k262k$0.00$2.00$0.00$2.00
together_ai together_ai/Qwen/Qwen3-235B-A22B-fp8-tput
qwen3-​235b-​a22b-​fp8-​tput
40k$0.20$0.60
vertex_ai-qwen_models vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas
qwen3-​235b-​a22b-​instruct-​2507-​maas
262k$0.25$1.00
together_ai together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput
qwen3-​235b-​a22b-​instruct-​2507-​tput
262k$0.20$6.00
openrouter, together_ai qwen3-235b-a22b-thinking-2507 (2 endpoints)256k262k$0.078$0.65$0.312$3.00
openrouter qwen3-30b-a3b (2 endpoints)40k$0.00$0.02$0.00$0.08
openrouter qwen/​qwen3-​30b-​a3b-​instruct-​2507
qwen3-​30b-​a3b-​instruct-​2507
131k$0.20$0.80
groq, cerebras, openrouter, sambanova qwen3-32b (4 endpoints)8k131k$0.018$0.40$0.072$0.80 40%
lambda_ai lambda_ai/qwen3-32b-fp8
qwen3-​32b-​fp8
131k$0.05$0.10
openrouter qwen/​qwen3-​4b:free
qwen3-​4b
40k$0.00$0.00
openrouter qwen3-8b (2 endpoints)40k128k$0.00$0.035$0.00$0.138
openrouter qwen3-coder (2 endpoints)262k$0.00$0.20$0.00$0.80
together_ai together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
qwen3-​coder-​480b-​a35b-​instruct-​fp8
256k$2.00$2.00
vertex_ai-qwen_models vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas
qwen3-​coder-​480b-​a35b-​instruct-​maas
262k$1.00$4.00
openrouter qwen/​qwen-​max
qwen-​max
32k$1.60$6.40
openrouter qwen/​qwen-​plus
qwen-​plus
131k$0.40$1.20
openrouter qwen/​qwen-​turbo
qwen-​turbo
1000k$0.05$0.20
openrouter qwen/​qwen-​vl-​max
qwen-​vl-​max
7k$0.80$3.20
openrouter qwen/​qwen-​vl-​plus
qwen-​vl-​plus
7k$0.21$0.63
openrouter featherless/​qwerky-​72b:free
qwerky-​72b
32k$0.00$0.00
openrouter, hyperbolic, sambanova, nscale qwq-32b (5 endpoints)16k131k$0.00$0.50$0.00$1.00 20.9%
openrouter qwq-32b-arliai-rpr-v1 (2 endpoints)32k$0.00$0.01$0.00$0.04
openrouter qwen/​qwq-​32b-​preview
qwq-​32b-​preview
32k$0.20$0.20
openrouter perplexity/​r1-​1776
r1-​1776
128k$2.00$8.00
openrouter rekaai/​reka-​flash-​3:free
reka-​flash-​3
32k$0.00$0.00
openrouter undi95/​remm-​slerp-​l2-​13b
remm-​slerp-​l2-​13b
6k$0.45$0.65
openrouter thedrummer/​rocinante-​12b
rocinante-​12b
32k$0.17$0.43
openrouter switchpoint/​router
router
131k$0.85$3.40
openrouter sarvamai/​sarvam-​m:free
sarvam-​m
32k$0.00$0.00
openrouter shisa-v2-llama3.3-70b (2 endpoints)32k$0.00$0.02$0.00$0.08
openrouter thedrummer/​skyfall-​36b-​v2
skyfall-​36b-​v2
32k$0.0481$0.1926
perplexity, openrouter sonar (2 endpoints)127k128k$1.00$1.00
perplexity, openrouter sonar-deep-research (2 endpoints)128k$2.00$8.00
perplexity perplexity/​sonar-​medium-​chat
sonar-​medium-​chat
16k$0.60$1.80
perplexity perplexity/​sonar-​medium-​online
sonar-​medium-​online
12k$0.00$1.80
perplexity, openrouter sonar-pro (2 endpoints)200k$3.00$15.00
perplexity, openrouter sonar-reasoning (2 endpoints)127k128k$1.00$5.00
perplexity, openrouter sonar-reasoning-pro (2 endpoints)128k$2.00$8.00
perplexity perplexity/​sonar-​small-​chat
sonar-​small-​chat
16k$0.07$0.28
perplexity perplexity/​sonar-​small-​online
sonar-​small-​online
12k$0.00$0.28
openrouter raifle/​sorcererlm-​8x22b
sorcererlm-​8x22b
16k$4.50$4.50
openrouter arcee-​ai/​spotlight
spotlight
131k$0.18$0.18
bedrock titan-text-express-v1 (3 endpoints)42k$1.30$1.70
bedrock titan-text-lite-v1 (3 endpoints)42k$0.30$0.40
bedrock titan-text-premier-v1.0 (3 endpoints)42k$0.50$1.50
together_ai together-​ai-​21.1b-​41b$0.80$0.80
together_ai together-​ai-​41.1b-​80b$0.90$0.90
together_ai together-​ai-​4.1b-​8b$0.20$0.20
together_ai together-​ai-​81.1b-​110b$1.80$1.80
together_ai together-​ai-​8.1b-​21b$0.30$0.30
together_ai together-​ai-​up-​to-​4b$0.10$0.10
openrouter bytedance/​ui-​tars-​1.5-​7b
ui-​tars-​1.5-​7b
128k$0.10$0.20
openrouter thedrummer/​unslopnemo-​12b
unslopnemo-​12b
32k$0.40$0.40
v0 v0/v0-1.0-md
v0-​1.0-​md
128k$3.00$15.00
v0 v0/v0-1.5-lg
v0-​1.5-​lg
512k$15.00$75.00
v0 v0/v0-1.5-md
v0-​1.5-​md
128k$3.00$15.00
openrouter arcee-​ai/​virtuoso-​large
virtuoso-​large
131k$0.75$1.20
openrouter mancer/​weaver
weaver
8k$1.125$1.125
openrouter microsoft/​wizardlm-​2-​8x22b
wizardlm-​2-​8x22b
65k$0.48$0.48
deepinfra deepinfra/​01-​ai/​Yi-​34B-​Chat
yi-​34b-​chat
4k$0.60$0.60
fireworks_ai fireworks_ai/accounts/fireworks/models/yi-large
yi-​large
32k$3.00$3.00
anyscale anyscale/​HuggingFaceH4/​zephyr-​7b-​beta
zephyr-​7b-​beta
16k$0.15$0.15