LLM Model Prices (per million tokens)

Thanks to the LiteLLM project, the model providers' websites and APIs, and assorted online sources for the source data. Most benchmark data comes from the Epoch AI benchmarking dashboard, provided under a Creative Commons license by Epoch AI; some comes from other sources. The Aider Polyglot data is from the Aider website. Always check the original source for the most up-to-date information: the data may contain errors (some sources are a bit rough), and the matching of benchmark scores to models is approximate.
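Because every price here is quoted per million tokens, the cost of a single request is a small piece of arithmetic. The sketch below is illustrative only: the `ModelPrice` class and `request_cost` function are hypothetical names, "200k" is taken to mean 200,000 tokens, and it assumes that a long-context tier (the ">200k:" entries) applies to the whole request once the prompt exceeds the threshold. Check each provider's billing rules before relying on that assumption. The example prices are the claude-sonnet-4 and gemini-2.5-pro rows from the table.

```python
from dataclasses import dataclass
from typing import Optional

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class ModelPrice:
    """Hypothetical container for one row's prices, in USD per million tokens."""
    input_per_million: float
    output_per_million: float
    # Optional long-context tier, such as the ">200k:" entries in the table.
    tier_threshold_tokens: Optional[int] = None
    tier_input_per_million: Optional[float] = None
    tier_output_per_million: Optional[float] = None


def request_cost(price: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request."""
    in_rate = price.input_per_million
    out_rate = price.output_per_million
    # Assumption: once the prompt exceeds the threshold, the tier rates
    # apply to the entire request, not only to the tokens past the threshold.
    if (price.tier_threshold_tokens is not None
            and input_tokens > price.tier_threshold_tokens):
        if price.tier_input_per_million is not None:
            in_rate = price.tier_input_per_million
        if price.tier_output_per_million is not None:
            out_rate = price.tier_output_per_million
    return (input_tokens * in_rate + output_tokens * out_rate) / TOKENS_PER_MILLION


# Prices copied from the table: claude-sonnet-4 and gemini-2.5-pro.
claude_sonnet_4 = ModelPrice(3.00, 15.00)
gemini_2_5_pro = ModelPrice(1.25, 10.00, 200_000, 2.50, 15.00)

print(f"${request_cost(claude_sonnet_4, 10_000, 2_000):.4f}")  # $0.0600
print(f"${request_cost(gemini_2_5_pro, 250_000, 1_000):.4f}")  # $0.6400
```

For rows that list two prices per column, the same function can be run once with each price to bracket the cost.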

Showing 1016 models (491 rows displayed)
Provider | Model | Max Input Tokens | Input Token Price | Output Token Price | GPQA (Diamond) | MATH (Level 5) | OTIS Mock AIME 24-25 | Aider Polyglot
openai, openrouter chatgpt-4o-latest (2 endpoints) 128k $5.00 $15.00 45.3%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-5-sonnet (12 endpoints) 200k $3.00 $15.00 51.6%
openrouter, anthropic, vertex_ai-anthropic_models, bedrock_converse, bedrock claude-3.7-sonnet (8 endpoints) 200k $3.00 $15.00 64.9%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-opus-4 (7 endpoints) 200k $15.00 $75.00 72%
vertex_ai-anthropic_models, openrouter, anthropic, bedrock_converse claude-sonnet-4 (8 endpoints) 200k $3.00 $15.00 61.3%
openrouter, vertex_ai-mistral_models, codestral, mistral codestral-latest (7 endpoints) 32k 262k $0.00 $1.00 $0.00 $3.00 11.1%
lambda, openrouter, fireworks_ai, azure_ai, bedrock_converse, deepseek, sambanova deepseek-r1 (12 endpoints) 32k 164k $0.00 $5.00 $0.00 $8.00 71.4%
openrouter, fireworks_ai, azure_ai, deepseek, sambanova deepseek-v3 (9 endpoints) 16k 163k $0.00 $3.00 $0.00 $4.56 55.1%
vertex_ai-language-models, gemini gemini-2.0-flash (2 endpoints) 1048k $0.10 $0.40
vertex_ai-language-models, gemini gemini-2.0-flash-lite (2 endpoints) 1048k $0.075 $0.30
openrouter, gemini, vertex_ai-language-models gemini-2.5-flash-preview (7 endpoints) 1048k $0.15 $0.30 $0.60 $3.50 55.1%
vertex_ai-language-models, gemini, openrouter gemini-2.5-pro-exp-03-25 (4 endpoints) 1048k $0.00 $1.25 (>200k: $0.00 $2.50) $0.00 $10.00 (>200k: $0.00 $15.00)
openrouter, vertex_ai-language-models, gemini gemini-2.5-pro-preview (7 endpoints) 1048k $1.25 (>200k: $2.50) $10.00 (>200k: $15.00) 83.1%
openai, azure, openrouter gpt-4.1 (5 endpoints) 1047k $2.00 $8.00 52.4%
openai, azure, openrouter gpt-4.5-preview (4 endpoints) 128k $75.00 $150.00 44.9%
openai, azure, openrouter gpt-4o (20 endpoints) 128k $2.50 $5.00 $10.00 $15.00 23.1%
openai, azure, openrouter gpt-4o-mini (9 endpoints) 128k $0.15 $0.165 $0.60 $0.66 3.6%
xai, openrouter grok-3-beta (3 endpoints) 131k $3.00 $15.00 53.3%
xai, openrouter grok-3-mini-beta (3 endpoints) 131k $0.30 $0.50 49.3%
openrouter, lambda llama-4-maverick (3 endpoints) 128k 1048k $0.00 $0.20 $0.00 $0.60 15.6%
openrouter, lambda llama-4-scout (3 endpoints) 64k 1048k $0.00 $0.10 $0.00 $0.30
azure_ai, vertex_ai-mistral_models, openrouter, mistral, bedrock mistral-small (7 endpoints) 32k 128k $0.10 $1.00 $0.30 $3.00
openai, azure, openrouter o1 (7 endpoints) 200k $15.00 $16.50 $60.00 $66.00 61.7%
openai, openrouter, azure o1-mini (8 endpoints) 128k $1.10 $3.00 $4.40 $12.00 32.9%
openrouter openai/o1-pro (o1-pro) 200k $150.00 $600.00
openai, azure, openrouter o3 (5 endpoints) 200k $2.00 $10.00 $8.00 $40.00 81.3%
openai, azure, openrouter o3-mini (8 endpoints) 200k $1.10 $1.21 $4.40 $4.84 60.4%
openrouter openai/o3-pro (o3-pro) 200k $20.00 $80.00 84.9%
openai, azure, openrouter o4-mini (6 endpoints) 200k $1.10 $4.40 72%
openrouter aion-labs/aion-1.0 (aion-1.0) 131k $4.00 $8.00
openrouter aion-labs/aion-1.0-mini (aion-1.0-mini) 131k $0.70 $1.40
openrouter aion-labs/aion-rp-llama-3.1-8b (aion-rp-llama-3.1-8b) 32k $0.20 $0.20
deepinfra deepinfra/deepinfra/airoboros-70b (airoboros-70b) 4k $0.70 $0.90
deepinfra deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 (airoboros-l2-70b-gpt4-1.4.1) 4k $0.70 $0.90
openrouter thedrummer/anubis-70b-v1.1 (anubis-70b-v1.1) 131k $0.30 $0.80
openrouter thedrummer/anubis-pro-105b-v1 (anubis-pro-105b-v1) 131k $0.80 $1.00
openrouter arcee-ai/arcee-blitz (arcee-blitz) 32k $0.45 $0.75
openrouter arcee-ai/caller-large (caller-large) 32k $0.55 $0.85
vertex_ai-chat-models, palm chat-bison (2 endpoints) 8k $0.125 $0.125
vertex_ai-chat-models, palm chat-bison@001 (2 endpoints) 8k $0.125 $0.125
vertex_ai-chat-models chat-bison@002 8k $0.125 $0.125
vertex_ai-chat-models chat-bison-32k 32k $0.125 $0.125
vertex_ai-chat-models chat-bison-32k@002 32k $0.125 $0.125
nlp_cloud chatdolphin 16k $0.50 $0.50
openrouter, anthropic claude-2 (3 endpoints) 100k 200k $8.00 $24.00
openrouter anthropic/claude-2.0:beta (claude-2.0:beta) 100k $8.00 $24.00
anthropic, openrouter claude-2.1 (2 endpoints) 200k $8.00 $24.00
openrouter anthropic/claude-2.1:beta (claude-2.1:beta) 200k $8.00 $24.00
openrouter anthropic/claude-2:beta (claude-2:beta) 200k $8.00 $24.00
bedrock, openrouter, anthropic, vertex_ai-anthropic_models claude-3.5-haiku (9 endpoints) 200k $0.25 $1.00 $1.25 $5.00 28%
openrouter anthropic/claude-3.5-haiku-20241022:beta (claude-3.5-haiku-20241022:beta) 200k $0.80 $4.00
openrouter anthropic/claude-3.5-haiku:beta (claude-3.5-haiku:beta) 200k $0.80 $4.00
openrouter anthropic/claude-3.5-sonnet-20240620:beta (claude-3.5-sonnet-20240620:beta) 200k $3.00 $15.00
bedrock claude-3-5-sonnet-20241022-v2.0 (4 endpoints) 200k $3.00 $15.00
openrouter anthropic/claude-3.5-sonnet:beta (claude-3.5-sonnet:beta) 200k $3.00 $15.00
vertex_ai-anthropic_models claude-3-5-sonnet-v2 (2 endpoints) 200k $3.00 $15.00
openrouter anthropic/claude-3.7-sonnet:beta (claude-3.7-sonnet:beta) 200k $3.00 $15.00
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-haiku (8 endpoints) 200k $0.25 $1.25
openrouter anthropic/claude-3-haiku:beta (claude-3-haiku:beta) 200k $0.25 $1.25
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-opus (8 endpoints) 200k $15.00 $75.00
openrouter anthropic/claude-3-opus:beta (claude-3-opus:beta) 200k $15.00 $75.00
vertex_ai-anthropic_models, openrouter, anthropic, bedrock claude-3-sonnet (8 endpoints) 200k $3.00 $15.00
openrouter anthropic/claude-3-sonnet:beta (claude-3-sonnet:beta) 200k $3.00 $15.00
bedrock claude-instant-v1 (5 endpoints) 100k $0.80 $2.48 $2.40 $8.38
bedrock claude-v1 (5 endpoints) 100k $8.00 $24.00
bedrock claude-v2 (5 endpoints) 100k $8.00 $24.00
bedrock claude-v2.1 (5 endpoints) 100k $8.00 $24.00
vertex_ai-code-text-models code-bison 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@001 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@002 6k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison-32k 32k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison-32k@002 32k $0.125 $0.125
vertex_ai-code-chat-models codechat-bison@latest 6k $0.125 $0.125
perplexity, deepinfra, anyscale codellama-34b-instruct (3 endpoints) 4k 16k $0.35 $1.00 $0.60 $1.40
perplexity, anyscale codellama-70b-instruct (2 endpoints) 4k 16k $0.70 $1.00 $1.00 $2.80
cloudflare cloudflare/@hf/thebloke/codellama-7b-instruct-awq (codellama-7b-instruct-awq) 4k $1.923 $1.923
openrouter alfredpros/codellama-7b-instruct-solidity (codellama-7b-instruct-solidity) 4k $0.80 $1.20
openrouter arcee-ai/coder-large (coder-large) 32k $0.50 $0.80
vertex_ai-mistral_models vertex_ai/codestral@latest (codestral@latest) 128k $0.20 $0.60
mistral mistral/codestral-mamba-latest (codestral-mamba-latest) 256k $0.25 $0.25
openrouter openai/codex-mini (codex-mini) 200k $1.50 $6.00
openrouter cohere/command (command) 4k $1.00 $2.00
openrouter, cohere_chat command-a (2 endpoints) 256k $2.50 $10.00
cohere_chat command-light 4k $0.30 $0.60
bedrock cohere.command-light-text-v14 (command-light-text-v14) 4k $0.30 $0.60
cohere_chat, openrouter, bedrock command-r (6 endpoints) 128k $0.15 $0.50 $0.60 $1.50
openrouter, cohere_chat command-r7b-12-2024 (2 endpoints) 128k $0.0375 $0.15 $0.0375 $0.15
cohere_chat, openrouter, azure, bedrock command-r-plus (7 endpoints) 128k $2.50 $3.00 $10.00 $15.00
bedrock cohere.command-text-v14 (command-text-v14) 4k $1.50 $2.00
azure computer-use-preview (2 endpoints) 8k $3.00 $12.00
openrouter openrouter/cypher-alpha:free (cypher-alpha) 1000k $0.00 $0.00
databricks databricks/databricks-claude-3-7-sonnet (databricks-claude-3-7-sonnet) 200k $2.50 $17.857
databricks databricks/databricks-dbrx-instruct (databricks-dbrx-instruct) 32k $0.75 $2.249
databricks databricks/databricks-llama-2-70b-chat (databricks-llama-2-70b-chat) 4k $0.50 $1.50
databricks databricks/databricks-llama-4-maverick (databricks-llama-4-maverick) 128k $5.00 $15.00
databricks databricks/databricks-meta-llama-3-1-405b-instruct (databricks-meta-llama-3-1-405b-instruct) 128k $5.00 $15.00
databricks databricks/databricks-meta-llama-3-1-70b-instruct (databricks-meta-llama-3-1-70b-instruct) 128k $1.00 $3.00
databricks databricks/databricks-meta-llama-3-3-70b-instruct (databricks-meta-llama-3-3-70b-instruct) 128k $1.00 $3.00
databricks databricks/databricks-meta-llama-3-70b-instruct (databricks-meta-llama-3-70b-instruct) 128k $1.00 $3.00
databricks databricks/databricks-mixtral-8x7b-instruct (databricks-mixtral-8x7b-instruct) 4k $0.50 $0.999
databricks databricks/databricks-mpt-30b-instruct (databricks-mpt-30b-instruct) 8k $0.999 $0.999
databricks databricks/databricks-mpt-7b-instruct (databricks-mpt-7b-instruct) 8k $0.50 $0.00
openrouter agentica-org/deepcoder-14b-preview:free (deepcoder-14b-preview) 96k $0.00 $0.00
openrouter nousresearch/deephermes-3-llama-3-8b-preview:free (deephermes-3-llama-3-8b-preview) 131k $0.00 $0.00
openrouter deepseek/deepseek-chat:free (deepseek-chat) 163k $0.00 $0.00
deepseek deepseek/deepseek-coder (deepseek-coder) 128k $0.14 $0.28
fireworks_ai fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct (deepseek-coder-v2-instruct) 65k $1.20 $1.20
lambda deepseek-llama3.3-70b 131k $0.20 $0.60
openrouter deepseek/deepseek-prover-v2 (deepseek-prover-v2) 131k $0.50 $2.18
openrouter deepseek-r1-0528-qwen3-8b (2 endpoints) 32k 131k $0.00 $0.01 $0.00 $0.02
fireworks_ai fireworks_ai/accounts/fireworks/models/deepseek-r1-basic (deepseek-r1-basic) 128k $0.55 $2.19
openrouter, sambanova, groq, nscale deepseek-r1-distill-llama-70b (5 endpoints) 8k 131k $0.00 $0.75 $0.00 $1.40
openrouter, nscale deepseek-r1-distill-llama-8b (2 endpoints) 32k $0.025 $0.04 $0.025 $0.04
openrouter, nscale deepseek-r1-distill-qwen-14b (3 endpoints) 64k $0.00 $0.15 $0.00 $0.15
openrouter, nscale deepseek-r1-distill-qwen-1.5b (2 endpoints) 131k $0.09 $0.18 $0.09 $0.18
openrouter, nscale deepseek-r1-distill-qwen-32b (2 endpoints) 131k $0.075 $0.15 $0.15
openrouter, nscale deepseek-r1-distill-qwen-7b (2 endpoints) 131k $0.10 $0.20 $0.20
openrouter tngtech/deepseek-r1t-chimera:free (deepseek-r1t-chimera) 163k $0.00 $0.00
openrouter deepseek/deepseek-v3-base:free (deepseek-v3-base) 163k $0.00 $0.00
openrouter, mistral devstral-small (3 endpoints) 32k 128k $0.00 $0.10 $0.00 $0.30
deepinfra deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b (dolphin-2.6-mixtral-8x7b) 32k $0.27 $0.27
openrouter cognitivecomputations/dolphin3.0-mistral-24b:free (dolphin3.0-mistral-24b) 32k $0.00 $0.00
openrouter cognitivecomputations/dolphin3.0-r1-mistral-24b:free (dolphin3.0-r1-mistral-24b) 32k $0.00 $0.00
openrouter cognitivecomputations/dolphin-mixtral-8x22b (dolphin-mixtral-8x22b) 16k $0.90 $0.90
openrouter baidu/ernie-4.5-300b-a47b (ernie-4.5-300b-a47b) 123k $0.28 $1.10
openrouter eva-unit-01/eva-llama-3.33-70b (eva-llama-3.33-70b) 16k $4.00 $6.00
openrouter eva-unit-01/eva-qwen-2.5-32b (eva-qwen-2.5-32b) 16k $2.60 $3.40
openrouter eva-unit-01/eva-qwen-2.5-72b (eva-qwen-2.5-72b) 16k $4.00 $6.00
openrouter sao10k/fimbulvetr-11b-v2 (fimbulvetr-11b-v2) 4k $0.80 $1.20
fireworks_ai fireworks_ai/accounts/fireworks/models/firefunction-v2 (firefunction-v2) 8k $0.90 $0.90
vertex_ai-language-models gemini-1.0-pro 32k $0.50 $1.50
vertex_ai-language-models gemini-1.0-pro-001 32k $0.50 $1.50
vertex_ai-language-models gemini-1.0-pro-002 32k $0.50 $1.50
vertex_ai-vision-models gemini-1.0-pro-vision 16k $0.50 $1.50
vertex_ai-vision-models gemini-1.0-pro-vision-001 16k $0.50 $1.50
vertex_ai-language-models gemini-1.0-ultra 8k $0.50 $1.50
vertex_ai-language-models gemini-1.0-ultra-001 8k $0.50 $1.50
gemini, vertex_ai-language-models gemini-1.5-flash (3 endpoints) 1000k 1048k $0.075 (>128k: $0.15 $1.00) $0.30 (>128k: $0.60)
gemini, vertex_ai-language-models gemini-1.5-flash-001 (2 endpoints) 1000k 1048k $0.075 (>128k: $0.15 $1.00) $0.30 (>128k: $0.60)
vertex_ai-language-models, gemini gemini-1.5-flash-002 (2 endpoints) 1048k $0.075 (>128k: $0.15 $1.00) $0.30 (>128k: $0.60)
gemini gemini/gemini-1.5-flash-8b (gemini-1.5-flash-8b) 1048k $0.00 (>128k: $0.00) $0.00 (>128k: $0.00)
gemini, vertex_ai-language-models gemini-1.5-flash-exp-0827 (2 endpoints) 1000k 1048k $0.00 $0.0047 (>128k: $0.00 $1.00) $0.00 $0.0047 (>128k: $0.00 $0.0094)
vertex_ai-language-models gemini-1.5-flash-preview-0514 1000k $0.075 (>128k: $1.00) $0.0047 (>128k: $0.0094)
vertex_ai-language-models, gemini gemini-1.5-pro (3 endpoints) 1048k 2097k $1.25 $3.50 (>128k: $2.50 $7.00) $1.05 $10.50 (>128k: $10.00 $21.00)
gemini, vertex_ai-language-models gemini-1.5-pro-001 (2 endpoints) 1000k 2097k $1.25 $3.50 (>128k: $2.50 $7.00) $5.00 $10.50 (>128k: $10.00 $21.00)
vertex_ai-language-models, gemini gemini-1.5-pro-002 (2 endpoints) 2097k $1.25 $3.50 (>128k: $2.50 $7.00) $5.00 $10.50 (>128k: $10.00 $21.00)
vertex_ai-language-models gemini-1.5-pro-preview-0215 (3 endpoints) 1000k $0.0781 (>128k: $0.1563) $0.3125 (>128k: $0.625)
gemini, openrouter, vertex_ai-language-models gemini-2.0-flash-001 (3 endpoints) 1048k $0.10 $0.15 $0.40 $0.60
openrouter google/gemini-2.0-flash-exp:free (gemini-2.0-flash-exp) 1048k $0.00 $0.00 22.2%
vertex_ai-language-models, openrouter gemini-2.0-flash-lite-001 (2 endpoints) 1048k $0.075 $0.30
gemini gemini/gemini-2.0-flash-lite-preview-02-05 (gemini-2.0-flash-lite-preview-02-05) 1048k $0.075 $0.30
vertex_ai-language-models, gemini gemini-2.0-flash-preview-image-generation (2 endpoints) 1048k $0.10 $0.40
vertex_ai-language-models, gemini gemini-2.0-flash-thinking-exp (4 endpoints) 1048k $0.00 (>128k: $0.00) $0.00 (>128k: $0.00) 18.2%
gemini, vertex_ai-language-models, openrouter gemini-2.5-flash (3 endpoints) 1048k $0.30 $2.50
gemini, vertex_ai-language-models, openrouter gemini-2.5-flash-lite-preview-06-17 (3 endpoints) 1048k $0.10 $0.40
openrouter google/gemini-2.5-flash-preview-05-20:thinking (gemini-2.5-flash-preview-05-20:thinking) 1048k $0.15 $3.50
gemini gemini/gemini-2.5-flash-preview-tts (gemini-2.5-flash-preview-tts) 1048k $0.15 $0.60
vertex_ai-language-models, gemini, openrouter gemini-2.5-pro (3 endpoints) 1048k $1.25 (>200k: $2.50) $10.00 (>200k: $15.00)
vertex_ai-language-models, gemini gemini-2.5-pro-preview-tts (2 endpoints) 1048k $1.25 (>200k: $2.50) $10.00 (>200k: $15.00)
openrouter google/gemini-flash-1.5 (gemini-flash-1.5) 1000k $0.075 $0.30
openrouter google/gemini-flash-1.5-8b (gemini-flash-1.5-8b) 1000k $0.0375 $0.15
vertex_ai-language-models gemini-flash-experimental 1000k $0.00 $0.00
gemini gemini/gemini-gemma-2-27b-it (gemini-gemma-2-27b-it) $0.35 $1.05
gemini gemini/gemini-gemma-2-9b-it (gemini-gemma-2-9b-it) $0.35 $1.05
gemini, vertex_ai-language-models gemini-pro (2 endpoints) 32k $0.35 $0.50 (>128k: $0.70) $1.05 $1.50 (>128k: $2.10)
openrouter google/gemini-pro-1.5 (gemini-pro-1.5) 2000k $1.25 $5.00
vertex_ai-language-models gemini-pro-experimental 1000k $0.00 $0.00
gemini, vertex_ai-vision-models gemini-pro-vision (2 endpoints) 16k 30k $0.35 $0.50 (>128k: $0.70) $1.05 $1.50 (>128k: $2.10)
openrouter google/gemma-2-27b-it (gemma-2-27b-it) 8k $0.80 $0.80
openrouter, groq gemma-2-9b-it (3 endpoints) 8k $0.00 $0.20 $0.00 $0.20
openrouter gemma-3-12b-it (2 endpoints) 96k 131k $0.00 $0.05 $0.00 $0.10
gemini, openrouter gemma-3-27b-it (3 endpoints) 96k 131k $0.00 $0.09 (>128k: $0.00) $0.00 $0.17 (>128k: $0.00) 4.9%
openrouter gemma-3-4b-it (2 endpoints) 32k 131k $0.00 $0.02 $0.00 $0.04
openrouter gemma-3n-e4b-it (2 endpoints) 8k 32k $0.00 $0.02 $0.00 $0.04
groq, anyscale gemma-7b-it (2 endpoints) 8k $0.07 $0.15 $0.07 $0.15
openrouter glm-4-32b (2 endpoints) 32k 32k $0.00 $0.24 $0.00 $0.24
openrouter thudm/glm-z1-32b:free (glm-z1-32b) 32k $0.00 $0.00
openrouter alpindale/goliath-120b (goliath-120b) 6k $9.00 $11.00
openai, azure, openrouter gpt-3.5-turbo (11 endpoints) 4k 16k $0.20 $1.50 $1.50 $2.00
openai, azure, openrouter gpt-3.5-turbo-16k (4 endpoints) 16k $3.00 $4.00
openrouter openai/gpt-3.5-turbo-instruct (gpt-3.5-turbo-instruct) 4k $1.50 $2.00
openai, azure, openrouter gpt-4 (6 endpoints) 8k 8k $30.00 $60.00
openai, azure gpt-4-0125-preview (2 endpoints) 128k $10.00 $30.00
openai, azure, openrouter gpt-4-1106-preview (3 endpoints) 128k $10.00 $30.00
openai, azure, openrouter gpt-4.1-mini (5 endpoints) 1047k $0.40 $1.60 32.4%
openai, azure, openrouter gpt-4.1-nano (5 endpoints) 1047k $0.10 $0.40 8.9%
azure gpt-4-32k (2 endpoints) 32k $60.00 $120.00
openai, azure gpt-4o-audio-preview (5 endpoints) 128k $2.50 $10.00
openrouter openai/gpt-4o:extended (gpt-4o:extended) 128k $6.00 $18.00
openai, azure gpt-4o-mini-audio-preview (3 endpoints) 128k $0.15 $2.50 $0.60 $10.00
openai, azure gpt-4o-mini-realtime-preview (5 endpoints) 128k $0.60 $0.66 $2.40 $2.64
openai, openrouter gpt-4o-mini-search-preview (3 endpoints) 128k $0.15 $0.60
openai, azure gpt-4o-realtime-preview (9 endpoints) 128k $5.00 $5.50 $20.00 $22.00
openai, openrouter gpt-4o-search-preview (3 endpoints) 128k $2.50 $10.00
openai, azure, openrouter gpt-4-turbo (5 endpoints) 128k $10.00 $30.00
openai, openrouter gpt-4-turbo-preview (2 endpoints) 128k $10.00 $30.00
azure azure/gpt-4-turbo-vision-preview (gpt-4-turbo-vision-preview) 128k $10.00 $30.00
watsonx watsonx/ibm/granite-3-8b-instruct (granite-3-8b-instruct) 8k $200.00 $200.00
xai, openrouter grok-2 (4 endpoints) 131k $2.00 $10.00
xai, openrouter grok-2-vision (4 endpoints) 32k $2.00 $10.00
xai, openrouter grok-3 (3 endpoints) 131k $3.00 $15.00
xai xai/grok-3-fast-beta (grok-3-fast-beta) 131k $5.00 $25.00
xai xai/grok-3-fast-latest (grok-3-fast-latest) 131k $5.00 $25.00
xai, openrouter grok-3-mini (3 endpoints) 131k $0.30 $0.50
xai grok-3-mini-fast (2 endpoints) 131k $0.60 $4.00
xai xai/grok-3-mini-fast-beta (grok-3-mini-fast-beta) 131k $0.60 $4.00
xai xai/grok-beta (grok-beta) 131k $5.00 $15.00
xai, openrouter grok-vision-beta (2 endpoints) 8k $5.00 $15.00
openrouter nousresearch/hermes-2-pro-llama-3-8b (hermes-2-pro-llama-3-8b) 131k $0.025 $0.04
lambda hermes3-405b 131k $0.80 $0.80
lambda hermes3-70b 131k $0.12 $0.30
lambda hermes3-8b 131k $0.025 $0.04
openrouter nousresearch/hermes-3-llama-3.1-405b (hermes-3-llama-3.1-405b) 131k $0.70 $0.80
openrouter nousresearch/hermes-3-llama-3.1-70b (hermes-3-llama-3.1-70b) 131k $0.10 $0.28
openrouter inflection/inflection-3-pi (inflection-3-pi) 8k $2.50 $10.00
openrouter inflection/inflection-3-productivity (inflection-3-productivity) 8k $2.50 $10.00
openrouter opengvlab/internvl3-14b (internvl3-14b) 12k $0.20 $0.40
bedrock ai21.j2-mid-v1 (j2-mid-v1) 8k $12.50 $12.50
bedrock ai21.j2-ultra-v1 (j2-ultra-v1) 8k $18.80 $18.80
vertex_ai-ai21_models, ai21 jamba-1.5 (2 endpoints) 256k $0.20 $0.40
vertex_ai-ai21_models, ai21, bedrock jamba-1.5-large (3 endpoints) 256k $2.00 $8.00
vertex_ai-ai21_models, ai21 jamba-1.5-large@001 (2 endpoints) 256k $2.00 $8.00
vertex_ai-ai21_models, ai21, bedrock jamba-1.5-mini (3 endpoints) 256k $0.20 $0.40
vertex_ai-ai21_models, ai21 jamba-1.5-mini@001 (2 endpoints) 256k $0.20 $0.40
openrouter ai21/jamba-1.6-large (jamba-1.6-large) 256k $2.00 $8.00
openrouter ai21/jamba-1.6-mini (jamba-1.6-mini) 256k $0.20 $0.40
azure_ai, bedrock jamba-instruct (2 endpoints) 70k $0.50 $0.70
ai21 jamba-large-1.6 256k $2.00 $8.00
ai21 jamba-mini-1.6 256k $0.20 $0.40
openrouter moonshotai/kimi-dev-72b:free (kimi-dev-72b) 131k $0.00 $0.00
openrouter moonshotai/kimi-vl-a3b-thinking:free (kimi-vl-a3b-thinking) 131k $0.00 $0.00
openrouter sao10k/l3.1-euryale-70b (l3.1-euryale-70b) 32k $0.65 $0.75
openrouter sao10k/l3.3-euryale-70b (l3.3-euryale-70b) 131k $0.65 $0.75
openrouter sao10k/l3-euryale-70b (l3-euryale-70b) 8k $1.48 $1.48
openrouter sao10k/l3-lunaris-8b (l3-lunaris-8b) 8k $0.02 $0.05
gemini gemini/learnlm-1.5-pro-experimental (learnlm-1.5-pro-experimental) 32k $0.00 (>128k: $0.00) $0.00 (>128k: $0.00)
openrouter liquid/lfm-3b (lfm-3b) 32k $0.02 $0.02
lambda, openrouter lfm-40b (2 endpoints) 32k 66k $0.15 $0.15
openrouter liquid/lfm-7b (lfm-7b) 32k $0.01 $0.01
replicate replicate/meta/llama-2-13b (llama-2-13b) 4k $0.10 $0.50
replicate, deepinfra, anyscale llama-2-13b-chat (3 endpoints) 4k $0.10 $0.25 $0.22 $0.50
bedrock meta.llama2-13b-chat-v1 (llama2-13b-chat-v1) 4k $0.75 $1.00
replicate, groq llama-2-70b (2 endpoints) 4k $0.65 $0.70 $0.80 $2.75
replicate, deepinfra, perplexity, anyscale llama-2-70b-chat (4 endpoints) 4k $0.65 $1.00 $0.90 $2.80
bedrock meta.llama2-70b-chat-v1 (llama2-70b-chat-v1) 4k $1.95 $2.56
replicate replicate/meta/llama-2-7b (llama-2-7b) 4k $0.05 $0.25
replicate, deepinfra, anyscale llama-2-7b-chat (3 endpoints) 4k $0.05 $0.15 $0.13 $0.25
cloudflare cloudflare/@cf/meta/llama-2-7b-chat-fp16 (llama-2-7b-chat-fp16) 3k $1.923 $1.923
cloudflare cloudflare/@cf/meta/llama-2-7b-chat-int8 (llama-2-7b-chat-int8) 2k $1.923 $1.923
openrouter meta-llama/llama-3.1-405b (llama-3.1-405b) 32k $2.00 $2.00
bedrock, openrouter llama-3.1-405b-instruct (3 endpoints) 32k 128k $0.80 $5.32 $0.80 $16.00
lambda llama3.1-405b-instruct-fp8 131k $0.80 $0.80
groq groq/llama-3.1-405b-reasoning (llama-3.1-405b-reasoning) 8k $0.59 $0.79
cerebras cerebras/llama3.1-70b (llama3.1-70b) 128k $0.60 $0.60
openrouter, perplexity, bedrock llama-3.1-70b-instruct (4 endpoints) 128k 131k $0.10 $1.00 $0.28 $1.00
lambda llama3.1-70b-instruct-fp8 131k $0.12 $0.30
groq groq/llama-3.1-70b-versatile (llama-3.1-70b-versatile) 8k $0.59 $0.79
cerebras cerebras/llama3.1-8b (llama3.1-8b) 128k $0.10 $0.10
groq groq/llama-3.1-8b-instant (llama-3.1-8b-instant) 128k $0.05 $0.08
perplexity, openrouter, lambda, bedrock, nscale llama-3.1-8b-instruct (6 endpoints) 128k 131k $0.015 $0.22 $0.02 $0.22
openrouter neversleep/llama-3.1-lumimaid-70b (llama-3.1-lumimaid-70b) 16k $2.50 $3.00
openrouter neversleep/llama-3.1-lumimaid-8b (llama-3.1-lumimaid-8b) 32k $0.20 $1.25
openrouter nvidia/llama-3.1-nemotron-70b-instruct (llama-3.1-nemotron-70b-instruct) 131k $0.12 $0.30
lambda llama3.1-nemotron-70b-instruct-fp8 131k $0.12 $0.30
openrouter llama-3.1-nemotron-ultra-253b-v1 (2 endpoints) 131k $0.00 $0.60 $0.00 $1.80
perplexity perplexity/llama-3.1-sonar-huge-128k-online (llama-3.1-sonar-huge-128k-online) 127k $5.00 $5.00
perplexity perplexity/llama-3.1-sonar-large-128k-chat (llama-3.1-sonar-large-128k-chat) 131k $1.00 $1.00
perplexity, openrouter llama-3.1-sonar-large-128k-online (2 endpoints) 127k $1.00 $1.00
perplexity perplexity/llama-3.1-sonar-small-128k-chat (llama-3.1-sonar-small-128k-chat) 131k $0.20 $0.20
perplexity, openrouter llama-3.1-sonar-small-128k-online (2 endpoints) 127k $0.20 $0.20
openrouter scb10x/llama3.1-typhoon2-70b-instruct (llama3.1-typhoon2-70b-instruct) 8k $0.88 $0.88
bedrock llama3-2-11b-instruct-v1.0 (2 endpoints) 128k $0.35 $0.35
groq groq/llama-3.2-11b-text-preview (llama-3.2-11b-text-preview) 8k $0.18 $0.18
openrouter, azure_ai llama-3.2-11b-vision-instruct (3 endpoints) 128k 131k $0.00 $0.37 $0.00 $0.37
groq groq/llama-3.2-11b-vision-preview (llama-3.2-11b-vision-preview) 8k $0.18 $0.18
openrouter, bedrock llama-3.2-1b-instruct (4 endpoints) 128k 131k $0.005 $0.13 $0.01 $0.13
groq groq/llama-3.2-1b-preview (llama-3.2-1b-preview) 8k $0.04 $0.04
lambda, bedrock, openrouter llama3.2-3b-instruct (5 endpoints) 20k 131k $0.003 $0.19 $0.006 $0.19
groq groq/llama-3.2-3b-preview (llama-3.2-3b-preview) 8k $0.06 $0.06
bedrock llama3-2-90b-instruct-v1.0 (2 endpoints) 128k $2.00 $2.00
groq groq/llama-3.2-90b-text-preview (llama-3.2-90b-text-preview) 8k $0.90 $0.90
openrouter, azure_ai llama-3.2-90b-vision-instruct (2 endpoints) 128k 131k $1.20 $2.04 $1.20 $2.04
vertex_ai-llama_models vertex_ai/meta/llama-3.2-90b-vision-instruct-maas (llama-3.2-90b-vision-instruct-maas) 128k $0.00 $0.00
groq groq/llama-3.2-90b-vision-preview (llama-3.2-90b-vision-preview) 8k $0.90 $0.90
cerebras cerebras/llama-3.3-70b (llama-3.3-70b) 128k $0.85 $1.20
openrouter, azure_ai, bedrock_converse, nscale llama-3.3-70b-instruct (6 endpoints) 128k 131k $0.00 $0.72 $0.00 $0.72
lambda llama3.3-70b-instruct-fp8 131k $0.12 $0.30
together_ai together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo (llama-3.3-70b-instruct-turbo) $0.88 $0.88
together_ai together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free (llama-3.3-70b-instruct-turbo-free) $0.00 $0.00
groq groq/llama-3.3-70b-specdec (llama-3.3-70b-specdec) 8k $0.59 $0.99
groq groq/llama-3.3-70b-versatile (llama-3.3-70b-versatile) 128k $0.59 $0.79
openrouter llama-3.3-nemotron-super-49b-v1 (2 endpoints) 131k $0.00 $0.13 $0.00 $0.40
vertex_ai-llama_models vertex_ai/meta/llama3-405b-instruct-maas (llama3-405b-instruct-maas) 32k $0.00 $0.00
groq, replicate llama-3-70b (2 endpoints) 8k $0.59 $0.65 $0.79 $2.75
openrouter, replicate, bedrock llama-3-70b-instruct (10 endpoints) 8k $0.30 $4.45 $0.40 $5.88
vertex_ai-llama_models vertex_ai/meta/llama3-70b-instruct-maas (llama3-70b-instruct-maas) 32k $0.00 $0.00
groq, replicate llama-3-8b (2 endpoints) 8k 8k $0.05 $0.08 $0.25
openrouter, bedrock, replicate llama-3-8b-instruct (10 endpoints) 8k 8k $0.03 $0.50 $0.06 $1.01
vertex_ai-llama_models vertex_ai/meta/llama3-8b-instruct-maas (llama3-8b-instruct-maas) 32k $0.00 $0.00
groq groq/llama3-groq-70b-8192-tool-use-preview (llama3-groq-70b-8192-tool-use-preview) 8k $0.89 $0.89
groq groq/llama3-groq-8b-8192-tool-use-preview (llama3-groq-8b-8192-tool-use-preview) 8k $0.19 $0.19
openrouter neversleep/llama-3-lumimaid-70b (llama-3-lumimaid-70b) 8k $4.00 $6.00
openrouter neversleep/llama-3-lumimaid-8b (llama-3-lumimaid-8b) 24k $0.20 $1.25
groq, sambanova llama-4-maverick-17b-128e-instruct (2 endpoints) 131k $0.20 $0.63 $0.60 $1.80
azure_ai azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 (llama-4-maverick-17b-128e-instruct-fp8) 1000k $1.41 $0.35
vertex_ai-llama_models vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas (llama-4-maverick-17b-128e-instruct-maas) 1000k $0.35 $1.15
vertex_ai-llama_models vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas (llama-4-maverick-17b-16e-instruct-maas) 1000k $0.35 $1.15
bedrock_converse llama4-maverick-17b-instruct-v1.0 (2 endpoints) 128k $0.24 $0.97
fireworks_ai fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic (llama4-maverick-instruct-basic) 131k $0.22 $0.88
vertex_ai-llama_models vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas (llama-4-scout-17b-128e-instruct-maas) 10000k $0.25 $0.70
azure_ai, groq, sambanova, nscale llama-4-scout-17b-16e-instruct (4 endpoints) 8k 10000k $0.09 $0.40 $0.29 $0.78
vertex_ai-llama_models vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas (llama-4-scout-17b-16e-instruct-maas) 10000k $0.25 $0.70
bedrock_converse llama4-scout-17b-instruct-v1.0 (2 endpoints) 128k $0.17 $0.66
fireworks_ai fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic (llama4-scout-instruct-basic) 131k $0.15 $0.60
openrouter meta-llama/llama-guard-2-8b (llama-guard-2-8b) 8k $0.20 $0.20
openrouter, groq llama-guard-3-8b (2 endpoints) 8k 131k $0.02 $0.20 $0.06 $0.20
openrouter meta-llama/llama-guard-4-12b (llama-guard-4-12b) 163k $0.05 $0.05
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct (llama-v3p1-405b-instruct) 128k $3.00 $3.00
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct (llama-v3p1-8b-instruct) 16k $0.10 $0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct (llama-v3p2-11b-vision-instruct) 16k $0.20 $0.20
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct (llama-v3p2-1b-instruct) 16k $0.10 $0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct (llama-v3p2-3b-instruct) 16k $0.10 $0.10
fireworks_ai fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct (llama-v3p2-90b-vision-instruct) 16k $0.90 $0.90
openrouter eleutherai/llemma_7b (llemma_7b) 4k $0.80 $1.20
aleph_alpha luminous-base-control $37.50 $41.25
aleph_alpha luminous-extended-control $56.25 $61.875
aleph_alpha luminous-supreme-control $218.75 $240.625
deepinfra deepinfra/lizpreciatior/lzlv_70b_fp16_hf (lzlv_70b_fp16_hf) 4k $0.70 $0.90
openrouter arcee-ai/maestro-reasoning (maestro-reasoning) 131k $0.90 $3.30
openrouter, mistral magistral-medium-latest (3 endpoints) 40k 40k $2.00 $5.00
openrouter mistralai/magistral-medium-2506:thinking (magistral-medium-2506:thinking) 40k $2.00 $5.00
mistral, openrouter magistral-small-latest (3 endpoints) 40k $0.50 $1.50
openrouter alpindale/magnum-72b (magnum-72b) 16k $4.00 $6.00
openrouter anthracite-org/magnum-v2-72b (magnum-v2-72b) 32k $3.00 $3.00
openrouter anthracite-org/magnum-v4-72b (magnum-v4-72b) 16k $2.50 $3.00
openrouter microsoft/mai-ds-r1:free (mai-ds-r1) 163k $0.00 $0.00
openrouter inception/mercury (mercury) 32k $0.25 $1.00
openrouter inception/mercury-coder (mercury-coder) 32k $0.25 $1.00
azure_ai, deepinfra, sambanova meta-llama-3.1-405b-instruct (3 endpoints) 16k 128k $0.90 $5.33 $0.90 $16.00
together_ai together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo (meta-llama-3.1-405b-instruct-turbo) $3.50 $3.50
azure_ai, friendliai meta-llama-3.1-70b-instruct (2 endpoints) 8k 128k $0.60 $2.68 $0.60 $3.54
together_ai together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo (meta-llama-3.1-70b-instruct-turbo) $0.88 $0.88
azure_ai, sambanova, friendliai meta-llama-3.1-8b-instruct (3 endpoints) 8k 128k $0.10 $0.30 $0.10 $0.61
together_ai together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo (meta-llama-3.1-8b-instruct-turbo) $0.18 $0.18
sambanova sambanova/Meta-Llama-3.2-1B-Instruct (meta-llama-3.2-1b-instruct) 16k $0.04 $0.08
sambanova sambanova/Meta-Llama-3.2-3B-Instruct (meta-llama-3.2-3b-instruct) 4k $0.08 $0.16
sambanova sambanova/Meta-Llama-3.3-70B-Instruct (meta-llama-3.3-70b-instruct) 131k $0.60 $1.20
anyscale, azure_ai, deepinfra meta-llama-3-70b-instruct (3 endpoints) 8k 8k $0.59 $1.10 $0.37 $1.00
anyscale, deepinfra meta-llama-3-8b-instruct (2 endpoints) 8k 8k $0.08 $0.15 $0.08 $0.15
sambanova sambanova/Meta-Llama-Guard-3-8B (meta-llama-guard-3-8b) 16k $0.30 $0.30
sagemaker sagemaker/meta-textgeneration-llama-2-13b-f (meta-textgeneration-llama-2-13b-f) 4k $0.00 $0.00
sagemaker sagemaker/meta-textgeneration-llama-2-70b-b-f (meta-textgeneration-llama-2-70b-b-f) 4k $0.00 $0.00
sagemaker sagemaker/meta-textgeneration-llama-2-7b-f (meta-textgeneration-llama-2-7b-f) 4k $0.00 $0.00
openrouter sophosympatheia/midnight-rose-70b (midnight-rose-70b) 4k $0.80 $0.80
openrouter minimax/minimax-01 (minimax-01) 1000k $0.20 $1.10
openrouter minimax/minimax-m1 (minimax-m1) 1000k $0.30 $1.65
openrouter, azure_ai ministral-3b (2 endpoints) 128k 131k $0.04 $0.04
openrouter mistralai/ministral-8b (ministral-8b) 128k $0.10 $0.10
openrouter, perplexity mistral-7b-instruct (3 endpoints) 4k 32k $0.00 $0.07 $0.00 $0.28
deepinfra, anyscale, cloudflare, openrouter mistral-7b-instruct-v0.1 (4 endpoints) 2k 32k $0.11 $1.923 $0.13 $1.923
openrouter, bedrock, replicate mistral-7b-instruct-v0.2 (6 endpoints) 4k 32k $0.05 $0.20 $0.20 $0.26
openrouter mistralai/mistral-7b-instruct-v0.3 (mistral-7b-instruct-v0.3) 32k $0.028 $0.054
replicate replicate/mistralai/mistral-7b-v0.1 (mistral-7b-v0.1) 4k $0.05 $0.25
openrouter, azure_ai, vertex_ai-mistral_models, mistral, bedrock, azure mistral-large (19 endpoints) 32k 131k $2.00 $10.40 $6.00 $31.20
vertex_ai-mistral_models vertex_ai/mistral-large@2411-001 (mistral-large@2411-001) 128k $2.00 $6.00
vertex_ai-mistral_models vertex_ai/mistral-large@latest (mistral-large@latest) 128k $2.00 $6.00
deepinfra deepinfra/amazon/MistralLite (mistrallite) 32k $0.20 $0.20
azure_ai, mistral mistral-medium-latest (5 endpoints) 32k 131k $0.40 $2.70 $2.00 $8.10
openrouter mistralai/mistral-medium-3 (mistral-medium-3) 131k $0.40 $2.00
openrouter, azure_ai, vertex_ai-mistral_models mistral-nemo (4 endpoints) 128k 131k $0.00 $3.00 $0.00 $3.00
vertex_ai-mistral_models vertex_ai/mistral-nemo@latest (mistral-nemo@latest) 128k $0.15 $0.15
openrouter mistralai/​mistral-​saba
mistral-​saba
32k$0.20$0.60
groq groq/​mistral-​saba-​24b
mistral-​saba-​24b
32k$0.79$0.79
openrouter mistral-small-24b-instruct-2501 (2 endpoints)32k$0.00$0.05$0.00$0.08
vertex_ai-mistral_models vertex_ai/​mistral-​small-​2503@001
mistral-​small-​2503@001
32k$1.00$3.00
openrouter mistral-small-3.1-24b-instruct (2 endpoints)96k128k$0.00$0.05$0.00$0.10
openrouter mistral-small-3.2-24b-instruct (2 endpoints)96k128k$0.00$0.05$0.00$0.10
openrouter, mistral mistral-tiny (2 endpoints)32k32k$0.25$0.25
openrouter, fireworks_ai mixtral-8x22b-instruct (2 endpoints)65k$0.90$1.20$0.90$1.20
anyscale, nscale mixtral-8x22b-instruct-v0.1 (2 endpoints)65k$0.60$0.90$0.60$0.90
groq groq/​mixtral-​8x7b-​32768
mixtral-​8x7b-​32768
32k$0.24$0.24
openrouter, perplexity mixtral-8x7b-instruct (2 endpoints)4k32k$0.07$0.08$0.24$0.28
deepinfra, bedrock, anyscale, replicate, together_ai mixtral-8x7b-instruct-v0.1 (8 endpoints)4k32k$0.15$0.60$0.15$1.00
openrouter | mn-celeste-12b (nothingiisreal/mn-celeste-12b) | 16k | $0.80 | $1.20
openrouter | mn-inferor-12b (infermatic/mn-inferor-12b) | 16k | $0.80 | $1.20
openrouter | mn-starcannon-12b (aetherwiing/mn-starcannon-12b) | 16k | $0.80 | $1.20
openrouter | morph-v2 (morph/morph-v2) | 32k | $1.20 | $2.70
openrouter | mythalion-13b (pygmalionai/mythalion-13b) | 4k | $0.80 | $1.20
openrouter, deepinfra | mythomax-l2-13b (2 endpoints) | 4k | $0.065–$0.22 | $0.065–$0.22
openrouter | noromaid-20b (neversleep/noromaid-20b) | 8k | $1.25 | $2.00
openrouter | nous-hermes-2-mixtral-8x7b-dpo (nousresearch/nous-hermes-2-mixtral-8x7b-dpo) | 32k | $0.60 | $0.60
openrouter | nova-lite-v1 (amazon/nova-lite-v1) | 300k | $0.06 | $0.24
bedrock_converse | nova-lite-v1.0 (4 endpoints) | 128k | $0.06–$0.078 | $0.24–$0.312
openrouter | nova-micro-v1 (amazon/nova-micro-v1) | 128k | $0.035 | $0.14
bedrock_converse | nova-micro-v1.0 (4 endpoints) | 300k | $0.035–$0.046 | $0.14–$0.184
bedrock_converse | nova-premier-v1.0 (us.amazon.nova-premier-v1:0) | 1000k | $2.50 | $12.50
openrouter | nova-pro-v1 (amazon/nova-pro-v1) | 300k | $0.80 | $3.20
bedrock_converse | nova-pro-v1.0 (4 endpoints) | 300k | $0.80–$1.05 | $3.20–$4.20
openai, azure, openrouter | o1-preview (8 endpoints) | 128k | $15.00–$16.50 | $60.00–$66.00
deepinfra | openchat_3.5 (deepinfra/openchat/openchat_3.5) | 4k | $0.13 | $0.13
mistral | open-codestral-mamba (mistral/open-codestral-mamba) | 256k | $0.25 | $0.25
openrouter | openhands-lm-32b-v0.1 (all-hands/openhands-lm-32b-v0.1) | 16k | $2.60 | $3.40 | 10.2%
mistral | open-mistral-7b (mistral/open-mistral-7b) | 32k | $0.25 | $0.25
mistral | open-mistral-nemo (2 endpoints) | 128k | $0.30 | $0.30
mistral | open-mixtral-8x22b (mistral/open-mixtral-8x22b) | 65k | $2.00 | $6.00
mistral | open-mixtral-8x7b (mistral/open-mixtral-8x7b) | 32k | $0.70 | $0.70
openrouter | phi-3.5-mini-128k-instruct (microsoft/phi-3.5-mini-128k-instruct) | 128k | $0.10 | $0.10
azure_ai | phi-3.5-mini-instruct (azure_ai/Phi-3.5-mini-instruct) | 128k | $0.13 | $0.52
azure_ai | phi-3.5-moe-instruct (azure_ai/Phi-3.5-MoE-instruct) | 128k | $0.16 | $0.64
azure_ai | phi-3.5-vision-instruct (azure_ai/Phi-3.5-vision-instruct) | 128k | $0.13 | $0.52
azure_ai, openrouter | phi-3-medium-128k-instruct (2 endpoints) | 128k | $0.17–$1.00 | $0.68–$1.00
azure_ai | phi-3-medium-4k-instruct (azure_ai/Phi-3-medium-4k-instruct) | 4k | $0.17 | $0.68
openrouter, azure_ai | phi-3-mini-128k-instruct (2 endpoints) | 128k | $0.10–$0.13 | $0.10–$0.52
azure_ai | phi-3-mini-4k-instruct (azure_ai/Phi-3-mini-4k-instruct) | 4k | $0.13 | $0.52
azure_ai | phi-3-small-128k-instruct (azure_ai/Phi-3-small-128k-instruct) | 128k | $0.15 | $0.60
azure_ai | phi-3-small-8k-instruct (azure_ai/Phi-3-small-8k-instruct) | 8k | $0.15 | $0.60
openrouter, azure_ai | phi-4 (2 endpoints) | 16k | $0.07–$0.125 | $0.14–$0.50
azure_ai | phi-4-mini-instruct (azure_ai/Phi-4-mini-instruct) | 131k | $0.075 | $0.30
openrouter, azure_ai | phi-4-multimodal-instruct (2 endpoints) | 131k | $0.05–$0.08 | $0.10–$0.32
openrouter | phi-4-reasoning-plus (microsoft/phi-4-reasoning-plus) | 32k | $0.07 | $0.35
deepinfra | phind-codellama-34b-v2 (deepinfra/Phind/Phind-CodeLlama-34B-v2) | 16k | $0.60 | $0.60
mistral, openrouter | pixtral-12b (2 endpoints) | 32k–128k | $0.10–$0.15 | $0.10–$0.15
openrouter, mistral | pixtral-large-latest (3 endpoints) | 128k–131k | $2.00 | $6.00
perplexity | pplx-70b-chat (perplexity/pplx-70b-chat) | 4k | $0.70 | $2.80
perplexity | pplx-70b-online (perplexity/pplx-70b-online) | 4k | $0.00 | $2.80
perplexity | pplx-7b-chat (perplexity/pplx-7b-chat) | 8k | $0.07 | $0.28
perplexity | pplx-7b-online (perplexity/pplx-7b-online) | 4k | $0.00 | $0.28
openrouter | qwen-2.5-72b-instruct (2 endpoints) | 32k | $0.00–$0.12 | $0.00–$0.39
openrouter | qwen-2.5-7b-instruct (qwen/qwen-2.5-7b-instruct) | 32k | $0.04 | $0.10
lambda, openrouter, nscale | qwen25-coder-32b-instruct (4 endpoints) | 32k–33k | $0.00–$0.07 | $0.00–$0.20 | 16.4%
nscale | qwen2.5-coder-3b-instruct (nscale/Qwen/Qwen2.5-Coder-3B-Instruct) | - | $0.01 | $0.03
nscale | qwen2.5-coder-7b-instruct (nscale/Qwen/Qwen2.5-Coder-7B-Instruct) | - | $0.01 | $0.03
openrouter | qwen2.5-vl-32b-instruct (2 endpoints) | 8k–128k | $0.00–$0.90 | $0.00–$0.90
openrouter | qwen2.5-vl-72b-instruct (2 endpoints) | 32k–131k | $0.00–$0.25 | $0.00–$0.75
openrouter | qwen-2.5-vl-7b-instruct (qwen/qwen-2.5-vl-7b-instruct) | 32k | $0.20 | $0.20
fireworks_ai, openrouter | qwen2-72b-instruct (2 endpoints) | 32k | $0.90 | $0.90
sambanova | qwen2-audio-7b-instruct (sambanova/Qwen2-Audio-7B-Instruct) | 4k | $0.50 | $100.00
fireworks_ai | qwen2p5-coder-32b-instruct (fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct) | 4k | $0.90 | $0.90
openrouter | qwen3-14b (2 endpoints) | 40k | $0.00–$0.06 | $0.00–$0.24
openrouter | qwen3-235b-a22b (2 endpoints) | 40k | $0.00–$0.13 | $0.00–$0.60
openrouter | qwen3-30b-a3b (2 endpoints) | 40k | $0.00–$0.08 | $0.00–$0.29
cerebras, openrouter, sambanova | qwen-3-32b (4 endpoints) | 8k–128k | $0.00–$0.40 | $0.00–$0.80 | 40%
openrouter | qwen3-8b (2 endpoints) | 40k–128k | $0.00–$0.035 | $0.00–$0.138
openrouter | qwen-max (qwen/qwen-max) | 32k | $1.60 | $6.40
openrouter | qwen-plus (qwen/qwen-plus) | 131k | $0.40 | $1.20
groq | qwen-qwq-32b (groq/qwen-qwq-32b) | 128k | $0.29 | $0.39
openrouter | qwen-turbo (qwen/qwen-turbo) | 1000k | $0.05 | $0.20
openrouter | qwen-vl-max (qwen/qwen-vl-max) | 7k | $0.80 | $3.20
openrouter | qwen-vl-plus (qwen/qwen-vl-plus) | 7k | $0.21 | $0.63
openrouter | qwerky-72b (featherless/qwerky-72b:free) | 32k | $0.00 | $0.00
openrouter, sambanova, nscale | qwq-32b (4 endpoints) | 16k–131k | $0.00–$0.50 | $0.00–$1.00 | 20.9%
openrouter | qwq-32b-arliai-rpr-v1 (arliai/qwq-32b-arliai-rpr-v1:free) | 32k | $0.00 | $0.00
openrouter | qwq-32b-preview (qwen/qwq-32b-preview) | 32k | $0.20 | $0.20
openrouter | r1-1776 (perplexity/r1-1776) | 128k | $2.00 | $8.00
openrouter | reka-flash-3 (rekaai/reka-flash-3:free) | 32k | $0.00 | $0.00
openrouter | remm-slerp-l2-13b (undi95/remm-slerp-l2-13b) | 4k | $0.80 | $1.20
openrouter | rocinante-12b (thedrummer/rocinante-12b) | 32k | $0.20 | $0.50
openrouter | sarvam-m (sarvamai/sarvam-m:free) | 32k | $0.00 | $0.00
openrouter | shisa-v2-llama3.3-70b (shisa-ai/shisa-v2-llama3.3-70b:free) | 32k | $0.00 | $0.00
openrouter | skyfall-36b-v2 (thedrummer/skyfall-36b-v2) | 32k | $0.50 | $0.80
perplexity, openrouter | sonar (2 endpoints) | 127k–128k | $1.00 | $1.00
perplexity, openrouter | sonar-deep-research (2 endpoints) | 128k | $2.00 | $8.00
perplexity | sonar-medium-chat (perplexity/sonar-medium-chat) | 16k | $0.60 | $1.80
perplexity | sonar-medium-online (perplexity/sonar-medium-online) | 12k | $0.00 | $1.80
perplexity, openrouter | sonar-pro (2 endpoints) | 200k | $3.00 | $15.00
perplexity, openrouter | sonar-reasoning (2 endpoints) | 127k–128k | $1.00 | $5.00
perplexity, openrouter | sonar-reasoning-pro (2 endpoints) | 128k | $2.00 | $8.00
perplexity | sonar-small-chat (perplexity/sonar-small-chat) | 16k | $0.07 | $0.28
perplexity | sonar-small-online (perplexity/sonar-small-online) | 12k | $0.00 | $0.28
openrouter | sorcererlm-8x22b (raifle/sorcererlm-8x22b) | 16k | $4.50 | $4.50
openrouter | spotlight (arcee-ai/spotlight) | 131k | $0.18 | $0.18
bedrock | titan-text-express-v1 (amazon.titan-text-express-v1) | 42k | $1.30 | $1.70
bedrock | titan-text-lite-v1 (amazon.titan-text-lite-v1) | 42k | $0.30 | $0.40
bedrock | titan-text-premier-v1.0 (amazon.titan-text-premier-v1:0) | 42k | $0.50 | $1.50
together_ai | together-ai-21.1b-41b | - | $0.80 | $0.80
together_ai | together-ai-41.1b-80b | - | $0.90 | $0.90
together_ai | together-ai-4.1b-8b | - | $0.20 | $0.20
together_ai | together-ai-81.1b-110b | - | $1.80 | $1.80
together_ai | together-ai-8.1b-21b | - | $0.30 | $0.30
together_ai | together-ai-up-to-4b | - | $0.10 | $0.10
openrouter | toppy-m-7b (undi95/toppy-m-7b) | 4k | $0.80 | $1.20
openrouter | unslopnemo-12b (thedrummer/unslopnemo-12b) | 32k | $0.40 | $0.40
openrouter | valkyrie-49b-v1 (thedrummer/valkyrie-49b-v1) | 131k | $0.50 | $0.80
openrouter | virtuoso-large (arcee-ai/virtuoso-large) | 131k | $0.75 | $1.20
openrouter | virtuoso-medium-v2 (arcee-ai/virtuoso-medium-v2) | 131k | $0.50 | $0.80
openrouter | weaver (mancer/weaver) | 8k | $1.50 | $1.50
openrouter | wizardlm-2-8x22b (microsoft/wizardlm-2-8x22b) | 65k | $0.48 | $0.48
deepinfra | yi-34b-chat (deepinfra/01-ai/Yi-34B-Chat) | 4k | $0.60 | $0.60
fireworks_ai, openrouter | yi-large (2 endpoints) | 32k | $3.00 | $3.00
anyscale | zephyr-7b-beta (anyscale/HuggingFaceH4/zephyr-7b-beta) | 16k | $0.15 | $0.15