Comparison of Open Source Models

Comparison and analysis of open source AI models across key performance metrics including quality, performance, inference speed, context window, parameter count & licensing details. Models are considered open source (also commonly referred to as open weights) where their weights are accessible to download. This allows self-hosting on your own infrastructure and enables customizing the model such as through fine-tuning. Click on any model to see detailed metrics. For more details relating to our methodology, see our FAQs.
Z AI logoGLM-5.2 (max) and MiniMax logoMiniMax-M3 are the highest intelligence open source models, followed by DeepSeek logoDeepSeek V4 Pro (Max) & Kimi logoKimi K2.6.

Highlights

Artificial Analysis Openness Index · Higher is better
Updated
Artificial Analysis Intelligence Index · Higher is better
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Score

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Reasoning models are indicated by a lightbulb icon

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.1 incorporates 9 evaluations: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Open Source Language Models Intelligence By Lab Over Time

Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Open Source Models Intelligence By Size Over Time

Artificial Analysis Intelligence Index v4.1 incorporates 9 evaluations: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

  • Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
  • Small: Less than 40B parameters.
  • Medium: Between 40B-150B parameters.
  • Large: Over 150B parameters.

Intelligence

Artificial Analysis Intelligence Index

Artificial Analysis Intelligence Index v4.1 incorporates 9 evaluations: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR
Estimate (independent evaluation forthcoming)
Reasoning models are indicated by a lightbulb icon

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis · Higher is better

Agentic real-world work tasks, (Elo-500)/2000

Agentic tool use

Agentic coding & terminal use

Coding

Reasoning & knowledge

Scientific reasoning

Physics reasoning

Long context reasoning

Instruction following

Long-horizon agentic tasks

Kubernetes incident root-cause analysis

Visual reasoning

Reasoning models are indicated by a lightbulb icon.

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Size

Intelligence Index By Model Size

Artificial Analysis Intelligence Index v4.1 incorporates 9 evaluations: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR
Estimate (independent evaluation forthcoming)
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

  • Tiny: Less than or equal to 4B parameters. These are usually the smallest models in terms of resource demand.
  • Small: Less than 40B parameters.
  • Medium: Between 40B-150B parameters.
  • Large: Over 150B parameters.

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference
Reasoning models are indicated by a lightbulb icon

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Active Parameters

Active parameters at inference time · Artificial Analysis Intelligence Index
Most attractive quadrant
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index · Size in parameters (billions)
Most attractive quadrant
Alibaba
DeepSeek
Google
Kimi
MBZUAI Institute of Foundation Models
MiniMax
Mistral
NVIDIA
OpenAI
Xiaomi
Z AI
Reasoning models are indicated by a lightbulb icon.

Artificial Analysis Intelligence Index v4.1 includes: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

The total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Context Window

Context Window

Context window: tokens limit · Higher is better
Reasoning models are indicated by a lightbulb icon

Larger context windows are relevant to RAG (Retrieval Augmented Generation) LLM workflows which typically involve reasoning and information retrieval of large amounts of data.

Maximum number of combined input & output tokens. Output tokens commonly have a significantly lower limit (varied by model).

Further details

Weights
Provider Benchmarks
GLM-5.2 (max)
Z AI logoZ AI
51
753B
40B active at inference time
1.00M
$0.9
96
FireworksParasailFriendliAI
+7
MiniMax-M3
MiniMax logoMiniMax
44
428B
23B active at inference time
1.00M
$0.2
60
ParasailSiliconFlowTogether AI
+4
DeepSeek V4 Pro (Reasoning, Max Effort)
DeepSeek logoDeepSeek
44
1.6KB
49B active at inference time
1.00M
$0.2
84
FireworksLightning AI
?
+8
Kimi K2.6
Kimi logoKimi
43
1.0KB
32B active at inference time
256k
$0.7
44
Eigen AIDatabricksMicrosoft Azure
+13
MiMo-V2.5-Pro
Xiaomi logoXiaomi
42
1.0KB
42B active at inference time
1.00M
$0.2
54
NovitaXiaomiDeepInfraGMI
Kimi K2.7 Code
Kimi logoKimi
42
1.0KB
32B active at inference time
256k
$0.7
59
NovitaDeepInfraMakora
+6
DeepSeek V4 Pro (Reasoning, High Effort)
DeepSeek logoDeepSeek
41
1.6KB
49B active at inference time
1.00M
$0.2
76
BasetenDeepInfraMakora
+8
DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeek logoDeepSeek
40
284B
13B active at inference time
1.00M
$0.1
109
ParasailSiliconFlowMakora
+4
GLM-5.1 (Reasoning)
Z AI logoZ AI
40
744B
40B active at inference time
200k
$0.9
83
FireworksCoreWeaveSiliconFlow
+9
MiMo-V2.5
Xiaomi logoXiaomi
40
310B
15B active at inference time
1.00M
$0.1
82
ParasailDeepInfraNovita
+2
GLM-5 (Reasoning)
Z AI logoZ AI
40
744B
40B active at inference time
200k
$0.7
76
SiliconFlowParasailFriendliAI
+9
MiniMax-M2.7
MiniMax logoMiniMax
38
230B
10B active at inference time
205k
$0.2
46
SambaNovaTogether AIGMI
+3
Kimi K2.5 (Reasoning)
Kimi logoKimi
38
1.0KB
32B active at inference time
256k
$0.6
55
NebiusBasetenAmazon Bedrock
+12
Nemotron 3 Ultra 550B A55B (Reasoning)
NVIDIA logoNVIDIA
38
550B
55B active at inference time
262k
$0.6
171
Not available
DeepInfraTogether AIBlackbox AI
+5
DeepSeek V4 Flash (Reasoning, High Effort)
DeepSeek logoDeepSeek
37
284B
13B active at inference time
1.00M
$0.1
-
MakoraDeepSeekNovita
+5
Qwen3.6 27B (Reasoning)
Alibaba logoAlibaba
37
27.8B
262k
$0.9
59
MakoraDeepInfraNovita
+3
GLM-5.1 (Non-reasoning)
Z AI logoZ AI
35
744B
40B active at inference time
200k
$0.9
57
DeepInfraSiliconFlowBaseten
+5
Kimi K2.6 (Non-reasoning)
Kimi logoKimi
35
1.0KB
32B active at inference time
256k
$0.7
45
KimiSiliconFlowMakora
+10
GLM-4.7 (Reasoning)
Z AI logoZ AI
34
357B
32B active at inference time
200k
$0.7
115
SiliconFlowAmazon BedrockGoogle
+7
Qwen3.5 27B (Reasoning)
Alibaba logoAlibaba
34
27.8B
262k
$0.5
81
DeepInfraAlibaba CloudNovita
+3
Qwen3.5 397B A17B (Reasoning)
Alibaba logoAlibaba
34
397B
17B active at inference time
262k
$0.9
51
ClarifaiWaferTogether AI
+9
MiniMax-M2.5
MiniMax logoMiniMax
34
230B
10B active at inference time
205k
$0.3
203
Eigen AIGMICoreWeave
+13
Hy3-preview (Reasoning)
Tencent logoTencent
34
295B
21B active at inference time
256k
$0.1
117
SiliconFlowGMI
DeepSeek V3.2 (Reasoning)
DeepSeek logoDeepSeek
33
685B
37B active at inference time
128k
$0.2
-
DeepSeekDeepInfraAmazon Bedrock
+12
MiMo-V2-Flash (Feb 2026)
Xiaomi logoXiaomi
33
309B
15B active at inference time
256k
$0.1
89
Xiaomi
Kimi K2 Thinking
Kimi logoKimi
33
1.0KB
32B active at inference time
256k
$0.8
123
NovitaAmazon BedrockGoogle
+3
GLM-5 (Non-reasoning)
Z AI logoZ AI
32
744B
40B active at inference time
200k
$0.7
66
NovitaFireworksNebius
+3
Qwen3.5 122B A10B (Reasoning)
Alibaba logoAlibaba
32
125B
10B active at inference time
262k
$0.7
135
GMIDeepInfraAlibaba Cloud
+2
Qwen3.5 397B A17B (Non-reasoning)
Alibaba logoAlibaba
32
397B
17B active at inference time
262k
$0.9
52
WaferTogether AIAlibaba Cloud
+6
Qwen3.6 35B A3B (Reasoning)
Alibaba logoAlibaba
32
36B
3B active at inference time
262k
$0.4
179
MakoraNovitaDeepInfra
+6
MiniMax-M2.1
MiniMax logoMiniMax
31
230B
10B active at inference time
205k
$0.4
223
NovitaFriendliAIMiniMax
DeepSeek V4 Pro (Non-reasoning)
DeepSeek logoDeepSeek
31
1.6KB
49B active at inference time
1.00M
$0.2
79
Microsoft AzureNebiusMakora
+2
MiMo-V2-Flash (Reasoning)
Xiaomi logoXiaomi
31
309B
15B active at inference time
256k
$0.1
87
Xiaomi
Ring-2.6-1T
InclusionAI logoInclusionAI
31
1.0KB
63B active at inference time
262k
$0.5
133
InclusionAI
Mistral Medium 3.5
Mistral logoMistral
30
128B
256k
$1.2
135
Mistral
Step 3.7 Flash
StepFun logoStepFun
30
198B
11B active at inference time
256k
$0.2
383
StepFun
Kimi K2.5 (Non-reasoning)
Kimi logoKimi
29
1.0KB
32B active at inference time
256k
$0.8
54
NovitaKimiGMI
+6
Gemma 4 31B (Reasoning)
Google logoGoogle
29
30.7B
256k
-
34
ParasailLightning AITogether AI
+8
Qwen3.5 27B (Non-reasoning)
Alibaba logoAlibaba
29
27.8B
262k
$0.5
90
CoreWeaveAlibaba CloudDeepInfra
Command A+
Cohere logoCohere
29
218B
25B active at inference time
192k
-
199
Cohere
Qwen3.6 27B (Non-reasoning)
Alibaba logoAlibaba
29
27.8B
262k
$0.9
63
MakoraDeepInfraGroq
+2
Qwen3.5 35B A3B (Reasoning)
Alibaba logoAlibaba
29
36B
3B active at inference time
262k
$0.4
158
DeepInfraAlibaba CloudSiliconFlow
+2
DeepSeek V4 Flash (Non-reasoning)
DeepSeek logoDeepSeek
29
284B
13B active at inference time
1.00M
$0.1
113
MakoraGMIDeepSeekCoreWeave
MiniMax-M2
MiniMax logoMiniMax
28
230B
10B active at inference time
205k
$0.4
119
Amazon BedrockNovitaMiniMaxGoogle
Qwen3.5 122B A10B (Non-reasoning)
Alibaba logoAlibaba
28
125B
10B active at inference time
262k
$0.7
156
Alibaba CloudDeepInfra
MiMo-V2.5-Pro (Non-reasoning)
Xiaomi logoXiaomi
28
1.0KB
41.7B active at inference time
1.00M
$0.6
56
NovitaXiaomiDeepInfraGMI
GLM-4.7 (Non-reasoning)
Z AI logoZ AI
27
357B
32B active at inference time
200k
$0.7
117
SiliconFlowParasailGoogle
+6
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek logoDeepSeek
26
685B
37B active at inference time
128k
$1.7
-
SambaNovaNovita
Hy3-preview (Non-reasoning)
Tencent logoTencent
26
295B
21B active at inference time
256k
$0.1
133
SiliconFlowGMI
Ling-2.6-1T
InclusionAI logoInclusionAI
26
1.0KB
63B active at inference time
262k
$0.5
-
InclusionAI
Gemma 4 26B A4B (Reasoning)
Google logoGoogle
26
25.2B
3.8B active at inference time
256k
$0.1
-
CloudflareDeepInfraGoogle
+4
Step 3.5 Flash
StepFun logoStepFun
26
196B
11B active at inference time
256k
$0.1
211
SiliconFlowStepFun
DeepSeek V3.2 Exp (Reasoning)
DeepSeek logoDeepSeek
25
685B
37B active at inference time
128k
$0.2
-
NovitaDeepSeek
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA logoNVIDIA
25
120.6B
12.7B active at inference time
1.00M
$0.3
148
BasetenCoreWeaveDeepInfra
+2
GLM-4.6 (Reasoning)
Z AI logoZ AI
25
357B
32B active at inference time
200k
$0.7
50
Together AINovitaDeepInfra
Qwen3.5 9B (Reasoning)
Alibaba logoAlibaba
25
9.65B
262k
$0.1
63
SiliconFlowTogether AI
Gemma 4 31B (Non-reasoning)
Google logoGoogle
25
30.7B
256k
$0.2
38
Together AINovitaParasail
+4
K-EXAONE (Reasoning)
LG AI Research logoLG AI Research
25
236B
23B active at inference time
256k
-
-
-
DeepSeek V3.2 (Non-reasoning)
DeepSeek logoDeepSeek
25
685B
37B active at inference time
128k
$0.5
-
GMISiliconFlowEigen AI
+12
Trinity Large Thinking
Arcee AI logoArcee AI
24
399B
13B active at inference time
512k
$0.2
154
Arcee AIParasail
Qwen3.6 35B A3B (Non-reasoning)
Alibaba logoAlibaba
24
36B
3B active at inference time
262k
$0.6
188
ParasailDeepInfraAlibaba Cloud
+5
gpt-oss-120b (high)
OpenAI logoOpenAI
24
117B
5.1B active at inference time
131k
$0.2
341
DeepInfraCloudflareLightning AI
+23
Kimi K2 0905
Kimi logoKimi
24
1.0KB
32B active at inference time
256k
$0.8
27
Novita
Qwen3.5 35B A3B (Non-reasoning)
Alibaba logoAlibaba
23
36B
3B active at inference time
262k
$0.4
173
Alibaba CloudDeepInfra
MiMo-V2-Flash (Non-reasoning)
Xiaomi logoXiaomi
23
309B
15B active at inference time
256k
$0.1
93
Xiaomi
GLM-4.6 (Non-reasoning)
Z AI logoZ AI
23
357B
32B active at inference time
200k
$0.8
51
Together AINovita
EXAONE 4.5 33B
LG AI Research logoLG AI Research
23
34.4B
262k
-
-
-
GLM-4.7-Flash (Reasoning)
Z AI logoZ AI
23
31.2B
3B active at inference time
200k
$0.1
88
NovitaAmazon BedrockDeepInfra
Qwen3 235B A22B 2507 (Reasoning)
Alibaba logoAlibaba
22
235B
22B active at inference time
256k
$0.6
48
DeepInfraCoreWeaveEigen AI
+3
DeepSeek V3.2 Speciale
DeepSeek logoDeepSeek
22
685B
37B active at inference time
128k
-
-
-
HyperNova 60B 2605
Multiverse Computing logoMultiverse Computing
22
58.7B
4.8B active at inference time
131k
$0.1
360
CompactifAI
Gemma 4 12B (Reasoning)
Google logoGoogle
22
12B
256k
$0.1
126
SiliconFlow
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek logoDeepSeek
21
685B
37B active at inference time
128k
$0.3
-
NovitaSambaNovaDeepInfra
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek logoDeepSeek
21
685B
37B active at inference time
128k
$0.2
-
NovitaDeepSeek
Nemotron Cascade 2 30B A3B
NVIDIA logoNVIDIA
21
31.6B
3B active at inference time
1.00M
-
-
-
Apriel-v1.5-15B-Thinker
ServiceNow logoServiceNow
21
15B
128k
-
-
Together AI
Qwen3 Coder Next
Alibaba logoAlibaba
21
79.7B
3B active at inference time
256k
$0.4
63
ParasailTogether AINovitaAmazon Bedrock
DeepSeek V3.1 (Non-reasoning)
DeepSeek logoDeepSeek
21
685B
37B active at inference time
128k
$0.7
-
NovitaCoreWeaveTogether AI
+7
Mistral Small 4 (Reasoning)
Mistral logoMistral
21
119B
6.5B active at inference time
256k
$0.2
170
Mistral
DeepSeek V3.1 (Reasoning)
DeepSeek logoDeepSeek
21
685B
37B active at inference time
128k
$0.7
-
Amazon BedrockGoogleSambaNovaNovita
Qwen3 VL 235B A22B (Reasoning)
Alibaba logoAlibaba
21
235B
22B active at inference time
262k
$1.4
56
NovitaAlibaba Cloud
North Mini Code
Cohere logoCohere
21
30B
3B active at inference time
256k
-
151
Not available
Cohere
Apriel-v1.6-15B-Thinker
ServiceNow logoServiceNow
21
15B
128k
-
-
Together AI
Qwen3.5 9B (Non-reasoning)
Alibaba logoAlibaba
20
9.65B
262k
-
-
-
Gemma 4 26B A4B (Non-reasoning)
Google logoGoogle
20
25.2B
3.8B active at inference time
256k
$0.2
40
ClarifaiParasailGMI
+4
Qwen3.5 4B (Reasoning)
Alibaba logoAlibaba
20
4.66B
262k
$0.0
30
DeepInfra
DeepSeek R1 0528 (May '25)
DeepSeek logoDeepSeek
20
685B
37B active at inference time
128k
$1.6
-
HyperbolicGoogleMicrosoft Azure
+3
Qwen3 Next 80B A3B (Reasoning)
Alibaba logoAlibaba
20
80B
3B active at inference time
262k
$1.1
174
GMIAlibaba CloudNebius
+5
GLM-4.5 (Reasoning)
Z AI logoZ AI
19
355B
32B active at inference time
128k
$0.8
50
Novita
Kimi K2
Kimi logoKimi
19
1.0KB
32B active at inference time
128k
$0.6
27
NovitaKimi
Ling 2.6 Flash
InclusionAI logoInclusionAI
19
107B
7.4B active at inference time
262k
$0.1
-
Novita
Seed-OSS-36B-Instruct
ByteDance Seed logoByteDance Seed
18
36.2B
512k
$0.2
34
SiliconFlow
Qwen3 235B A22B 2507 Instruct
Alibaba logoAlibaba
18
235B
22B active at inference time
256k
$0.3
58
FriendliAINebiusCoreWeave
+9
Qwen3 Coder 480B A35B Instruct
Alibaba logoAlibaba
18
480B
35B active at inference time
262k
$0.5
60
Eigen AIGoogleNovita
+6
Qwen3 VL 32B (Reasoning)
Alibaba logoAlibaba
18
33.4B
256k
$1.5
91
Alibaba Cloud
gpt-oss-120b (low)
OpenAI logoOpenAI
18
117B
5.1B active at inference time
131k
$0.2
355
FireworksHyperbolicCompactifAI
+19
MiniMax M1 80k
MiniMax logoMiniMax
18
456B
45.9B active at inference time
1.00M
$0.7
-
Novita
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA logoNVIDIA
18
31.6B
3.6B active at inference time
1.00M
$0.1
86
NebiusDeepInfra
K2 Think V2
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
17
70B
262k
-
-
-
LongCat Flash Lite
LongCat logoLongCat
17
68.5B
3B active at inference time
256k
-
-
LongCat
HyperCLOVA X SEED Think (32B)
Naver logoNaver
17
32B
128k
-
-
-
GLM-4.6V (Reasoning)
Z AI logoZ AI
17
108B
12B active at inference time
128k
$0.4
80
SiliconFlowNovita
K-EXAONE (Non-reasoning)
LG AI Research logoLG AI Research
17
236B
23B active at inference time
256k
-
-
-
GLM-4.5-Air
Z AI logoZ AI
17
106B
12B active at inference time
128k
$0.3
79
Together AISiliconFlow
Mistral Large 3
Mistral logoMistral
16
675B
41B active at inference time
256k
$0.6
51
Microsoft AzureAmazon BedrockMistral
Ring-1T
InclusionAI logoInclusionAI
16
1.0KB
50B active at inference time
128k
-
-
-
Qwen3.5 4B (Non-reasoning)
Alibaba logoAlibaba
16
4.66B
262k
$0.0
36
DeepInfra
Qwen3 30B A3B 2507 (Reasoning)
Alibaba logoAlibaba
16
30.5B
3.3B active at inference time
262k
$0.4
146
ClarifaiAlibaba Cloud
DeepSeek V3 0324
DeepSeek logoDeepSeek
16
671B
37B active at inference time
128k
$1.2
-
Together AIReplicateHyperbolic
+3
INTELLECT-3
Prime Intellect logoPrime Intellect
16
107B
12B active at inference time
131k
-
-
-
GLM-4.7-Flash (Non-reasoning)
Z AI logoZ AI
16
31.2B
3B active at inference time
200k
$0.1
144
NovitaAmazon Bedrock
Devstral 2
Mistral logoMistral
15
125B
256k
-
44
Mistral
Solar Open 100B (Reasoning)
Upstage logoUpstage
15
102B
12B active at inference time
128k
-
-
-
Nemotron 3 Nano Omni 30B A3B Reasoning
NVIDIA logoNVIDIA
15
30B
3B active at inference time
256k
$0.1
280
NebiusClarifai
gpt-oss-20B (high)
OpenAI logoOpenAI
15
21B
3.6B active at inference time
131k
$0.1
216
CompactifAIHyperbolicCloudflare
+10
MiniMax M1 40k
MiniMax logoMiniMax
14
456B
45.9B active at inference time
1.00M
-
-
-
gpt-oss-20B (low)
OpenAI logoOpenAI
14
21B
3.6B active at inference time
131k
$0.1
224
ClarifaiTogether AICoreWeave
+9
Qwen3 VL 235B A22B Instruct
Alibaba logoAlibaba
14
235B
22B active at inference time
262k
$0.5
50
ParasailAlibaba CloudEigen AI
+2
Llama 4 Maverick
Meta logoMeta
14
402B
17B active at inference time
1.00M
$0.3
93
ParasailAmazon BedrockDatabricks
+6
K2-V2 (high)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
14
70B
512k
-
-
-
Qwen3 Next 80B A3B Instruct
Alibaba logoAlibaba
14
80B
3B active at inference time
262k
$0.7
178
HyperbolicGoogleDeepInfra
+4
Tri-21B-think Preview
Trillion Labs logoTrillion Labs
14
21B
32.0k
-
-
-
Qwen3 Coder 30B A3B Instruct
Alibaba logoAlibaba
14
30.5B
3.3B active at inference time
262k
$0.3
100
ClarifaiScalewayAlibaba CloudAmazon Bedrock
Qwen3 235B A22B (Reasoning)
Alibaba logoAlibaba
13
235B
22B active at inference time
32.8k
$1.5
60
Alibaba Cloud
QwQ 32B
Alibaba logoAlibaba
13
32.8B
131k
$0.7
31
Cloudflare
Qwen3 VL 30B A3B (Reasoning)
Alibaba logoAlibaba
13
30B
3B active at inference time
256k
$0.3
111
Alibaba CloudEigen AINovitaFireworks
Gemma 4 12B (Non-reasoning)
Google logoGoogle
13
12B
262k
$0.1
134
SiliconFlow
Devstral Small 2
Mistral logoMistral
13
24B
256k
-
55
Mistral
Ling-1T
InclusionAI logoInclusionAI
13
1.0KB
50B active at inference time
128k
-
-
-
DeepSeek R1 (Jan '25)
DeepSeek logoDeepSeek
13
685B
37B active at inference time
128k
$2.0
-
HyperbolicNovitaAmazon Bedrock
+3
Gemma 4 E4B (Reasoning)
Google logoGoogle
12
8B
4.5B active at inference time
128k
-
-
-
K2-V2 (medium)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
12
70B
512k
-
-
-
Llama Nemotron Super 49B v1.5 (Reasoning)
NVIDIA logoNVIDIA
12
49B
128k
$0.1
49
DeepInfra
Mistral Small 4 (Non-reasoning)
Mistral logoMistral
12
119B
6.5B active at inference time
256k
$0.2
156
Mistral
Tri-21B-Think
Trillion Labs logoTrillion Labs
12
21B
32.0k
-
-
-
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
NVIDIA logoNVIDIA
12
49B
128k
-
-
-
Qwen3 4B 2507 (Reasoning)
Alibaba logoAlibaba
12
4.02B
262k
-
-
-
MiniCPM5-1B (Reasoning)
OpenBMB logoOpenBMB
12
1B
128k
-
-
-
Magistral Small 1.2
Mistral logoMistral
12
24B
128k
$0.6
107
Amazon BedrockMistral
Sarvam 105B (high)
Sarvam logoSarvam
12
106B
10.3B active at inference time
128k
$0.0
96
Sarvam
Devstral Small (May '25)
Mistral logoMistral
12
23.6B
256k
-
-
-
MiniCPM5-1B (Non-reasoning)
OpenBMB logoOpenBMB
12
1B
128k
-
-
-
Qwen3 VL 32B Instruct
Alibaba logoAlibaba
11
33.4B
256k
$0.9
67
Alibaba Cloud
DeepSeek R1 Distill Qwen 32B
DeepSeek logoDeepSeek
11
32B
128k
-
-
-
GLM-4.6V (Non-reasoning)
Z AI logoZ AI
11
108B
12B active at inference time
128k
$0.4
84
NovitaSiliconFlow
Qwen3 235B A22B (Non-reasoning)
Alibaba logoAlibaba
11
235B
22B active at inference time
32.8k
$0.6
61
NovitaAlibaba Cloud
Magistral Small 1
Mistral logoMistral
11
23.6B
40.0k
-
-
-
EXAONE 4.0 32B (Reasoning)
LG AI Research logoLG AI Research
11
32B
131k
-
-
-
Qwen3 VL 8B (Reasoning)
Alibaba logoAlibaba
11
8.77B
256k
$0.4
111
Alibaba Cloud
Qwen3 32B (Reasoning)
Alibaba logoAlibaba
10
32.8B
32.8k
$0.2
61
Alibaba CloudNovitaGroq
+3
DeepSeek V3 (Dec '24)
DeepSeek logoDeepSeek
10
671B
37B active at inference time
128k
$0.4
-
DeepInfraTogether AINovita
+2
DeepSeek R1 0528 Qwen3 8B
DeepSeek logoDeepSeek
10
8.19B
32.8k
-
-
-
Qwen3.5 2B (Reasoning)
Alibaba logoAlibaba
10
2.27B
262k
$0.0
41
DeepInfra
Qwen3 14B (Reasoning)
Alibaba logoAlibaba
10
14.8B
32.8k
$0.4
63
Alibaba CloudDeepInfra
Nanbeige4.1-3B
Nanbeige logoNanbeige
10
3.93B
256k
-
-
-
Llama 4 Scout
Meta logoMeta
10
109B
17B active at inference time
10.0M
$0.2
108
CloudflareNovitaAmazon Bedrock
+6
Qwen3 VL 30B A3B Instruct
Alibaba logoAlibaba
10
30B
3B active at inference time
256k
$0.2
112
FireworksAlibaba CloudEigen AINovita
Hermes 4 - Llama-3.1 70B (Reasoning)
Nous Research logoNous Research
10
70.6B
128k
$0.2
71
Nebius
Ministral 3 14B
Mistral logoMistral
10
14B
256k
$0.2
93
MistralAmazon Bedrock
DeepSeek R1 Distill Llama 70B
DeepSeek logoDeepSeek
10
70B
128k
$0.7
50
SambaNovaScalewayDeepInfra
DeepSeek R1 Distill Qwen 14B
DeepSeek logoDeepSeek
10
14B
128k
-
-
-
Falcon-H1R-7B
TII UAE logoTII UAE
10
7B
256k
-
-
-
Ling-flash-2.0
InclusionAI logoInclusionAI
10
103B
6.1B active at inference time
128k
$0.2
55
SiliconFlow
Qwen3 Omni 30B A3B (Reasoning)
Alibaba logoAlibaba
10
35.3B
3B active at inference time
65.5k
$0.3
84
Alibaba Cloud
Qwen2.5 Instruct 72B
Alibaba logoAlibaba
10
72B
131k
$0.2
-
SiliconFlowDeepInfraAlibaba Cloud
Step3 VL 10B
StepFun logoStepFun
9
10.2B
65.5k
-
-
-
Qwen3 30B A3B (Reasoning)
Alibaba logoAlibaba
9
30.5B
3.3B active at inference time
32.8k
$0.1
110
FireworksEigen AIAlibaba Cloud
+2
Devstral Small (Jul '25)
Mistral logoMistral
9
24B
256k
$0.1
35
Mistral
Gemma 4 E2B (Reasoning)
Google logoGoogle
9
5.1B
2.3B active at inference time
128k
-
-
-
QwQ 32B-Preview
Alibaba logoAlibaba
9
32.8B
32.8k
-
-
-
GLM-4.5V (Reasoning)
Z AI logoZ AI
9
108B
12B active at inference time
64.0k
$0.7
26
Novita
Mistral Large 2 (Nov '24)
Mistral logoMistral
9
123B
128k
$2.4
53
Mistral
Mistral Small 3.2
Mistral logoMistral
9
24B
128k
$0.1
140
DeepInfraMistral
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
NVIDIA logoNVIDIA
9
253B
128k
$0.7
51
Nebius
Qwen3 30B A3B 2507 Instruct
Alibaba logoAlibaba
9
30.5B
3.3B active at inference time
262k
$0.2
144
CoreWeaveClarifaiNebiusAlibaba Cloud
ERNIE 4.5 300B A47B
Baidu logoBaidu
9
300B
47B active at inference time
131k
$0.4
-
NovitaSiliconFlow
Hermes 4 - Llama-3.1 405B (Reasoning)
Nous Research logoNous Research
9
406B
128k
$1.2
41
Nebius
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
NVIDIA logoNVIDIA
9
13.2B
128k
$0.2
284
DeepInfra
Ministral 3 8B
Mistral logoMistral
9
8B
256k
$0.1
102
MistralAmazon Bedrock
Gemma 4 E4B (Non-reasoning)
Google logoGoogle
9
8B
4.5B active at inference time
128k
-
-
-
Granite 4.1 30B
IBM logoIBM
9
30B
131k
-
-
-
NVIDIA Nemotron Nano 9B V2 (Reasoning)
NVIDIA logoNVIDIA
9
9B
131k
$0.1
89
DeepInfra
Hermes 4 - Llama-3.1 405B (Non-reasoning)
Nous Research logoNous Research
9
406B
128k
$1.2
42
Nebius
NVIDIA Nemotron 3 Nano 4B
NVIDIA logoNVIDIA
9
3.97B
262k
-
-
-
Qwen3.5 2B (Non-reasoning)
Alibaba logoAlibaba
9
2.27B
262k
$0.0
31
DeepInfra
Llama Nemotron Super 49B v1.5 (Non-reasoning)
NVIDIA logoNVIDIA
9
49B
128k
$0.1
50
DeepInfra
Qwen3 32B (Non-reasoning)
Alibaba logoAlibaba
9
32.8B
32.8k
$0.2
64
NovitaNebiusSambaNova
+4
Llama 3.3 Instruct 70B
Meta logoMeta
9
70B
128k
$0.6
91
HyperbolicNovitaDatabricks
+18
Mistral Small 3.1
Mistral logoMistral
9
24B
128k
$0.1
157
DeepInfraMistralCompactifAICloudflare
K2-V2 (low)
MBZUAI Institute of Foundation Models logoMBZUAI Institute of Foundation Models
9
70B
512k
-
-
-
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
NVIDIA logoNVIDIA
9
4.51B
128k
-
-
-
Kimi Linear 48B A3B Instruct
Kimi logoKimi
9
49.1B
3B active at inference time
1.00M
-
-
-
Llama 3.1 Instruct 405B
Meta logoMeta
9
405B
128k
$3.1
64
Amazon BedrockAmazon BedrockDatabricksMicrosoft Azure
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
NVIDIA logoNVIDIA
8
49B
128k
-
-
-
Qwen3 VL 8B Instruct
Alibaba logoAlibaba
8
8.77B
256k
$0.2
121
Alibaba Cloud
Qwen3 4B (Reasoning)
Alibaba logoAlibaba
8
4.02B
32.0k
$0.2
-
Alibaba Cloud
Llama 3.1 Tulu3 405B
Allen Institute for AI logoAllen Institute for AI
8
405B
128k
-
-
-
Ring-flash-2.0
InclusionAI logoInclusionAI
8
103B
6.1B active at inference time
128k
$0.2
-
SiliconFlow
Pixtral Large
Mistral logoMistral
8
124B
128k
$2.4
50
Mistral
Olmo 3.1 32B Think
Allen Institute for AI logoAllen Institute for AI
8
32.2B
65.5k
-
-
Parasail
Grok 2 (Dec '24)
xAI logoxAI
8
270B
131k
-
-
-
Qwen3 VL 4B (Reasoning)
Alibaba logoAlibaba
8
4.44B
256k
-
-
-
Command A
Cohere logoCohere
8
111B
256k
$3.3
74
Microsoft AzureCohere
Llama 3.1 Nemotron Instruct 70B
NVIDIA logoNVIDIA
8
70B
128k
$1.2
299
DeepInfra
Qwen2.5 Instruct 32B
Alibaba logoAlibaba
7
32B
128k
-
-
-
Qwen3 8B (Reasoning)
Alibaba logoAlibaba
7
8.19B
131k
$0.2
38
Alibaba CloudEigen AI
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA logoNVIDIA
7
31.6B
3.6B active at inference time
1.00M
$0.1
87
DeepInfra
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
NVIDIA logoNVIDIA
7
9B
131k
$0.1
142
Amazon BedrockDeepInfra
Mistral Large 2 (Jul '24)
Mistral logoMistral
7
123B
128k
$2.4
-
Amazon Bedrock
Qwen3 4B 2507 Instruct
Alibaba logoAlibaba
7
4.02B
262k
-
-
-
Qwen2.5 Coder Instruct 32B
Alibaba logoAlibaba
7
32B
131k
-
-
-
Qwen3 14B (Non-reasoning)
Alibaba logoAlibaba
7
14.8B
32.8k
$0.3
63
Alibaba CloudDeepInfra
GLM-4.5V (Non-reasoning)
Z AI logoZ AI
7
108B
12B active at inference time
64.0k
$0.7
34
Novita
Mistral Small 3
Mistral logoMistral
7
24B
32.0k
$0.1
152
DeepInfraMistral
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Nous Research logoNous Research
7
70.6B
128k
$0.2
72
Nebius
Qwen3 30B A3B (Non-reasoning)
Alibaba logoAlibaba
7
30.5B
3.3B active at inference time
32.8k
$0.1
108
Eigen AIDeepInfraAlibaba Cloud
DeepSeek-V2.5 (Dec '24)
DeepSeek logoDeepSeek
7
236B
21B active at inference time
128k
-
-
-
Qwen3 4B (Non-reasoning)
Alibaba logoAlibaba
7
4.02B
32.0k
$0.1
-
Alibaba Cloud
Llama 3.1 Instruct 70B
Meta logoMeta
7
70B
128k
$0.6
31
DeepInfraDeepInfraAmazon BedrockAmazon Bedrock
Granite 4.1 8B
IBM logoIBM
7
8B
131k
$0.1
122
CoreWeave
Sarvam 30B (high)
Sarvam logoSarvam
7
32.2B
2.4B active at inference time
65.5k
$0.0
165
Sarvam
DeepSeek-V2.5
DeepSeek logoDeepSeek
7
236B
21B active at inference time
128k
-
-
-
Olmo 3.1 32B Instruct
Allen Institute for AI logoAllen Institute for AI
6
32.2B
65.5k
-
-
-
DeepSeek R1 Distill Llama 8B
DeepSeek logoDeepSeek
6
8B
128k
-
-
-
Gemma 4 E2B (Non-reasoning)
Google logoGoogle
6
5.1B
2.3B active at inference time
128k
-
-
-
Olmo 3 32B Think
Allen Institute for AI logoAllen Institute for AI
6
32.2B
65.5k
-
-
-
R1 1776
Perplexity logoPerplexity
6
671B
37B active at inference time
128k
-
-
-
Llama 3.2 Instruct 90B (Vision)
Meta logoMeta
6
90B
128k
$1.4
58
Microsoft AzureAmazon Bedrock
Solar Mini
Upstage logoUpstage
6
10.7B
4.10k
$0.1
-
Upstage
Llama 3.1 Instruct 8B
Meta logoMeta
6
8B
128k
$0.1
153
Microsoft AzureCoreWeaveCloudflare
+12
Grok-1
xAI logoxAI
6
314B
78B active at inference time
8.19k
-
-
-
Qwen2 Instruct 72B
Alibaba logoAlibaba
6
72B
131k
-
-
-
EXAONE 4.0 32B (Non-reasoning)
LG AI Research logoLG AI Research
6
32B
131k
-
-
-
Ministral 3 3B
Mistral logoMistral
6
3B
256k
$0.1
180
Amazon BedrockMistral
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research logoNous Research
5
24B
32.0k
-
-
-
Jamba 1.7 Large
AI21 Labs logoAI21 Labs
5
398B
94B active at inference time
256k
$2.6
59
AI21 Labs
Granite 4.0 H Small
IBM logoIBM
5
32B
9B active at inference time
128k
$0.1
386
Replicate
Jamba 1.5 Large
AI21 Labs logoAI21 Labs
5
398B
94B active at inference time
256k
$2.6
-
Amazon Bedrock
Qwen3 Omni 30B A3B Instruct
Alibaba logoAlibaba
5
35.3B
3B active at inference time
65.5k
$0.3
95
Alibaba Cloud
Hermes 3 - Llama-3.1 70B
Nous Research logoNous Research
5
70.6B
128k
$0.3
32
DeepInfra
Qwen3 8B (Non-reasoning)
Alibaba logoAlibaba
5
8.19B
32.8k
$0.2
39
Eigen AIFireworksAlibaba Cloud
DeepSeek-Coder-V2
DeepSeek logoDeepSeek
5
236B
21B active at inference time
128k
-
-
-
OLMo 2 32B
Allen Institute for AI logoAllen Institute for AI
5
32.2B
4.10k
-
-
-
Jamba 1.6 Large
AI21 Labs logoAI21 Labs
5
398B
94B active at inference time
256k
$2.6
60
AI21 Labs
Qwen3.5 0.8B (Reasoning)
Alibaba logoAlibaba
5
0.873B
262k
$0.0
32
DeepInfra
LFM2 24B A2B
Liquid AI logoLiquid AI
5
23.8B
2.3B active at inference time
32.8k
$0.0
127
Together AI
Phi-4
Microsoft logoMicrosoft
5
14B
16.0k
$0.2
40
Microsoft AzureDeepInfra
Gemma 3 27B Instruct
Google logoGoogle
5
27.4B
128k
$0.1
-
ParasailNovitaGoogle
+3
Mistral Small (Sep '24)
Mistral logoMistral
5
22B
32.8k
$0.2
158
Mistral
Phi-3 Mini Instruct 3.8B
Microsoft logoMicrosoft
5
3.8B
4.10k
-
-
-
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA logoNVIDIA
5
13.2B
128k
$0.2
212
DeepInfraAmazon Bedrock
Gemma 3n E4B Instruct Preview (May '25)
Google logoGoogle
5
8.39B
4B active at inference time
32.0k
-
-
-
Phi-4 Multimodal Instruct
Microsoft logoMicrosoft
5
5.6B
128k
-
16
Microsoft Azure
Qwen2.5 Coder Instruct 7B
Alibaba logoAlibaba
4
7.62B
131k
-
-
-
Qwen3.5 0.8B (Non-reasoning)
Alibaba logoAlibaba
4
0.873B
262k
$0.0
35
DeepInfra
Mixtral 8x22B Instruct
Mistral logoMistral
4
141B
39B active at inference time
65.4k
-
-
-
Llama 2 Chat 7B
Meta logoMeta
4
7B
4.10k
$0.1
-
Replicate
Llama 3.2 Instruct 3B
Meta logoMeta
4
3B
128k
$0.1
51
Amazon Bedrock
MiniCPM-V 4.6 1.3B
OpenBMB logoOpenBMB
4
1.3B
262k
-
-
-
Jamba Reasoning 3B
AI21 Labs logoAI21 Labs
4
3B
262k
-
-
-
Qwen3 VL 4B Instruct
Alibaba logoAlibaba
4
4.44B
256k
-
-
-
Qwen1.5 Chat 110B
Alibaba logoAlibaba
4
110B
32.0k
-
-
-
Reka Flash 3
Reka AI logoReka AI
4
21B
128k
$0.3
-
Reka AI
Olmo 3 7B Think
Allen Institute for AI logoAllen Institute for AI
4
7B
65.5k
-
-
-
OLMo 2 7B
Allen Institute for AI logoAllen Institute for AI
4
7.3B
4.10k
-
-
-
Molmo 7B-D
Allen Institute for AI logoAllen Institute for AI
4
8.02B
4.10k
-
-
-
Ling-mini-2.0
InclusionAI logoInclusionAI
4
16.3B
1.4B active at inference time
131k
-
-
-
DeepSeek R1 Distill Qwen 1.5B
DeepSeek logoDeepSeek
4
1.5B
128k
-
-
-
DeepSeek-V2-Chat
DeepSeek logoDeepSeek
4
236B
21B active at inference time
128k
-
-
-
Llama 3 Instruct 70B
Meta logoMeta
3
70B
8.19k
$0.9
-
ReplicateNovitaAmazon Bedrock
Arctic Instruct
Snowflake logoSnowflake
3
480B
17B active at inference time
4.00k
-
-
-
Qwen Chat 72B
Alibaba logoAlibaba
3
72B
33.8k
-
-
-
Gemma 3 12B Instruct
Google logoGoogle
3
12.2B
128k
$0.1
-
DeepInfraGoogleDatabricks
+2
Llama 3.2 Instruct 11B (Vision)
Meta logoMeta
3
11B
128k
$0.2
51
Microsoft AzureAmazon BedrockDeepInfra
Granite 4.1 3B
IBM logoIBM
3
3B
131k
-
-
-
DeepSeek Coder V2 Lite Instruct
DeepSeek logoDeepSeek
3
16B
2.4B active at inference time
128k
-
-
-
Sarvam M (Reasoning)
Sarvam logoSarvam
3
23.6B
32.8k
-
-
Sarvam
Phi-4 Mini Instruct
Microsoft logoMicrosoft
3
3.84B
128k
-
45
CoreWeaveMicrosoft Azure
Llama 2 Chat 70B
Meta logoMeta
3
70B
4.10k
-
-
-
DeepSeek LLM 67B Chat (V1)
DeepSeek logoDeepSeek
3
7B
4.10k
-
-
-
Llama 2 Chat 13B
Meta logoMeta
3
13B
4.10k
-
-
-
Command-R+ (Apr '24)
Cohere logoCohere
3
104B
128k
$4.2
-
Amazon Bedrock
OpenChat 3.5 (1210)
OpenChat logoOpenChat
3
7B
8.19k
-
-
-
DBRX Instruct
Databricks logoDatabricks
3
132B
36B active at inference time
32.8k
-
-
-
Exaone 4.0 1.2B (Reasoning)
LG AI Research logoLG AI Research
3
1.28B
64.0k
-
-
-
Olmo 3 7B Instruct
Allen Institute for AI logoAllen Institute for AI
3
7B
65.5k
$0.1
-
Parasail
Exaone 4.0 1.2B (Non-reasoning)
LG AI Research logoLG AI Research
3
1.28B
64.0k
-
-
-
LFM2.5-1.2B-Thinking
Liquid AI logoLiquid AI
3
1.17B
32.0k
-
-
-
Jamba 1.7 Mini
AI21 Labs logoAI21 Labs
3
52B
12B active at inference time
258k
-
-
-
LFM2 2.6B
Liquid AI logoLiquid AI
3
2.57B
32.8k
-
338
Liquid AI
LFM2.5-1.2B-Instruct
Liquid AI logoLiquid AI
3
1.17B
32.0k
-
492
Liquid AI
Jamba 1.5 Mini
AI21 Labs logoAI21 Labs
3
52B
12B active at inference time
256k
$0.2
-
Amazon Bedrock
Granite 4.0 H 1B
IBM logoIBM
3
1.5B
128k
-
-
-
Qwen3 1.7B (Reasoning)
Alibaba logoAlibaba
3
2.03B
32.0k
$0.2
-
Alibaba Cloud
Jamba 1.6 Mini
AI21 Labs logoAI21 Labs
3
52B
12B active at inference time
256k
$0.2
181
AI21 Labs
Mixtral 8x7B Instruct
Mistral logoMistral
2
46.7B
12.9B active at inference time
32.8k
$0.5
-
Amazon Bedrock
Gemma 3 270M
Google logoGoogle
2
0.268B
32.0k
-
-
-
Apertus 70B Instruct
Swiss AI Initiative logoSwiss AI Initiative
2
70B
65.5k
$1.0
-
Public AI
Granite 4.0 Micro
IBM logoIBM
2
3B
128k
-
-
-
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research logoNous Research
2
8B
128k
-
-
-
Llama 65B
Meta logoMeta
2
65B
2.05k
-
-
-
Qwen Chat 14B
Alibaba logoAlibaba
2
14B
8.19k
-
-
-
Mistral 7B Instruct
Mistral logoMistral
2
7B
8.19k
$0.2
99
Amazon BedrockMistral
Command-R (Mar '24)
Cohere logoCohere
2
35B
128k
$0.6
-
Amazon Bedrock
Granite 4.0 1B
IBM logoIBM
2
1.6B
128k
-
-
-
Molmo2-8B
Allen Institute for AI logoAllen Institute for AI
2
8.66B
36.9k
-
-
-
LFM2 8B A1B
Liquid AI logoLiquid AI
2
8.34B
1.5B active at inference time
32.8k
-
-
Liquid AI
Granite 3.3 8B (Non-reasoning)
IBM logoIBM
2
8.17B
128k
$0.1
351
Replicate
Qwen3 1.7B (Non-reasoning)
Alibaba logoAlibaba
2
2.03B
32.0k
$0.1
-
Alibaba Cloud
Qwen3 0.6B (Reasoning)
Alibaba logoAlibaba
1
0.752B
32.0k
$0.2
-
Alibaba Cloud
Llama 3 Instruct 8B
Meta logoMeta
1
8B
8.19k
$0.1
-
Amazon BedrockReplicateDeepInfraNovita
Gemma 3n E4B Instruct
Google logoGoogle
1
8.39B
4B active at inference time
32.0k
$0.0
54
Together AI
LFM2 1.2B
Liquid AI logoLiquid AI
1
1.17B
32.8k
-
439
Liquid AI
Gemma 3 4B Instruct
Google logoGoogle
1
4.3B
128k
$0.0
-
DeepInfraAmazon BedrockGoogle
Llama 3.2 Instruct 1B
Meta logoMeta
1
1B
128k
$0.1
84
Amazon BedrockNovita
LFM2.5-VL-1.6B
Liquid AI logoLiquid AI
1
1.6B
32.0k
-
425
Liquid AI
Granite 4.0 H 350M
IBM logoIBM
1
0.34B
32.8k
-
-
-
Granite 4.0 350M
IBM logoIBM
1
0.35B
32.8k
-
-
-
Apertus 8B Instruct
Swiss AI Initiative logoSwiss AI Initiative
1
8B
65.5k
$0.1
-
Public AI
Tiny Aya Global
Cohere logoCohere
1
3.35B
8.19k
-
-
Cohere
Gemma 3 1B Instruct
Google logoGoogle
1
1B
32.0k
-
-
Google
Gemma 3n E2B Instruct
Google logoGoogle
1
5.98B
2B active at inference time
32.0k
-
-
Google
Qwen3 0.6B (Non-reasoning)
Alibaba logoAlibaba
1
0.752B
32.0k
$0.1
-
Alibaba Cloud
EXAONE 4.5 33B (Non-reasoning)
LG AI Research logoLG AI Research
-
34.4B
262k
-
-
-
Cogito v2.1 (Reasoning)
Deep Cogito logoDeep Cogito
-
671B
37B active at inference time
128k
$1.3
91
Together AI