WHAT LLM PROVIDER?

Compare LLMs across providers to find the best fit for your needs.

Chart Configuration

Filter Models

Filter models by their quality score
Current setting: 0 (slider range: 0 = any quality, 67 = highest quality)

Filter models by their price per million tokens
Current setting: $30.00 (slider range: $0 = free, $30 = premium)
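The two filters can be sketched in a few lines of Python. This is a minimal illustration, not the tool's actual code: the `ModelRow` type and `filter_models` function are hypothetical names, and it assumes the quality slider sets a minimum, the price slider sets a maximum, and models with an N/A quality score pass only when no minimum is set.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ModelRow:
    """Hypothetical record holding a few columns of one table row."""
    name: str
    provider: str
    price: float            # USD per 1M tokens
    quality: Optional[int]  # None when the table shows N/A


def filter_models(rows: List[ModelRow],
                  min_quality: int = 0,
                  max_price: float = 30.00) -> List[ModelRow]:
    """Keep rows at or above min_quality and at or below max_price."""
    kept = []
    for row in rows:
        if row.quality is None:
            # Assumption: unscored models are hidden once a minimum is set.
            if min_quality > 0:
                continue
        elif row.quality < min_quality:
            continue
        if row.price > max_price:
            continue
        kept.append(row)
    return kept


rows = [
    ModelRow("o3-mini (high)", "OpenAI", 1.93, 66),
    ModelRow("Claude 3 Opus", "Anthropic", 30.00, 35),
    ModelRow("Llama 3.2 1B", "Groq", 0.04, None),
]

selected = filter_models(rows, min_quality=40, max_price=5.00)
print([r.name for r in selected])
```

With the default settings (quality 0, price $30.00) all three sample rows pass, which matches the wide-open view captured on this page.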

Model Comparison Chart (301 models)

Model Data Table

Name | Provider | Creator | Model | Price (USD per 1M tokens) | Input Price | Output Price | Output Speed | Context Window | Latency | Quality Index
o3-mini (high) | OpenAI | OpenAI | o3-mini (high) | $1.93 | $1.10 | $4.40 | 118.5 tokens/s | 200,000 tokens | 59.59 s | 66
o3-mini | OpenAI | OpenAI | o3-mini | $1.93 | $1.10 | $4.40 | 183.2 tokens/s | 200,000 tokens | 12.49 s | 63
o3-mini | Microsoft Azure | OpenAI | o3-mini | $1.93 | $1.10 | $4.40 | 142.6 tokens/s | 200,000 tokens | 21.29 s | 63
o1 | Microsoft Azure | OpenAI | o1 | $26.25 | $15.00 | $60.00 | 104.5 tokens/s | 200,000 tokens | 31.34 s | 62
DeepSeek R1 | DeepSeek | DeepSeek | DeepSeek R1 | $0.96 | $0.55 | $2.19 | 25.1 tokens/s | 64,000 tokens | 68.28 s | 60
DeepSeek R1 | Hyperbolic | DeepSeek | DeepSeek R1 | $2.00 | $2.00 | $2.00 | 70 tokens/s | 128,000 tokens | 37.51 s | 60
DeepSeek R1 Base | Nebius AI Studio | DeepSeek | DeepSeek R1 | $1.20 | $0.80 | $2.40 | 12.5 tokens/s | 128,000 tokens | 123.9 s | 60
DeepSeek R1 Fast | Nebius AI Studio | DeepSeek | DeepSeek R1 | $3.00 | $2.00 | $6.00 | 72.5 tokens/s | 128,000 tokens | 19.82 s | 60
DeepSeek R1 | CentML | DeepSeek | DeepSeek R1 | $3.99 | $3.99 | $3.99 | 71.2 tokens/s | 128,000 tokens | 19.24 s | 60
DeepSeek R1 | Microsoft Azure | DeepSeek | DeepSeek R1 | $0.00 | $0.00 | $0.00 | 26.4 tokens/s | 128,000 tokens | 54.07 s | 60
DeepSeek R1 | Fireworks | DeepSeek | DeepSeek R1 | $4.25 | $3.00 | $8.00 | 87.5 tokens/s | 128,000 tokens | 17.68 s | 60
DeepSeek R1 | Deepinfra | DeepSeek | DeepSeek R1 | $1.16 | $0.75 | $2.40 | 8.5 tokens/s | 64,000 tokens | 212.83 s | 60
DeepSeek R1 | Novita | DeepSeek | DeepSeek R1 | $4.00 | $4.00 | $4.00 | 15.8 tokens/s | 64,000 tokens | 65.29 s | 60
DeepSeek R1 | Together.ai | DeepSeek | DeepSeek R1 | $4.00 | $3.00 | $7.00 | 113.6 tokens/s | 128,000 tokens | 15.23 s | 60
DeepSeek R1 | kluster.ai | DeepSeek | DeepSeek R1 | $7.00 | $7.00 | $7.00 | 26.6 tokens/s | 128,000 tokens | 62.07 s | 60
Claude 3.7 Sonnet Thinking | Anthropic | Anthropic | Claude 3.7 Sonnet Thinking | $6.00 | $3.00 | $15.00 | 79.6 tokens/s | 200,000 tokens | 0.92 s | 57
o1-mini | OpenAI | OpenAI | o1-mini | $1.93 | $1.10 | $4.40 | 200.8 tokens/s | 128,000 tokens | 11.33 s | 54
o1-mini | Microsoft Azure | OpenAI | o1-mini | $5.78 | $3.30 | $13.20 | 203.7 tokens/s | 128,000 tokens | 14.2 s | 54
DeepSeek R1 Distill Qwen 32B | Deepinfra | DeepSeek | DeepSeek R1 Distill Qwen 32B | $0.14 | $0.12 | $0.18 | 40 tokens/s | 128,000 tokens | 16.31 s | 51
DeepSeek R1 Distill Qwen 32B | Novita | DeepSeek | DeepSeek R1 Distill Qwen 32B | $0.30 | $0.30 | $0.30 | 21 tokens/s | 64,000 tokens | 32.19 s | 51
DeepSeek R1 Distill Qwen 32B | Groq | DeepSeek | DeepSeek R1 Distill Qwen 32B | $0.69 | $0.69 | $0.69 | 96.8 tokens/s | 128,000 tokens | 2.31 s | 51
Gemini 2.0 Pro Experimental (AI Studio) | Google (AI Studio) | Google | Gemini 2.0 Pro Experimental (AI Studio) | $0.00 | $0.00 | $0.00 | 123.3 tokens/s | 2,000,000 tokens | 0.56 s | 49
DeepSeek R1 Distill Qwen 14B | Novita | DeepSeek | DeepSeek R1 Distill Qwen 14B | $0.15 | $0.15 | $0.15 | 45.7 tokens/s | 64,000 tokens | 17.88 s | 49
DeepSeek R1 Distill Qwen 14B | Together.ai | DeepSeek | DeepSeek R1 Distill Qwen 14B | $1.60 | $1.60 | $1.60 | 164.8 tokens/s | 128,000 tokens | 6.62 s | 49
DeepSeek R1 Distill Llama 70B | Cerebras | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.94 | $0.85 | $1.20 | 1864 tokens/s | 66,000 tokens | 0.82 s | 48
DeepSeek R1 Distill Llama 70B Base | Nebius AI Studio | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.38 | $0.25 | $0.75 | 37.3 tokens/s | 128,000 tokens | 25 s | 48
DeepSeek R1 Distill Llama 70B | Deepinfra | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.34 | $0.23 | $0.69 | 39.8 tokens/s | 128,000 tokens | 10.84 s | 48
DeepSeek R1 Distill Llama 70B | Novita | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.39 | $0.39 | $0.39 | 22.1 tokens/s | 32,000 tokens | 15.56 s | 48
DeepSeek R1 Distill Llama 70B | Groq | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.81 | $0.75 | $0.99 | 256.8 tokens/s | 128,000 tokens | 3.65 s | 48
DeepSeek R1 Distill Llama 70B (Spec decoding) | Groq | DeepSeek | DeepSeek R1 Distill Llama 70B (Spec decoding) | $0.81 | $0.75 | $0.99 | 1754.8 tokens/s | 128,000 tokens | 1.19 s | 48
DeepSeek R1 Distill Llama 70B | SambaNova | DeepSeek | DeepSeek R1 Distill Llama 70B | $0.88 | $0.70 | $1.40 | 124.3 tokens/s | 16,000 tokens | 12.2 s | 48
DeepSeek R1 Distill Llama 70B | Together.ai | DeepSeek | DeepSeek R1 Distill Llama 70B | $2.00 | $2.00 | $2.00 | 110.4 tokens/s | 128,000 tokens | 15.8 s | 48
Claude 3.7 Sonnet | Amazon Bedrock | Anthropic | Claude 3.7 Sonnet | $6.00 | $3.00 | $15.00 | 40.7 tokens/s | 200,000 tokens | 0.74 s | 48
Claude 3.7 Sonnet | Anthropic | Anthropic | Claude 3.7 Sonnet | $6.00 | $3.00 | $15.00 | 80.3 tokens/s | 200,000 tokens | 0.97 s | 48
Gemini 2.0 Flash Vertex | Google Vertex | Google | Gemini 2.0 Flash Vertex | $0.26 | $0.15 | $0.60 | 0 tokens/s | 1,000,000 tokens | 0.1 s | 48
Gemini 2.0 Flash (AI Studio) | Google (AI Studio) | Google | Gemini 2.0 Flash (AI Studio) | $0.17 | $0.10 | $0.40 | 186.8 tokens/s | 1,000,000 tokens | 0.36 s | 48
DeepSeek V3 | DeepSeek | DeepSeek | DeepSeek V3 | $0.48 | $0.27 | $1.10 | 27.4 tokens/s | 66,000 tokens | 7.19 s | 46
DeepSeek V3 (FP8) | Hyperbolic (FP8) | DeepSeek | DeepSeek V3 | $0.25 | $0.25 | $0.25 | 19.2 tokens/s | 128,000 tokens | 0.91 s | 46
DeepSeek V3 | Nebius AI Studio | DeepSeek | DeepSeek V3 | $0.75 | $0.50 | $1.50 | 26.1 tokens/s | 128,000 tokens | 0.86 s | 46
DeepSeek V3 | Fireworks | DeepSeek | DeepSeek V3 | $1.31 | $0.75 | $3.00 | 59.1 tokens/s | 128,000 tokens | 0.78 s | 46
DeepSeek V3 | Deepinfra | DeepSeek | DeepSeek V3 | $0.59 | $0.49 | $0.89 | 7.5 tokens/s | 64,000 tokens | 1.01 s | 46
DeepSeek V3 | Novita | DeepSeek | DeepSeek V3 | $0.89 | $0.89 | $0.89 | 14.7 tokens/s | 64,000 tokens | 1.9 s | 46
DeepSeek V3 (FP8) | Together.ai | DeepSeek | DeepSeek V3 | $1.25 | $1.25 | $1.25 | 18.6 tokens/s | 128,000 tokens | 0.39 s | 46
Qwen2.5 Max | Alibaba Cloud | Alibaba | Qwen2.5 Max | $2.80 | $1.60 | $6.40 | 36.1 tokens/s | 32,000 tokens | 1.26 s | 45
Gemini 1.5 Pro (Sep) (Vertex) | Google (Vertex) | Google | Gemini 1.5 Pro (Sep) (Vertex) | $2.19 | $1.25 | $5.00 | 0 tokens/s | 2,000,000 tokens | 0.1 s | 45
Gemini 1.5 Pro (Sep) (AI Studio) | Google (AI Studio) | Google | Gemini 1.5 Pro (Sep) (AI Studio) | $2.19 | $1.25 | $5.00 | 100.3 tokens/s | 2,000,000 tokens | 0.57 s | 45
Claude 3.5 Sonnet (Oct) | Amazon Bedrock | Anthropic | Claude 3.5 Sonnet (Oct) | $6.00 | $3.00 | $15.00 | 43 tokens/s | 200,000 tokens | 1.3 s | 44
Claude 3.5 Sonnet (Oct) Vertex | Google Vertex | Anthropic | Claude 3.5 Sonnet (Oct) Vertex | $6.00 | $3.00 | $15.00 | 80.1 tokens/s | 200,000 tokens | 0.82 s | 44
Claude 3.5 Sonnet (Oct) | Anthropic | Anthropic | Claude 3.5 Sonnet (Oct) | $6.00 | $3.00 | $15.00 | 77.2 tokens/s | 200,000 tokens | 1.27 s | 44
QwQ 32B-Preview | Hyperbolic | Alibaba | QwQ 32B-Preview | $0.20 | $0.20 | $0.20 | 67.9 tokens/s | 33,000 tokens | 1.15 s | 43
QwQ 32B-Preview Base | Nebius AI Studio | Alibaba | QwQ 32B-Preview | $0.14 | $0.09 | $0.27 | 83.5 tokens/s | 33,000 tokens | 0.6 s | 43
QwQ 32B-Preview | Fireworks | Alibaba | QwQ 32B-Preview | $0.90 | $0.90 | $0.90 | 68.7 tokens/s | 33,000 tokens | 0.41 s | 43
QwQ 32B-Preview | Deepinfra | Alibaba | QwQ 32B-Preview | $0.26 | $0.15 | $0.60 | 45.1 tokens/s | 33,000 tokens | 14.98 s | 43
QwQ 32B-Preview | Together.ai | Alibaba | QwQ 32B-Preview | $1.20 | $1.20 | $1.20 | 65.2 tokens/s | 33,000 tokens | 0.59 s | 43
Gemini 2.0 Flash-Lite (Preview) (AI Studio) | Google (AI Studio) | Google | Gemini 2.0 Flash-Lite (Preview) (AI Studio) | $0.13 | $0.07 | $0.30 | 184 tokens/s | 1,000,000 tokens | 0.24 s | 42
GPT-4o (Nov '24) | OpenAI | OpenAI | GPT-4o (Nov '24) | $4.38 | $2.50 | $10.00 | 93.6 tokens/s | 128,000 tokens | 0.38 s | 41
GPT-4o (Nov '24) | Microsoft Azure | OpenAI | GPT-4o (Nov '24) | $4.38 | $2.50 | $10.00 | 135.4 tokens/s | 128,000 tokens | 0.99 s | 41
Llama 3.3 70B | Cerebras | Meta | Llama 3.3 70B | $0.94 | $0.85 | $1.20 | 2196.9 tokens/s | 33,000 tokens | 0.18 s | 41
Llama 3.3 70B | Hyperbolic | Meta | Llama 3.3 70B | $0.40 | $0.40 | $0.40 | 38.7 tokens/s | 128,000 tokens | 1.32 s | 41
Llama 3.3 70B | Amazon Bedrock | Meta | Llama 3.3 70B | $0.71 | $0.71 | $0.71 | 136.2 tokens/s | 128,000 tokens | 0.58 s | 41
Llama 3.3 70B Fast | Nebius AI Studio | Meta | Llama 3.3 70B | $0.38 | $0.25 | $0.75 | 74.6 tokens/s | 128,000 tokens | 0.55 s | 41
Llama 3.3 70B Base | Nebius AI Studio | Meta | Llama 3.3 70B | $0.20 | $0.13 | $0.40 | 27.9 tokens/s | 128,000 tokens | 0.61 s | 41
Llama 3.3 70B | CentML | Meta | Llama 3.3 70B | $0.50 | $0.50 | $0.50 | 133.5 tokens/s | 128,000 tokens | 0.54 s | 41
Llama 3.3 70B | Microsoft Azure | Meta | Llama 3.3 70B | $0.71 | $0.71 | $0.71 | 61.2 tokens/s | 128,000 tokens | 0.45 s | 41
Llama 3.3 70B | Fireworks | Meta | Llama 3.3 70B | $0.90 | $0.90 | $0.90 | 120.2 tokens/s | 128,000 tokens | 0.5 s | 41
Llama 3.3 70B (Turbo, FP8) | Deepinfra (Turbo, FP8) | Meta | Llama 3.3 70B | $0.20 | $0.13 | $0.40 | 35.4 tokens/s | 128,000 tokens | 0.46 s | 41
Llama 3.3 70B | Deepinfra | Meta | Llama 3.3 70B | $0.27 | $0.23 | $0.40 | 21.9 tokens/s | 128,000 tokens | 0.42 s | 41
Llama 3.3 70B | FriendliAI | Meta | Llama 3.3 70B | $0.60 | $0.60 | $0.60 | 167.9 tokens/s | 128,000 tokens | 0.33 s | 41
Llama 3.3 70B | Novita | Meta | Llama 3.3 70B | $0.39 | $0.39 | $0.39 | 57.5 tokens/s | 128,000 tokens | 0.85 s | 41
Llama 3.3 70B (Spec decoding) | Groq | Meta | Llama 3.3 70B (Spec decoding) | $0.69 | $0.59 | $0.99 | 1603.3 tokens/s | 8,000 tokens | 0.4 s | 41
Llama 3.3 70B | Groq | Meta | Llama 3.3 70B | $0.64 | $0.59 | $0.79 | 275.4 tokens/s | 128,000 tokens | 0.14 s | 41
Llama 3.3 70B | SambaNova | Meta | Llama 3.3 70B | $0.75 | $0.60 | $1.20 | 304.3 tokens/s | 128,000 tokens | 0.51 s | 41
Llama 3.3 70B Turbo | Together.ai | Meta | Llama 3.3 70B | $0.88 | $0.88 | $0.88 | 174.9 tokens/s | 128,000 tokens | 0.59 s | 41
Llama 3.3 70B | kluster.ai | Meta | Llama 3.3 70B | $0.70 | $0.70 | $0.70 | 19.6 tokens/s | 128,000 tokens | 0.82 s | 41
GPT-4o (ChatGPT) | OpenAI | OpenAI | GPT-4o (ChatGPT) | $7.50 | $5.00 | $15.00 | 96.7 tokens/s | 128,000 tokens | 0.54 s | 41
GPT-4o (Aug '24) | OpenAI | OpenAI | GPT-4o (Aug '24) | $4.38 | $2.50 | $10.00 | 46.7 tokens/s | 128,000 tokens | 0.49 s | 41
GPT-4o (Aug '24) | Microsoft Azure | OpenAI | GPT-4o (Aug '24) | $4.38 | $2.50 | $10.00 | 121.3 tokens/s | 128,000 tokens | 0.65 s | 41
GPT-4o (May '24) | OpenAI | OpenAI | GPT-4o (May '24) | $7.50 | $5.00 | $15.00 | 44.3 tokens/s | 128,000 tokens | 0.5 s | 41
GPT-4o (May '24) | Microsoft Azure | OpenAI | GPT-4o (May '24) | $7.50 | $5.00 | $15.00 | 124.8 tokens/s | 128,000 tokens | 0.78 s | 41
Llama 3.1 405B | Replicate | Meta | Llama 3.1 405B | $9.50 | $9.50 | $9.50 | 18.9 tokens/s | 128,000 tokens | 0.39 s | 40
Llama 3.1 405B | Hyperbolic | Meta | Llama 3.1 405B | $4.00 | $4.00 | $4.00 | 7.1 tokens/s | 128,000 tokens | 0.86 s | 40
Llama 3.1 405B Latency Optimized | Amazon Bedrock Latency Optimized | Meta | Llama 3.1 405B | $3.00 | $3.00 | $3.00 | 63 tokens/s | 128,000 tokens | 0.73 s | 40
Llama 3.1 405B Base | Nebius AI Studio | Meta | Llama 3.1 405B | $1.50 | $1.00 | $3.00 | 34.6 tokens/s | 128,000 tokens | 0.69 s | 40
Llama 3.1 405B Vertex | Google Vertex | Meta | Llama 3.1 405B Vertex | $7.75 | $5.00 | $16.00 | 0 tokens/s | 128,000 tokens | 0.08 s | 40
Llama 3.1 405B | Microsoft Azure | Meta | Llama 3.1 405B | $8.00 | $5.33 | $16.00 | 31.8 tokens/s | 128,000 tokens | 0.5 s | 40
Llama 3.1 405B | Fireworks | Meta | Llama 3.1 405B | $3.00 | $3.00 | $3.00 | 64.2 tokens/s | 128,000 tokens | 0.8 s | 40
Llama 3.1 405B | Deepinfra | Meta | Llama 3.1 405B | $0.90 | $0.90 | $0.90 | 24 tokens/s | 33,000 tokens | 0.48 s | 40
Llama 3.1 405B | SambaNova | Meta | Llama 3.1 405B | $6.25 | $5.00 | $10.00 | 165.6 tokens/s | 16,000 tokens | 0.62 s | 40
Llama 3.1 405B | Databricks | Meta | Llama 3.1 405B | $7.50 | $5.00 | $15.00 | 36.5 tokens/s | 128,000 tokens | 0.71 s | 40
Llama 3.1 405B Turbo | Together.ai | Meta | Llama 3.1 405B | $3.50 | $3.50 | $3.50 | 26.3 tokens/s | 128,000 tokens | 0.64 s | 40
Llama 3.1 405B | kluster.ai | Meta | Llama 3.1 405B | $3.50 | $3.50 | $3.50 | 17.9 tokens/s | 128,000 tokens | 0.98 s | 40
Qwen2.5 72B | Hyperbolic | Alibaba | Qwen2.5 72B | $0.40 | $0.40 | $0.40 | 46.8 tokens/s | 131,000 tokens | 1.05 s | 40
Qwen2.5 72B Base | Nebius AI Studio | Alibaba | Qwen2.5 72B | $0.20 | $0.13 | $0.40 | 32.3 tokens/s | 131,000 tokens | 0.62 s | 40
Qwen2.5 72B Fast | Nebius AI Studio | Alibaba | Qwen2.5 72B | $0.38 | $0.25 | $0.75 | 70.1 tokens/s | 131,000 tokens | 0.55 s | 40
Qwen2.5 72B | Fireworks | Alibaba | Qwen2.5 72B | $0.90 | $0.90 | $0.90 | 44.3 tokens/s | 131,000 tokens | 0.37 s | 40
Qwen2.5 72B | Deepinfra | Alibaba | Qwen2.5 72B | $0.27 | $0.23 | $0.40 | 44.3 tokens/s | 33,000 tokens | 0.27 s | 40
Qwen2.5 72B | SambaNova | Alibaba | Qwen2.5 72B | $2.50 | $2.00 | $4.00 | 225.8 tokens/s | 16,000 tokens | 0.33 s | 40
Qwen2.5 72B Turbo | Together.ai | Alibaba | Qwen2.5 72B | $1.20 | $1.20 | $1.20 | 96.6 tokens/s | 131,000 tokens | 0.45 s | 40
Qwen2.5 72B | Alibaba Cloud | Alibaba | Qwen2.5 72B | $0.00 | $0.00 | $0.00 | 39.8 tokens/s | 131,000 tokens | 1.19 s | 40
Phi-4 | Nebius | Microsoft Azure | Phi-4 | $0.15 | $0.10 | $0.30 | 119.2 tokens/s | 16,000 tokens | 0.47 s | 40
Phi-4 | Deepinfra | Microsoft Azure | Phi-4 | $0.09 | $0.07 | $0.14 | 35.5 tokens/s | 16,000 tokens | 0.36 s | 40
Tulu3 405B | SambaNova | Allen Institute for AI | Tulu3 405B | $6.25 | $5.00 | $10.00 | 172.4 tokens/s | 16,000 tokens | 1.37 s | 40
MiniMax-Text-01 | MiniMax | MiniMax | MiniMax-Text-01 | $0.42 | $0.20 | $1.10 | 38.3 tokens/s | 1,000,000 tokens | 0.86 s | 40
Mistral Large 2 (Nov '24) | Mistral | Mistral | Mistral Large 2 (Nov '24) | $3.00 | $2.00 | $6.00 | 45.9 tokens/s | 128,000 tokens | 0.49 s | 38
Mistral Large 2 (Nov '24) | Microsoft Azure | Mistral | Mistral Large 2 (Nov '24) | $3.00 | $2.00 | $6.00 | 29.7 tokens/s | 128,000 tokens | 0.87 s | 38
Grok Beta | xAI | xAI | Grok Beta | $7.50 | $5.00 | $15.00 | 66.4 tokens/s | 128,000 tokens | 0.26 s | 38
Pixtral Large | Mistral | Mistral | Pixtral Large | $3.00 | $2.00 | $6.00 | 35 tokens/s | 128,000 tokens | 0.43 s | 37
Qwen2.5 Instruct 32B Fast | Nebius AI Studio | Alibaba | Qwen2.5 Instruct 32B | $0.00 | $0.00 | $0.00 | 86.2 tokens/s | 128,000 tokens | 0.54 s | 37
Qwen2.5 Instruct 32B Base | Nebius AI Studio | Alibaba | Qwen2.5 Instruct 32B | $0.00 | $0.00 | $0.00 | 60 tokens/s | 128,000 tokens | 0.55 s | 37
Qwen2.5 Instruct 32B | Groq | Alibaba | Qwen2.5 Instruct 32B | $0.79 | $0.79 | $0.79 | 198.2 tokens/s | 128,000 tokens | 0.22 s | 37
Llama 3.1 Nemotron 70B Base | Nebius AI Studio | NVIDIA | Llama 3.1 Nemotron 70B | $0.20 | $0.13 | $0.40 | 48.7 tokens/s | 128,000 tokens | 0.58 s | 37
Llama 3.1 Nemotron 70B Fast | Nebius AI Studio | NVIDIA | Llama 3.1 Nemotron 70B | $0.38 | $0.25 | $0.75 | 72.8 tokens/s | 128,000 tokens | 0.54 s | 37
Llama 3.1 Nemotron 70B | Deepinfra | NVIDIA | Llama 3.1 Nemotron 70B | $0.27 | $0.23 | $0.40 | 30 tokens/s | 128,000 tokens | 0.31 s | 37
Nova Pro | Amazon Bedrock | Amazon | Nova Pro | $1.40 | $0.80 | $3.20 | 80.1 tokens/s | 300,000 tokens | 0.37 s | 37
Mistral Large 2 (Jul '24) | Mistral | Mistral | Mistral Large 2 (Jul '24) | $3.00 | $2.00 | $6.00 | 31.6 tokens/s | 128,000 tokens | 0.48 s | 37
Mistral Large 2 (Jul '24) | Amazon Bedrock | Mistral | Mistral Large 2 (Jul '24) | $3.00 | $2.00 | $6.00 | 33.1 tokens/s | 128,000 tokens | 0.45 s | 37
Mistral Large 2 (Jul '24) | Microsoft Azure | Mistral | Mistral Large 2 (Jul '24) | $3.00 | $2.00 | $6.00 | 35.6 tokens/s | 128,000 tokens | 0.54 s | 37
Qwen2.5 Coder 32B | Hyperbolic | Alibaba | Qwen2.5 Coder 32B | $0.20 | $0.20 | $0.20 | 61.5 tokens/s | 131,000 tokens | 1.21 s | 36
Qwen2.5 Coder 32B | CentML | Alibaba | Qwen2.5 Coder 32B | $0.80 | $0.80 | $0.80 | 64.7 tokens/s | 131,000 tokens | 0.51 s | 36
Qwen2.5 Coder 32B | Fireworks | Alibaba | Qwen2.5 Coder 32B | $0.90 | $0.90 | $0.90 | 65.1 tokens/s | 33,000 tokens | 0.34 s | 36
Qwen2.5 Coder 32B | Deepinfra | Alibaba | Qwen2.5 Coder 32B | $0.10 | $0.08 | $0.18 | 49.4 tokens/s | 33,000 tokens | 0.52 s | 36
Qwen2.5 Coder 32B | Groq | Alibaba | Qwen2.5 Coder 32B | $0.79 | $0.79 | $0.79 | 384.5 tokens/s | 131,000 tokens | 0.35 s | 36
Qwen2.5 Coder 32B | SambaNova | Alibaba | Qwen2.5 Coder 32B | $1.88 | $1.50 | $3.00 | 325.1 tokens/s | 16,000 tokens | 0.29 s | 36
Qwen2.5 Coder 32B | Together.ai | Alibaba | Qwen2.5 Coder 32B | $0.80 | $0.80 | $0.80 | 77.7 tokens/s | 131,000 tokens | 0.53 s | 36
GPT-4o mini | OpenAI | OpenAI | GPT-4o mini | $0.26 | $0.15 | $0.60 | 95.4 tokens/s | 128,000 tokens | 0.51 s | 36
GPT-4o mini | Microsoft Azure | OpenAI | GPT-4o mini | $0.26 | $0.15 | $0.60 | 172.9 tokens/s | 128,000 tokens | 0.75 s | 36
Llama 3.1 70B | Hyperbolic | Meta | Llama 3.1 70B | $0.40 | $0.40 | $0.40 | 152.7 tokens/s | 128,000 tokens | 1 s | 35
Llama 3.1 70B Latency Optimized | Amazon Bedrock Latency Optimized | Meta | Llama 3.1 70B | $0.90 | $0.90 | $0.90 | 141.9 tokens/s | 128,000 tokens | 0.34 s | 35
Llama 3.1 70B Base | Nebius AI Studio | Meta | Llama 3.1 70B | $0.20 | $0.13 | $0.40 | 42.7 tokens/s | 128,000 tokens | 0.6 s | 35
Llama 3.1 70B Fast | Nebius AI Studio | Meta | Llama 3.1 70B | $0.38 | $0.25 | $0.75 | 147.5 tokens/s | 128,000 tokens | 0.53 s | 35
Llama 3.1 70B Vertex | Google Vertex | Meta | Llama 3.1 70B Vertex | $0.00 | $0.00 | $0.00 | 0 tokens/s | 128,000 tokens | 0.08 s | 35
Llama 3.1 70B | Fireworks | Meta | Llama 3.1 70B | $0.90 | $0.90 | $0.90 | 127.2 tokens/s | 128,000 tokens | 0.43 s | 35
Llama 3.1 70B (Turbo, FP8) | Deepinfra (Turbo, FP8) | Meta | Llama 3.1 70B | $0.20 | $0.13 | $0.40 | 35 tokens/s | 128,000 tokens | 0.38 s | 35
Llama 3.1 70B | Deepinfra | Meta | Llama 3.1 70B | $0.27 | $0.23 | $0.40 | 32.9 tokens/s | 128,000 tokens | 0.28 s | 35
Llama 3.1 70B | FriendliAI | Meta | Llama 3.1 70B | $0.60 | $0.60 | $0.60 | 192 tokens/s | 128,000 tokens | 0.32 s | 35
Llama 3.1 70B | Novita | Meta | Llama 3.1 70B | $0.35 | $0.34 | $0.39 | 84.9 tokens/s | 32,000 tokens | 0.95 s | 35
Llama 3.1 70B | SambaNova | Meta | Llama 3.1 70B | $0.75 | $0.60 | $1.20 | 318 tokens/s | 128,000 tokens | 0.51 s | 35
Llama 3.1 70B | Databricks | Meta | Llama 3.1 70B | $1.50 | $1.00 | $3.00 | 75.8 tokens/s | 128,000 tokens | 0.38 s | 35
Llama 3.1 70B Turbo | Together.ai | Meta | Llama 3.1 70B | $0.88 | $0.88 | $0.88 | 225.2 tokens/s | 128,000 tokens | 0.35 s | 35
Llama 3.1 70B | Simplismart | Meta | Llama 3.1 70B | $0.90 | $0.90 | $0.90 | 126.1 tokens/s | 128,000 tokens | 0.51 s | 35
Mistral Small 3 | Mistral | Mistral | Mistral Small 3 | $0.15 | $0.10 | $0.30 | 123.4 tokens/s | 32,000 tokens | 0.45 s | 35
Mistral Small 3 | Fireworks | Mistral | Mistral Small 3 | $0.90 | $0.90 | $0.90 | 38.5 tokens/s | 32,000 tokens | 0.49 s | 35
Mistral Small 3 | Deepinfra | Mistral | Mistral Small 3 | $0.09 | $0.07 | $0.14 | 81.4 tokens/s | 32,000 tokens | 0.48 s | 35
Mistral Small 3 | Together.ai | Mistral | Mistral Small 3 | $0.80 | $0.80 | $0.80 | 97.2 tokens/s | 32,000 tokens | 0.18 s | 35
Claude 3 Opus | Amazon Bedrock | Anthropic | Claude 3 Opus | $30.00 | $15.00 | $75.00 | 26.2 tokens/s | 200,000 tokens | 1.22 s | 35
Claude 3 Opus Vertex | Google Vertex | Anthropic | Claude 3 Opus Vertex | $30.00 | $15.00 | $75.00 | 26.3 tokens/s | 200,000 tokens | 2.3 s | 35
Claude 3 Opus | Anthropic | Anthropic | Claude 3 Opus | $30.00 | $15.00 | $75.00 | 27.1 tokens/s | 200,000 tokens | 1.16 s | 35
Claude 3.5 Haiku Vertex | Google Vertex | Anthropic | Claude 3.5 Haiku Vertex | $1.60 | $0.80 | $4.00 | 67.5 tokens/s | 200,000 tokens | 0.63 s | 35
Claude 3.5 Haiku | Anthropic | Anthropic | Claude 3.5 Haiku | $1.60 | $0.80 | $4.00 | 66.2 tokens/s | 200,000 tokens | 1.26 s | 35
DeepSeek R1 Distill Llama 8B | Novita | DeepSeek | DeepSeek R1 Distill Llama 8B | $0.04 | $0.04 | $0.04 | 54.2 tokens/s | 32,000 tokens | 9.07 s | 34
Gemini 1.5 Pro (May) (Vertex) | Google (Vertex) | Google | Gemini 1.5 Pro (May) (Vertex) | $2.19 | $1.25 | $5.00 | 0 tokens/s | 2,000,000 tokens | 0.1 s | 34
Gemini 1.5 Pro (May) (AI Studio) | Google (AI Studio) | Google | Gemini 1.5 Pro (May) (AI Studio) | $2.19 | $1.25 | $5.00 | 65.9 tokens/s | 2,000,000 tokens | 0.43 s | 34
Qwen Turbo | Alibaba Cloud | Alibaba | Qwen Turbo | $0.09 | $0.05 | $0.20 | 77.2 tokens/s | 1,000,000 tokens | 1.05 s | 34
Llama 3.2 90B (Vision) | Amazon Bedrock | Meta | Llama 3.2 90B (Vision) | $0.72 | $0.72 | $0.72 | 37.2 tokens/s | 128,000 tokens | 0.73 s | 33
Llama 3.2 90B (Vision) Vertex | Google Vertex | Meta | Llama 3.2 90B (Vision) Vertex | $0.00 | $0.00 | $0.00 | 0 tokens/s | 128,000 tokens | 0.08 s | 33
Llama 3.2 90B (Vision) | Fireworks | Meta | Llama 3.2 90B (Vision) | $0.90 | $0.90 | $0.90 | 41.1 tokens/s | 128,000 tokens | 0.36 s | 33
Llama 3.2 90B (Vision) | Deepinfra | Meta | Llama 3.2 90B (Vision) | $0.36 | $0.35 | $0.40 | 34.5 tokens/s | 33,000 tokens | 0.28 s | 33
Llama 3.2 90B (Vision) | Groq | Meta | Llama 3.2 90B (Vision) | $0.90 | $0.90 | $0.90 | 261.4 tokens/s | 8,000 tokens | 0.31 s | 33
Llama 3.2 90B (Vision) Turbo | Together.ai | Meta | Llama 3.2 90B (Vision) | $1.20 | $1.20 | $1.20 | 54.3 tokens/s | 128,000 tokens | 0.26 s | 33
Qwen2 72B | Together.ai | Alibaba | Qwen2 72B | $0.90 | $0.90 | $0.90 | 63.8 tokens/s | 33,000 tokens | 0.43 s | 33
Mistral Saba | Mistral | Mistral | Mistral Saba | $0.30 | $0.20 | $0.60 | 96.7 tokens/s | 32,000 tokens | 0.4 s | 32
Mistral Saba | Groq | Mistral | Mistral Saba | $0.79 | $0.79 | $0.79 | 381.2 tokens/s | 32,000 tokens | 0.33 s | 32
Jamba 1.5 Large | AI21 Labs | AI21 Labs | Jamba 1.5 Large | $3.50 | $2.00 | $8.00 | 60.5 tokens/s | 256,000 tokens | 0.51 s | 29
Jamba 1.5 Large | Microsoft Azure | AI21 Labs | Jamba 1.5 Large | $3.50 | $2.00 | $8.00 | 51 tokens/s | 256,000 tokens | 0.73 s | 29
Gemini 1.5 Flash (May) (Vertex) | Google (Vertex) | Google | Gemini 1.5 Flash (May) (Vertex) | $0.13 | $0.07 | $0.30 | 0 tokens/s | 1,000,000 tokens | 0.1 s | 28
Gemini 1.5 Flash (May) (AI Studio) | Google (AI Studio) | Google | Gemini 1.5 Flash (May) (AI Studio) | $0.13 | $0.07 | $0.30 | 307.9 tokens/s | 1,000,000 tokens | 0.23 s | 28
Nova Micro | Amazon Bedrock | Amazon | Nova Micro | $0.06 | $0.04 | $0.14 | 189.3 tokens/s | 130,000 tokens | 0.29 s | 28
Yi-Large | Fireworks | 01.AI | Yi-Large | $3.00 | $3.00 | $3.00 | 67.2 tokens/s | 32,000 tokens | 0.39 s | 28
Codestral (Jan '25) | Mistral | Mistral | Codestral (Jan '25) | $0.45 | $0.30 | $0.90 | 205.7 tokens/s | 256,000 tokens | 0.38 s | 28
Codestral (Jan '25) Vertex | Google Vertex | Mistral | Codestral (Jan '25) Vertex | $0.45 | $0.30 | $0.90 | 150.1 tokens/s | 128,000 tokens | 0.15 s | 28
Llama 3 70B | Replicate | Meta | Llama 3 70B | $1.18 | $0.65 | $2.75 | 46.5 tokens/s | 8,000 tokens | 0.37 s | 27
Llama 3 70B | Hyperbolic | Meta | Llama 3 70B | $0.40 | $0.40 | $0.40 | 82.4 tokens/s | 8,000 tokens | 1.09 s | 27
Llama 3 70B | Amazon Bedrock | Meta | Llama 3 70B | $2.86 | $2.65 | $3.50 | 54 tokens/s | 8,000 tokens | 0.43 s | 27
Llama 3 70B | Microsoft Azure | Meta | Llama 3 70B | $2.90 | $2.68 | $3.54 | 18.9 tokens/s | 8,000 tokens | 0.78 s | 27
Llama 3 70B | Fireworks | Meta | Llama 3 70B | $0.90 | $0.90 | $0.90 | 142 tokens/s | 8,000 tokens | 0.28 s | 27
Llama 3 70B | Deepinfra | Meta | Llama 3 70B | $0.27 | $0.23 | $0.40 | 50 tokens/s | 8,000 tokens | 0.56 s | 27
Llama 3 70B | Groq | Meta | Llama 3 70B | $0.64 | $0.59 | $0.79 | 339.7 tokens/s | 8,000 tokens | 0.23 s | 27
Llama 3 70B (Turbo, FP8) | Together.ai | Meta | Llama 3 70B | $0.88 | $0.88 | $0.88 | 57.4 tokens/s | 8,000 tokens | 0.47 s | 27
Mistral Small (Sep '24) | Mistral | Mistral | Mistral Small (Sep '24) | $0.30 | $0.20 | $0.60 | 71.6 tokens/s | 33,000 tokens | 0.41 s | 27
Mistral Large (Feb '24) | Mistral | Mistral | Mistral Large (Feb '24) | $6.00 | $4.00 | $12.00 | 37.8 tokens/s | 33,000 tokens | 0.44 s | 26
Mistral Large (Feb '24) | Amazon Bedrock | Mistral | Mistral Large (Feb '24) | $6.00 | $4.00 | $12.00 | 44.6 tokens/s | 33,000 tokens | 0.37 s | 26
Mistral Large (Feb '24) | Microsoft Azure | Mistral | Mistral Large (Feb '24) | $6.00 | $4.00 | $12.00 | 39.8 tokens/s | 33,000 tokens | 0.5 s | 26
Mixtral 8x22B | Mistral | Mistral | Mixtral 8x22B | $3.00 | $2.00 | $6.00 | 84.9 tokens/s | 65,000 tokens | 0.44 s | 26
Mixtral 8x22B Base | Nebius AI Studio | Mistral | Mixtral 8x22B | $0.60 | $0.40 | $1.20 | 91.6 tokens/s | 65,000 tokens | 0.5 s | 26
Mixtral 8x22B Fast | Nebius AI Studio | Mistral | Mixtral 8x22B | $1.05 | $0.70 | $2.10 | 108.6 tokens/s | 65,000 tokens | 0.52 s | 26
Mixtral 8x22B | Fireworks | Mistral | Mixtral 8x22B | $1.20 | $1.20 | $1.20 | 91.6 tokens/s | 65,000 tokens | 0.33 s | 26
Mixtral 8x22B | Together.ai | Mistral | Mixtral 8x22B | $1.20 | $1.20 | $1.20 | 70.9 tokens/s | 65,000 tokens | 0.81 s | 26
Qwen2.5 Coder 7B Fast | Nebius AI Studio | Alibaba | Qwen2.5 Coder 7B | $0.04 | $0.03 | $0.09 | 222.7 tokens/s | 131,000 tokens | 0.47 s | 26
Qwen2.5 Coder 7B Base | Nebius AI Studio | Alibaba | Qwen2.5 Coder 7B | $0.01 | $0.01 | $0.03 | 188.3 tokens/s | 131,000 tokens | 0.51 s | 26
Phi-3 Medium 14B | Microsoft Azure | Microsoft Azure | Phi-3 Medium 14B | $0.30 | $0.17 | $0.68 | 53.1 tokens/s | 128,000 tokens | 0.42 s | 25
DeepSeek Coder V2 Lite Fast, FP8 | Nebius AI Studio | DeepSeek | DeepSeek Coder V2 Lite | $0.12 | $0.08 | $0.24 | 117.6 tokens/s | 128,000 tokens | 0.56 s | 24
DeepSeek Coder V2 Lite Base, FP8 | Nebius AI Studio | DeepSeek | DeepSeek Coder V2 Lite | $0.06 | $0.04 | $0.12 | 113.1 tokens/s | 128,000 tokens | 0.58 s | 24
Mistral Medium | Mistral | Mistral | Mistral Medium | $4.09 | $2.75 | $8.10 | 44.7 tokens/s | 33,000 tokens | 0.43 s | 24
Llama 3.1 8B | Cerebras | Meta | Llama 3.1 8B | $0.10 | $0.10 | $0.10 | 2223.1 tokens/s | 33,000 tokens | 0.27 s | 24
Llama 3.1 8B | Hyperbolic | Meta | Llama 3.1 8B | $0.10 | $0.10 | $0.10 | 104.6 tokens/s | 128,000 tokens | 1 s | 24
Llama 3.1 8B | Amazon Bedrock | Meta | Llama 3.1 8B | $0.22 | $0.22 | $0.22 | 92.1 tokens/s | 128,000 tokens | 0.33 s | 24
Llama 3.1 8B Fast | Nebius AI Studio | Meta | Llama 3.1 8B | $0.04 | $0.03 | $0.09 | 183.9 tokens/s | 128,000 tokens | 0.5 s | 24
Llama 3.1 8B Base | Nebius AI Studio | Meta | Llama 3.1 8B | $0.03 | $0.02 | $0.06 | 66.6 tokens/s | 128,000 tokens | 0.53 s | 24
Llama 3.1 8B Vertex | Google Vertex | Meta | Llama 3.1 8B Vertex | $0.00 | $0.00 | $0.00 | 0 tokens/s | 128,000 tokens | 0.08 s | 24
Llama 3.1 8B | Microsoft Azure | Meta | Llama 3.1 8B | $0.38 | $0.30 | $0.61 | 213.4 tokens/s | 128,000 tokens | 0.24 s | 24
Llama 3.1 8B | Fireworks | Meta | Llama 3.1 8B | $0.20 | $0.20 | $0.20 | 232 tokens/s | 128,000 tokens | 0.23 s | 24
Llama 3.1 8B | Deepinfra | Meta | Llama 3.1 8B | $0.04 | $0.03 | $0.05 | 69.8 tokens/s | 128,000 tokens | 0.27 s | 24
Llama 3.1 8B | FriendliAI | Meta | Llama 3.1 8B | $0.10 | $0.10 | $0.10 | 491.8 tokens/s | 128,000 tokens | 0.25 s | 24
Llama 3.1 8B | Novita | Meta | Llama 3.1 8B | $0.05 | $0.05 | $0.05 | 65.1 tokens/s | 16,000 tokens | 0.6 s | 24
Llama 3.1 8B | Groq | Meta | Llama 3.1 8B | $0.06 | $0.05 | $0.08 | 751.5 tokens/s | 128,000 tokens | 0.17 s | 24
Llama 3.1 8B | SambaNova | Meta | Llama 3.1 8B | $0.13 | $0.10 | $0.20 | 1068.7 tokens/s | 16,000 tokens | 0.26 s | 24
Llama 3.1 8B Turbo | Together.ai | Meta | Llama 3.1 8B Turbo | $0.18 | $0.18 | $0.18 | 277.3 tokens/s | 128,000 tokens | 0.19 s | 24
Llama 3.1 8B | Simplismart | Meta | Llama 3.1 8B | $0.15 | $0.15 | $0.15 | 458.6 tokens/s | 128,000 tokens | 0.15 s | 24
Llama 3.1 8B | kluster.ai | Meta | Llama 3.1 8B | $0.18 | $0.18 | $0.18 | 14 tokens/s | 128,000 tokens | 0.39 s | 24
Pixtral 12B | Mistral | Mistral | Pixtral 12B | $0.15 | $0.15 | $0.15 | 102.3 tokens/s | 128,000 tokens | 0.43 s | 23
Pixtral 12B | Hyperbolic | Mistral | Pixtral 12B | $0.10 | $0.10 | $0.10 | 75.6 tokens/s | 128,000 tokens | 0.45 s | 23
Mistral Small (Feb '24) | Mistral | Mistral | Mistral Small (Feb '24) | $1.50 | $1.00 | $3.00 | 130 tokens/s | 33,000 tokens | 0.44 s | 23
Mistral Small (Feb '24) | Microsoft Azure | Mistral | Mistral Small (Feb '24) | $1.50 | $1.00 | $3.00 | 53.9 tokens/s | 33,000 tokens | 0.38 s | 23
Ministral 8B | Mistral | Mistral | Ministral 8B | $0.10 | $0.10 | $0.10 | 141.9 tokens/s | 128,000 tokens | 0.4 s | 22
Llama 3.2 11B (Vision) | Amazon Bedrock | Meta | Llama 3.2 11B (Vision) | $0.16 | $0.16 | $0.16 | 142.5 tokens/s | 128,000 tokens | 0.36 s | 22
Llama 3.2 11B (Vision) | CentML | Meta | Llama 3.2 11B (Vision) | $0.15 | $0.15 | $0.15 | 81.7 tokens/s | 128,000 tokens | 0.44 s | 22
Llama 3.2 11B (Vision) | Fireworks | Meta | Llama 3.2 11B (Vision) | $0.20 | $0.20 | $0.20 | 69.8 tokens/s | 128,000 tokens | 0.27 s | 22
Llama 3.2 11B (Vision) | Deepinfra | Meta | Llama 3.2 11B (Vision) | $0.06 | $0.06 | $0.06 | 50.5 tokens/s | 128,000 tokens | 0.25 s | 22
Llama 3.2 11B (Vision) | Groq | Meta | Llama 3.2 11B (Vision) | $0.18 | $0.18 | $0.18 | 751.3 tokens/s | 8,000 tokens | 0.18 s | 22
Llama 3.2 11B (Vision) Turbo | Together.ai | Meta | Llama 3.2 11B (Vision) | $0.18 | $0.18 | $0.18 | 142.8 tokens/s | 128,000 tokens | 0.19 s | 22
Command-R+ | Amazon Bedrock | Cohere | Command-R+ | $6.00 | $3.00 | $15.00 | 49.5 tokens/s | 128,000 tokens | 0.49 s | 21
Command-R+ | Cohere | Cohere | Command-R+ | $4.38 | $2.50 | $10.00 | 73 tokens/s | 128,000 tokens | 0.24 s | 21
Codestral (May '24) | Mistral | Mistral | Codestral (May '24) | $0.30 | $0.20 | $0.60 | 84.1 tokens/s | 33,000 tokens | 0.42 s | 20
Aya Expanse 32B | Cohere | Cohere | Aya Expanse 32B | $0.75 | $0.50 | $1.50 | 121.8 tokens/s | 128,000 tokens | 0.15 s | 20
Command-R+ (Apr '24) | Amazon Bedrock | Cohere | Command-R+ (Apr '24) | $6.00 | $3.00 | $15.00 | 46.9 tokens/s | 128,000 tokens | 0.49 s | 20
Command-R+ (Apr '24) | Cohere | Cohere | Command-R+ (Apr '24) | $6.00 | $3.00 | $15.00 | 78.1 tokens/s | 128,000 tokens | 0.23 s | 20
Command-R+ (Apr '24) | Microsoft Azure | Cohere | Command-R+ (Apr '24) | $6.00 | $3.00 | $15.00 | 50.7 tokens/s | 128,000 tokens | 0.58 s | 20
DBRX | Databricks | Databricks | DBRX | $1.13 | $0.75 | $2.25 | 68.7 tokens/s | 33,000 tokens | 0.47 s | 20
DBRX | Together.ai | Databricks | DBRX | $1.20 | $1.20 | $1.20 | 82.9 tokens/s | 33,000 tokens | 0.32 s | 20
Ministral 3B | Mistral | Mistral | Ministral 3B | $0.04 | $0.04 | $0.04 | 220.1 tokens/s | 128,000 tokens | 0.38 s | 20
Mistral NeMo | Mistral | Mistral | Mistral NeMo | $0.15 | $0.15 | $0.15 | 120.7 tokens/s | 128,000 tokens | 0.43 s | 20
Mistral NeMo Fast | Nebius AI Studio | Mistral | Mistral NeMo | $0.12 | $0.08 | $0.24 | 159.4 tokens/s | 128,000 tokens | 0.48 s | 20
Mistral NeMo Base | Nebius AI Studio | Mistral | Mistral NeMo | $0.06 | $0.04 | $0.12 | 31.5 tokens/s | 128,000 tokens | 0.64 s | 20
Mistral NeMo | Deepinfra | Mistral | Mistral NeMo | $0.06 | $0.04 | $0.10 | 54.1 tokens/s | 128,000 tokens | 0.32 s | 20
DeepSeek R1 Distill Qwen 1.5B | Together.ai | DeepSeek | DeepSeek R1 Distill Qwen 1.5B | $0.18 | $0.18 | $0.18 | 380.2 tokens/s | 128,000 tokens | 6.65 s | 19
Mixtral 8x7B | Mistral | Mistral | Mixtral 8x7B | $0.70 | $0.70 | $0.70 | 98 tokens/s | 33,000 tokens | 0.41 s | 17
Mixtral 8x7B | Amazon Bedrock | Mistral | Mixtral 8x7B | $0.51 | $0.45 | $0.70 | 78.8 tokens/s | 33,000 tokens | 0.33 s | 17
Mixtral 8x7B Fast | Nebius AI Studio | Mistral | Mixtral 8x7B | $0.23 | $0.15 | $0.45 | 164.4 tokens/s | 33,000 tokens | 0.5 s | 17
Mixtral 8x7B Base | Nebius AI Studio | Mistral | Mixtral 8x7B | $0.12 | $0.08 | $0.24 | 135 tokens/s | 33,000 tokens | 0.47 s | 17
Mixtral 8x7B | Fireworks | Mistral | Mixtral 8x7B | $0.50 | $0.50 | $0.50 | 153.2 tokens/s | 33,000 tokens | 0.26 s | 17
Mixtral 8x7B | Deepinfra | Mistral | Mixtral 8x7B | $0.24 | $0.24 | $0.24 | 109.5 tokens/s | 33,000 tokens | 0.46 s | 17
Mixtral 8x7B | Groq | Mistral | Mixtral 8x7B | $0.24 | $0.24 | $0.24 | 572.1 tokens/s | 33,000 tokens | 0.27 s | 17
Mixtral 8x7B | Databricks | Mistral | Mixtral 8x7B | $0.63 | $0.50 | $1.00 | 92.6 tokens/s | 33,000 tokens | 0.4 s | 17
Mixtral 8x7B | Together.ai | Mistral | Mixtral 8x7B | $0.60 | $0.60 | $0.60 | 102.9 tokens/s | 33,000 tokens | 0.38 s | 17
OpenChat 3.5 | Deepinfra | OpenChat | OpenChat 3.5 | $0.06 | $0.06 | $0.06 | 77.7 tokens/s | 8,000 tokens | 0.27 s | 16
Command-R | Cohere | Cohere | Command-R | $0.26 | $0.15 | $0.60 | 75.9 tokens/s | 128,000 tokens | 0.19 s | 15
Command-R (Mar '24) | Amazon Bedrock | Cohere | Command-R (Mar '24) | $0.75 | $0.50 | $1.50 | 105.3 tokens/s | 128,000 tokens | 0.34 s | 15
Command-R (Mar '24) | Cohere | Cohere | Command-R (Mar '24) | $0.75 | $0.50 | $1.50 | 173.7 tokens/s | 128,000 tokens | 0.15 s | 15
Command-R (Mar '24) | Microsoft Azure | Cohere | Command-R (Mar '24) | $0.75 | $0.50 | $1.50 | 81.6 tokens/s | 128,000 tokens | 0.46 s | 15
Codestral-Mamba | Mistral | Mistral | Codestral-Mamba | $0.25 | $0.25 | $0.25 | 96 tokens/s | 256,000 tokens | 0.56 s | 14
Mistral 7B | Mistral | Mistral | Mistral 7B | $0.25 | $0.25 | $0.25 | 131 tokens/s | 8,000 tokens | 0.36 s | 10
Mistral 7B | Deepinfra | Mistral | Mistral 7B | $0.04 | $0.03 | $0.06 | 84.5 tokens/s | 8,000 tokens | 0.21 s | 10
Mistral 7B | Novita | Mistral | Mistral 7B | $0.06 | $0.06 | $0.06 | 99.4 tokens/s | 32,000 tokens | 0.91 s | 10
Mistral 7B | Together.ai | Mistral | Mistral 7B | $0.20 | $0.20 | $0.20 | 167 tokens/s | 8,000 tokens | 0.21 s | 10
Llama 2 Chat 7B | Replicate | Meta | Llama 2 Chat 7B | $0.10 | $0.05 | $0.25 | 122.6 tokens/s | 4,000 tokens | 0.59 s | 8
o1-preview | OpenAI | OpenAI | o1-preview | $26.25 | $15.00 | $60.00 | 125 tokens/s | 128,000 tokens | 22.96 s | N/A
o1-preview | Microsoft Azure | OpenAI | o1-preview | $28.88 | $16.50 | $66.00 | 136.2 tokens/s | 128,000 tokens | 26.78 s | N/A
Llama 3.2 3B | Hyperbolic | Meta | Llama 3.2 3B | $0.10 | $0.10 | $0.10 | 175.1 tokens/s | 128,000 tokens | 0.88 s | N/A
Llama 3.2 3B | Amazon Bedrock | Meta | Llama 3.2 3B | $0.15 | $0.15 | $0.15 | 73 tokens/s | 128,000 tokens | 0.31 s | N/A
Llama 3.2 3B Base | Nebius AI Studio | Meta | Llama 3.2 3B | $0.01 | $0.01 | $0.02 | 122.5 tokens/s | 128,000 tokens | 0.52 s | N/A
Llama 3.2 3B | Fireworks | Meta | Llama 3.2 3B | $0.10 | $0.10 | $0.10 | 155.6 tokens/s | 128,000 tokens | 0.19 s | N/A
Llama 3.2 3B | Deepinfra | Meta | Llama 3.2 3B | $0.02 | $0.02 | $0.03 | 155.2 tokens/s | 128,000 tokens | 0.45 s | N/A
Llama 3.2 3B | Novita | Meta | Llama 3.2 3B | $0.04 | $0.03 | $0.05 | 94.7 tokens/s | 32,000 tokens | 0.58 s | N/A
Llama 3.2 3B | Groq | Meta | Llama 3.2 3B | $0.06 | $0.06 | $0.06 | 1544.5 tokens/s | 8,000 tokens | 0.33 s | N/A
Llama 3.2 3B | SambaNova | Meta | Llama 3.2 3B | $0.10 | $0.08 | $0.16 | 1549 tokens/s | 8,000 tokens | 0.28 s | N/A
Llama 3.2 3B Turbo | Together.ai | Meta | Llama 3.2 3B | $0.06 | $0.06 | $0.06 | 63.7 tokens/s | 128,000 tokens | 0.37 s | N/A
Llama 3.2 1B | Amazon Bedrock | Meta | Llama 3.2 1B | $0.10 | $0.10 | $0.10 | 122.3 tokens/s | 128,000 tokens | 0.34 s | N/A
Llama 3.2 1B Base | Nebius AI Studio | Meta | Llama 3.2 1B | $0.01 | $0.01 | $0.01 | 267.1 tokens/s | 128,000 tokens | 0.49 s | N/A
Llama 3.2 1B | Deepinfra | Meta | Llama 3.2 1B | $0.01 | $0.01 | $0.02 | 136.3 tokens/s | 128,000 tokens | 0.25 s | N/A
Llama 3.2 1B | Groq | Meta | Llama 3.2 1B | $0.04 | $0.04 | $0.04 | 3114.8 tokens/s | 8,000 tokens | 0.49 s | N/A
Llama 3.2 1B | SambaNova | Meta | Llama 3.2 1B | $0.05 | $0.04 | $0.08 | 2539.8 tokens/s | 16,000 tokens | 0.94 s | N/A
Gemini 2.0 Flash (exp) (AI Studio) | Google (AI Studio) | Google | Gemini 2.0 Flash (exp) (AI Studio) | $0.00 | $0.00 | $0.00 | 174.6 tokens/s | 1,000,000 tokens | 0.26 s | N/A
Gemini 1.5 Flash (Sep) (Vertex) | Google Vertex | Google | Gemini 1.5 Flash (Sep) (Vertex) | $0.13 | $0.07 | $0.30 | 0 tokens/s | 1,000,000 tokens | 0.1 s | N/A
Gemini 1.5 Flash (Sep) (AI Studio) | Google (AI Studio) | Google | Gemini 1.5 Flash (Sep) (AI Studio) | $0.13 | $0.07 | $0.30 | 180.5 tokens/s | 1,000,000 tokens | 0.29 s | N/A
Gemma 2 27B | Together.ai | Google | Gemma 2 27B | $0.80 | $0.80 | $0.80 | 81.8 tokens/s | 8,000 tokens | 0.45 s | N/A
Gemma 2 9B Base | Nebius AI Studio | Google | Gemma 2 9B | $0.03 | $0.02 | $0.06 | 106.1 tokens/s | 8,000 tokens | 0.82 s | N/A
Gemma 2 9B | Deepinfra | Google | Gemma 2 9B | $0.04 | $0.03 | $0.06 | 46.9 tokens/s | 8,000 tokens | 0.35 s | N/A
Gemma 2 9B | Groq | Google | Gemma 2 9B | $0.20 | $0.20 | $0.20 | 662.9 tokens/s | 8,000 tokens | 0.23 s | N/A
Gemma 2 9B | Together.ai | Google | Gemma 2 9B | $0.30 | $0.30 | $0.30 | 133 tokens/s | 8,000 tokens | 0.24 s | N/A
Gemini 1.5 Flash-8B AI Studio | Google AI Studio | Google | Gemini 1.5 Flash-8B AI Studio | $0.07 | $0.04 | $0.15 | 276.4 tokens/s | 1,000,000 tokens | 0.19 s | N/A
Claude 3.5 Sonnet (June) | Anthropic | Anthropic | Claude 3.5 Sonnet (June) | $6.00 | $3.00 | $15.00 | 80.6 tokens/s | 200,000 tokens | 0.77 s | N/A
Claude 3 Haiku | Anthropic | Anthropic | Claude 3 Haiku | $0.50 | $0.25 | $1.25 | 131.7 tokens/s | 200,000 tokens | 0.58 s | N/A
Aya Expanse 8B | Cohere | Cohere | Aya Expanse 8B | $0.75 | $0.50 | $1.50 | 166 tokens/s | 8,000 tokens | 0.21 s | N/A
Jamba 1.5 Mini | AI21 Labs | AI21 Labs | Jamba 1.5 Mini | $0.25 | $0.20 | $0.40 | 183.5 tokens/s | 256,000 tokens | 0.3 s | N/A
Jamba 1.5 Mini | Microsoft Azure | AI21 Labs | Jamba 1.5 Mini | $0.25 | $0.20 | $0.40 | 80.9 tokens/s | 256,000 tokens | 0.49 s | N/A
GPT-4 Turbo | OpenAI | OpenAI | GPT-4 Turbo | $15.00 | $10.00 | $30.00 | 49.2 tokens/s | 128,000 tokens | 0.52 s | N/A
Llama 3 8B | Replicate | Meta | Llama 3 8B | $0.10 | $0.05 | $0.25 | 73 tokens/s | 8,000 tokens | 0.39 s | N/A
Llama 3 8B | Amazon Bedrock | Meta | Llama 3 8B | $0.38 | $0.30 | $0.60 | 103.3 tokens/s | 8,000 tokens | 0.3 s | N/A
Llama 3 8B | Microsoft Azure | Meta | Llama 3 8B | $0.38 | $0.30 | $0.61 | 73.5 tokens/s | 8,000 tokens | 0.36 s | N/A
Llama 3 8B | Fireworks | Meta | Llama 3 8B | $0.20 | $0.20 | $0.20 | 126 tokens/s | 8,000 tokens | 0.23 s | N/A
Llama 3 8B | Deepinfra | Meta | Llama 3 8B | $0.04 | $0.03 | $0.06 | 112.4 tokens/s | 8,000 tokens | 0.17 s | N/A
Llama 3 8B | Novita | Meta | Llama 3 8B | $0.04 | $0.04 | $0.04 | 43.3 tokens/s | 8,000 tokens | 1.19 s | N/A
Llama 3 8B | Groq | Meta | Llama 3 8B | $0.06 | $0.05 | $0.08 | 1198.6 tokens/s | 8,000 tokens | 0.34 s | N/A
Gemini 1.0 Pro Vertex | Google Vertex | Google | Gemini 1.0 Pro Vertex | $0.19 | $0.13 | $0.38 | 0 tokens/s | 33,000 tokens | 0.15 s | N/A
Claude 3 Sonnet | Amazon Bedrock | Anthropic | Claude 3 Sonnet | $6.00 | $3.00 | $15.00 | 62.4 tokens/s | 200,000 tokens | 0.68 s | N/A
Claude 3 Sonnet | Anthropic | Anthropic | Claude 3 Sonnet | $6.00 | $3.00 | $15.00 | 58 tokens/s | 200,000 tokens | 0.54 s | N/A
Claude 2.1 | Amazon Bedrock | Anthropic | Claude 2.1 | $12.00 | $8.00 | $24.00 | 28.9 tokens/s | 200,000 tokens | 1.9 s | N/A
Claude 2.1 | Anthropic | Anthropic | Claude 2.1 | $12.00 | $8.00 | $24.00 | 13.3 tokens/s | 200,000 tokens | 0.82 s | N/A
Claude 2.0 | Anthropic | Anthropic | Claude 2.0 | $12.00 | $8.00 | $24.00 | 29.3 tokens/s | 100,000 tokens | 0.82 s | N/A
Jamba Instruct | AI21 Labs | AI21 Labs | Jamba Instruct | $0.55 | $0.50 | $0.70 | 184 tokens/s | 256,000 tokens | 0.29 s | N/A
Jamba Instruct | Microsoft Azure | AI21 Labs | Jamba Instruct | $0.55 | $0.50 | $0.70 | 76.9 tokens/s | 256,000 tokens | 0.52 s | N/A
Showing 301 of 303 models
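
The page does not say how the first price column relates to the input and output prices. The rows appear consistent with a blend that weights input tokens 3:1 over output tokens; that ratio is an inference from the numbers above, not a documented formula. A short sketch (with `blended_price` as a hypothetical helper name) checks it against a few rows:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output price, in USD per 1M tokens.

    The 3:1 default is an assumption inferred from the table rows.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# (label, input price, output price, price listed in the table)
samples = [
    ("o3-mini (OpenAI)", 1.10, 4.40, 1.93),
    ("o1 (Microsoft Azure)", 15.00, 60.00, 26.25),
    ("DeepSeek R1 (DeepSeek)", 0.55, 2.19, 0.96),
    ("DeepSeek R1 (Fireworks)", 3.00, 8.00, 4.25),
]

for label, input_price, output_price, listed in samples:
    computed = blended_price(input_price, output_price)
    print(f"{label}: computed {computed:.3f}, listed {listed:.2f}")
```

If a workload is output-heavy, passing different weights gives a price that reflects that mix better than the listed blend.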

About This Tool

This interactive tool helps you compare LLM providers and models on price, output speed, latency, context window, and quality index.

Data is sourced from artificialanalysis.ai and is updated regularly to reflect the latest information available.

Use the filters and chart configuration options to customize your view and find the perfect LLM for your specific needs.
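
One way to combine the Latency and Output Speed columns when comparing providers is a rough end-to-end time estimate. The sketch below assumes that Latency is the delay before the first token arrives and that Output Speed stays constant afterwards; neither definition is stated on this page, and `estimated_response_seconds` is a hypothetical helper.

```python
from typing import Optional


def estimated_response_seconds(latency_s: float,
                               speed_tokens_per_s: float,
                               output_tokens: int) -> Optional[float]:
    """Rough time to receive output_tokens: latency plus generation time.

    Returns None when the table lists 0 tokens/s, which is treated
    here as a missing measurement rather than a real speed.
    """
    if speed_tokens_per_s <= 0:
        return None
    return latency_s + output_tokens / speed_tokens_per_s


# (label, latency in seconds, output speed in tokens/s) taken from the table
samples = [
    ("Llama 3.3 70B (Groq)", 0.14, 275.4),
    ("DeepSeek R1 (Together.ai)", 15.23, 113.6),
    ("Gemini 2.0 Flash Vertex (Google Vertex)", 0.1, 0.0),
]

for label, latency_s, speed in samples:
    estimate = estimated_response_seconds(latency_s, speed, output_tokens=500)
    if estimate is None:
        print(f"{label}: no speed measurement")
    else:
        print(f"{label}: about {estimate:.1f} s for 500 tokens")
```

The estimate makes the trade-off visible: a reasoning model with high latency can still be slower overall than a model with a lower token rate but a near-instant first token.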