Open Source Models

Open-source AI,
no infrastructure.

Run Llama, Mistral, DeepSeek, Qwen, Gemma, and 136+ more open-weight models via a single API. No GPUs required. OpenAI-compatible.

Providers available

meta-llama (10)mistralai (19)deepseek (13)qwen (47)google (26)microsoft (3)nvidia (8)cohere (5)nousresearch (4)liquid (1)

Featured model families

Llama 3

Meta

Llama 3 License

Meta's flagship open-weight models. Llama 3.3 70B matches GPT-4o on many benchmarks at a fraction of the cost.

DeepSeek

DeepSeek AI

MIT

Chinese open-source lab producing state-of-the-art coding and reasoning models. DeepSeek R1 rivals o1 at open-weight pricing.

Mistral

Mistral AI

Apache 2.0

European open-source AI models known for efficiency. Mistral 7B and Mixtral 8x7B set the standard for small, capable models.

Qwen

Alibaba Cloud

Qwen License

Qwen2.5 and QwQ are among the strongest open-weight models for coding and reasoning tasks, including 128k context variants.

Gemma

Google

Gemma Terms

Google's open-weight Gemma models are compact and highly capable — Gemma 3 27B competes with models twice its size.

Phi

Microsoft

MIT

Microsoft's Phi models punch far above their weight class. Phi-4 14B performs comparably to much larger models on reasoning tasks.

All open-source models

136 open-weight models from 10 providers.

ModelContextInputOutput
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
1M
$2/M
$12/M
Qwen: Qwen3.7 Plus (20% off)
qwen/qwen3.7-plus
1M
$0.32/M
$1.28/M
Cohere: Command A
cohere/command-a
256k
$2.50/M
$10/M
Cohere: Command R (08-2024)
cohere/command-r-08-2024
128k
$0.15/M
$0.60/M
Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024
128k
$2.50/M
$10/M
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024
128k
$0.04/M
$0.15/M
Cohere: North Mini Code (free)
cohere/north-mini-code:free
256k
Free
Free
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
131k
$0.20/M
$0.80/M
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324
164k
$0.20/M
$0.77/M
DeepSeek: DeepSeek V3.1
deepseek/deepseek-chat-v3.1
164k
$0.21/M
$0.79/M
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
164k
$0.27/M
$0.95/M
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
131k
$0.23/M
$0.34/M
DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-exp
164k
$0.27/M
$0.41/M
DeepSeek: DeepSeek V4 Flash
deepseek/deepseek-v4-flash
1M
$0.09/M
$0.18/M
DeepSeek: DeepSeek V4 Flash (>40% off)
deepseek/deepseek-v4-flash:discounted
1M
$0.14/M
$0.28/M
DeepSeek: DeepSeek V4 Pro
deepseek/deepseek-v4-pro
1M
$0.43/M
$0.87/M
DeepSeek: DeepSeek V4 Pro (>80% off)
deepseek/deepseek-v4-pro:discounted
1M
$0.43/M
$0.87/M
DeepSeek: R1
deepseek/deepseek-r1
164k
$0.70/M
$2.50/M
DeepSeek: R1 0528
deepseek/deepseek-r1-0528
164k
$0.50/M
$2.15/M
DeepSeek: R1 Distill Llama 70B
deepseek/deepseek-r1-distill-llama-70b
128k
$0.80/M
$0.80/M
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
1M
$0.30/M
$2.50/M
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
1M
$0.10/M
$0.40/M
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
1M
$0.10/M
$0.40/M
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
1M
$1.25/M
$10/M
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
1M
$1.25/M
$10/M
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
1M
$1.25/M
$10/M
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
1M
$0.50/M
$3/M
Google: Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
1M
$0.25/M
$1.50/M
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
1M
$0.25/M
$1.50/M
Google: Gemini 3.1 Pro Preview Custom Tools
google/gemini-3.1-pro-preview-customtools
1M
$2/M
$12/M
Google: Gemini 3.5 Flash
google/gemini-3.5-flash
1M
$1.50/M
$9/M
Google: Gemma 2 27B
google/gemma-2-27b-it
8k
$0.65/M
$0.65/M
Google: Gemma 3 12B
google/gemma-3-12b-it
131k
$0.05/M
$0.15/M
Google: Gemma 3 27B
google/gemma-3-27b-it
131k
$0.08/M
$0.16/M
Google: Gemma 3 4B
google/gemma-3-4b-it
131k
$0.05/M
$0.10/M
Google: Gemma 3n 4B
google/gemma-3n-e4b-it
33k
$0.06/M
$0.12/M
Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
262k
$0.06/M
$0.33/M
Google: Gemma 4 31B
google/gemma-4-31b-it
262k
$0.12/M
$0.35/M
Google: Lyria 3 Clip Preview
google/lyria-3-clip-preview
1M
Free
Free
Google: Lyria 3 Pro Preview
google/lyria-3-pro-preview
1M
Free
Free
Google: Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-image
33k
$0.30/M
$2.50/M
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-image-preview
131k
$0.50/M
$3/M
Google: Nano Banana 2 (Gemini 3.1 Flash Image)
google/gemini-3.1-flash-image
131k
$0.50/M
$3/M
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
google/gemini-3-pro-image-preview
66k
$2/M
$12/M
Google: Nano Banana Pro (Gemini 3 Pro Image)
google/gemini-3-pro-image
66k
$2/M
$12/M
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b
128k
$0.03/M
$0.12/M
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct
8k
$0.14/M
$0.14/M
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
131k
$0.40/M
$0.40/M
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
131k
$0.02/M
$0.03/M
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
131k
$0.34/M
$0.34/M
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct
131k
$0.03/M
$0.20/M
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct
131k
$0.05/M
$0.34/M
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
131k
$0.10/M
$0.32/M
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick
1M
$0.15/M
$0.60/M
Meta: Llama 4 Scout
meta-llama/llama-4-scout
10M
$0.10/M
$0.30/M
Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12b
164k
$0.18/M
$0.18/M
Microsoft: Phi 4
microsoft/phi-4
16k
$0.07/M
$0.14/M
Microsoft: Phi 4 Mini Instruct
microsoft/phi-4-mini-instruct
131k
$0.08/M
$0.35/M
Mistral Large
mistralai/mistral-large
128k
$2/M
$6/M
Mistral Large 2407
mistralai/mistral-large-2407
131k
$2/M
$6/M
Mistral: Codestral 2508
mistralai/codestral-2508
256k
$0.30/M
$0.90/M
Mistral: Devstral 2 2512
mistralai/devstral-2512
262k
$0.40/M
$2/M
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512
262k
$0.20/M
$0.20/M
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512
131k
$0.10/M
$0.10/M
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512
262k
$0.15/M
$0.15/M
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512
262k
$0.50/M
$1.50/M
Mistral: Mistral Medium 3
mistralai/mistral-medium-3
131k
$0.40/M
$2/M
Mistral: Mistral Medium 3.1
mistralai/mistral-medium-3.1
131k
$0.40/M
$2/M
Mistral: Mistral Medium 3.5
mistralai/mistral-medium-3-5
262k
$1.50/M
$7.50/M
Mistral: Mistral Nemo
mistralai/mistral-nemo
131k
$0.02/M
$0.03/M
Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501
33k
$0.05/M
$0.08/M
Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instruct
128k
$0.35/M
$0.55/M
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct
128k
$0.07/M
$0.20/M
Mistral: Mistral Small 4
mistralai/mistral-small-2603
262k
$0.15/M
$0.60/M
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct
66k
$2/M
$6/M
Mistral: Saba
mistralai/mistral-saba
33k
$0.20/M
$0.60/M
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507
32k
$0.10/M
$0.30/M
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b
131k
$1/M
$1/M
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b
131k
$0.70/M
$0.70/M
Nous: Hermes 4 405B
nousresearch/hermes-4-405b
131k
$1/M
$3/M
Nous: Hermes 4 70B
nousresearch/hermes-4-70b
131k
$0.13/M
$0.40/M
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
131k
$0.40/M
$0.40/M
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
262k
$0.05/M
$0.20/M
NVIDIA: Nemotron 3 Nano Omni (free)
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
256k
Free
Free
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
1M
$0.09/M
$0.45/M
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free
1M
Free
Free
NVIDIA: Nemotron 3 Ultra
nvidia/nemotron-3-ultra-550b-a55b
1M
$0.50/M
$2.20/M
NVIDIA: Nemotron 3 Ultra (free)
nvidia/nemotron-3-ultra-550b-a55b:free
1M
Free
Free
NVIDIA: Nemotron 3.5 Content Safety (free)
nvidia/nemotron-3.5-content-safety:free
128k
Free
Free
Qwen: Qwen Plus 0728
qwen/qwen-plus-2025-07-28
1M
$0.26/M
$0.78/M
Qwen: Qwen Plus 0728 (thinking)
qwen/qwen-plus-2025-07-28:thinking
1M
$0.26/M
$0.78/M
Qwen: Qwen-Plus
qwen/qwen-plus
1M
$0.26/M
$0.78/M
Qwen: Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct
131k
$0.04/M
$0.10/M
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
131k
$0.80/M
$1/M
Qwen: Qwen3 14B
qwen/qwen3-14b
132k
$0.10/M
$0.24/M
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b
131k
$0.45/M
$1.82/M
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
262k
$0.09/M
$0.10/M
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
262k
$0.10/M
$0.10/M
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b
131k
$0.12/M
$0.50/M
Qwen: Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
131k
$0.05/M
$0.19/M

Showing 100 of 136 open-source models. View all →

Why use open-source models?

No vendor lock-in

Open weights mean the model can be hosted anywhere. Switch providers without changing your application.

Transparent pricing

No hidden model updates. Run the exact same checkpoint for months and get reproducible results.

Fine-tuning ready

Open weights let you fine-tune on your own data for domain-specific tasks — something closed models cannot offer.

Community audited

Thousands of researchers test open-source models. Known limitations are documented publicly.

Cost-effective at scale

Many open models are 10–100× cheaper per token than their closed equivalents with comparable quality.

No data training risk

Some providers don't train on your data when you use their hosted open-source inference.

Run any open-source model via API

No GPU setup. No Docker. Just an API key and one line of code.

Get started free