Modelos de IA 2026Fichas Técnicas Completas

Especificações técnicas, preços, context window e capacidades de 500 modelos de IA de 61 empresas. Fichas atualizadas semanalmente com dados dos fabricantes.

Para ranking por performance, acesse o Benchmark de IA.

500

Modelos

61

Empresas

92

Open Source

122

Multimodal

AI21 Labs

7 modelos
ModeloOS
AI21: Jamba Large 1.7
256K tokens·$2.00/1M
Jamba 1.5 Large
$2.00/1M
Jamba 1.5 Mini
$0.20/1M
Jamba 1.6 Large
$2.00/1M
Jamba 1.6 Mini
$0.20/1M
Jamba 1.7 Mini
0
Jamba Reasoning 3B
0

AionLabs

3 modelos
ModeloOS
AionLabs: Aion-1.0
131K tokens·$4.00/1M
AionLabs: Aion-2.0
131K tokens·$0.80/1M
AionLabs: Aion-RP 1.0 (8B)
33K tokens·$0.80/1M

AlfredPros

1 modelo
ModeloOS
AlfredPros: CodeLLaMa 7B Instruct Solidity
4K tokens·$0.80/1M

Alibaba

64 modelos
ModeloOS
Qwen Chat 14B
0
Qwen Chat 72B
0
Qwen: Qwen2.5 7B Instruct
33K tokens·$0.04/1M
Qwen: Qwen2.5 VL 72B Instruct
32K tokens·$0.25/1M
Qwen: Qwen3 235B A22B Instruct 2507
262K tokens·$0.45/1M
Qwen: Qwen3 235B A22B Thinking 2507
131K tokens·$0.15/1M
Qwen: Qwen3 30B A3B Instruct 2507
262K tokens·$0.08/1M
Qwen: Qwen3 30B A3B Thinking 2507
131K tokens·$0.08/1M
Qwen: Qwen3 Coder 30B A3B Instruct
160K tokens·$0.19/1M
Qwen: Qwen3 Next 80B A3B Instruct
262K tokens·$0.50/1M
Qwen: Qwen3 VL 235B A22B Instruct
262K tokens·$0.30/1M
Qwen: Qwen3 VL 30B A3B Instruct
131K tokens·$0.20/1M
Qwen: Qwen3 VL 32B Instruct
131K tokens·$0.70/1M
Qwen: Qwen3 VL 8B Instruct
131K tokens·$0.18/1M
Qwen1.5 Chat 110B
0
Qwen2 Instruct 72B
0
Qwen2.5 72B Instruct
33K tokens·$0.36/1M
Qwen2.5 Coder 32B Instruct
33K tokens00
Qwen2.5 Coder Instruct 7B
0
Qwen2.5 Instruct 32B
0
Qwen2.5 Max
$1.60/1M
Qwen3 0.6B (Non-reasoning)
$0.11/1M
Qwen3 0.6B (Reasoning)
$0.11/1M
Qwen3 1.7B (Non-reasoning)
$0.11/1M
Qwen3 1.7B (Reasoning)
$0.11/1M
Qwen3 14B (Non-reasoning)
$0.23/1M
Qwen3 14B (Reasoning)
$0.23/1M
Qwen3 235B A22B (Reasoning)
$0.70/1M
Qwen3 30B A3B (Reasoning)
$0.09/1M
Qwen3 30B A3B 2507 (Reasoning)
$0.28/1M
Qwen3 30B A3B 2507 Instruct
$0.15/1M
Qwen3 32B (Non-reasoning)
$0.15/1M
Qwen3 32B (Reasoning)
$0.15/1M
Qwen3 4B (Non-reasoning)
$0.11/1M
Qwen3 4B (Reasoning)
$0.11/1M
Qwen3 4B 2507 (Reasoning)
0
Qwen3 4B 2507 Instruct
0
Qwen3 8B (Non-reasoning)
$0.18/1M
Qwen3 8B (Reasoning)
$0.11/1M
Qwen3 Coder 480B A35B Instruct
$0.30/1M
Qwen3 Max (Preview)
$1.20/1M
Qwen3 Max Thinking (Preview)
$1.20/1M
Qwen3 Next 80B A3B (Reasoning)
$0.50/1M
Qwen3 Omni 30B A3B (Reasoning)
$0.25/1M
Qwen3 Omni 30B A3B Instruct
$0.25/1M
Qwen3 VL 235B A22B (Reasoning)
$0.84/1M
Qwen3 VL 30B A3B (Reasoning)
$0.20/1M
Qwen3 VL 32B (Reasoning)
$0.70/1M
Qwen3 VL 4B (Reasoning)
0
Qwen3 VL 4B Instruct
0
Qwen3 VL 8B (Reasoning)
$0.18/1M
Qwen3.5 0.8B (Non-reasoning)
$0.01/1M
Qwen3.5 0.8B (Reasoning)
$0.01/1M
Qwen3.5 2B (Reasoning)
$0.02/1M
Qwen3.5 4B (Non-reasoning)
$0.03/1M
Qwen3.5 4B (Reasoning)
$0.03/1M
Qwen3.5 9B (Reasoning)
0
Qwen3.5 Omni Flash
$0.10/1M
Qwen3.5 Omni Plus
$0.40/1M
Qwen3.6 Max Preview
$1.30/1M
Qwen3.7 Max
$2.50/1M
QwQ 32B
$0.66/1M
QwQ 32B-Preview
0
Wan 2.1

Allen Institute for AI

8 modelos
ModeloOS
Llama 3.1 Tulu3 405B
0
Molmo 7B-D
0
Molmo2-8B
0
OLMo 2 32B
0
OLMo 2 7B
0
Olmo 3 7B Instruct
$0.10/1M
Olmo 3 7B Think
0
Olmo 3.1 32B Think
0

AllenAI

2 modelos
ModeloOS
Olmo 3 32B Think
66K tokens00
Olmo 3.1 32B Instruct
66K tokens00

Amazon

13 modelos
ModeloOS
Amazon: Nova 2 Lite
1.0M tokens·$0.30/1M
Amazon: Nova Lite 1.0
300K tokens·$0.06/1M
Amazon: Nova Micro 1.0
128K tokens·$0.04/1M
Amazon: Nova Premier 1.0
1.0M tokens·$2.50/1M
Amazon: Nova Pro 1.0
300K tokens·$0.80/1M
Nova 2.0 Lite (high)
$0.30/1M
Nova 2.0 Omni (low)
$0.30/1M
Nova 2.0 Omni (medium)
$0.30/1M
Nova 2.0 Omni (Non-reasoning)
$0.30/1M
Nova 2.0 Pro Preview (medium)
$1.25/1M
Nova Lite
$0.06/1M
Nova Micro
$0.04/1M
Nova Pro
$0.80/1M

Anthropic

35 modelos
ModeloOS
Anthropic: Claude 3 Haiku
200K tokens·$0.25/1M
Anthropic: Claude Opus 4.8 (Fast)
1.0M tokens·$10.00/1M
Claude 2.0
0
Claude 2.1
0
Claude 3 Opus
$18.75/1M
Claude 3 Sonnet
$3.00/1M
Claude 3.5 Haiku
200K tokens·$1.00/1M
Claude 3.5 Sonnet (June '24)
$3.75/1M
Claude 3.5 Sonnet (Oct '24)
$3.75/1M
Claude 3.7 Sonnet
200K tokens·$3.75/1M
Claude 3.7 Sonnet (thinking)
200K tokens00
Claude 4 Opus (Reasoning)
$18.75/1M
Claude 4 Sonnet (Reasoning)
$3.75/1M
Claude 4.1 Opus (Non-reasoning)
$18.75/1M
Claude 4.1 Opus (Reasoning)
$18.75/1M
Claude 4.5 Haiku (Reasoning)
$1.25/1M
Claude 4.5 Sonnet (Non-reasoning)
$3.75/1M
Claude 4.5 Sonnet (Reasoning)
$3.75/1M
Claude Haiku 4.5
200K tokens·$1.25/1M
Claude Instant
0
Claude Opus 4
200K tokens·$18.75/1M
Claude Opus 4.1
200K tokens·$15.00/1M
Claude Opus 4.5
200K tokens·$6.25/1M
Claude Opus 4.5 (Reasoning)
$6.25/1M
Claude Opus 4.6
1.0M tokens·$6.25/1M
Claude Opus 4.6 (Adaptive Reasoning, Max Effort)
$6.25/1M
Claude Opus 4.6 (Fast)
1.0M tokens·$30.00/1M
Claude Opus 4.7
1.0M tokens·$6.25/1M
Claude Opus 4.7 (Fast)
1.0M tokens·$30.00/1M
Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
1.0M tokens·$6.25/1M
Claude Sonnet 4
1.0M tokens·$3.75/1M
Claude Sonnet 4.5
1.0M tokens·$3.00/1M
Claude Sonnet 4.6
1.0M tokens·$3.75/1M
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
$3.75/1M
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
$3.75/1M

Arcee AI

7 modelos
ModeloOS
Arcee AI: Coder Large
33K tokens·$0.50/1M
Arcee AI: Maestro Reasoning
131K tokens·$0.90/1M
Arcee AI: Spotlight
131K tokens·$0.18/1M
Arcee AI: Trinity Large Thinking
262K tokens·$0.22/1M
Arcee AI: Trinity Mini
131K tokens·$0.04/1M
Arcee AI: Virtuoso Large
131K tokens·$0.75/1M
Trinity Large Thinking
$0.23/1M

Baidu

5 modelos
ModeloOS
Baidu: ERNIE 4.5 21B A3B Thinking
131K tokens·$0.07/1M
Baidu: ERNIE 4.5 300B A47B
123K tokens·$0.28/1M
Baidu: ERNIE 4.5 VL 28B A3B
30K tokens·$0.14/1M
Baidu: ERNIE 4.5 VL 424B A47B
123K tokens·$0.42/1M
ERNIE 5.0 Thinking Preview
0

ByteDance

2 modelos
ModeloOS
ByteDance: UI-TARS 7B
128K tokens·$0.10/1M
Doubao Seed Code
0

ByteDance Seed

4 modelos
ModeloOS
ByteDance Seed: Seed 1.6 Flash
262K tokens·$0.07/1M
ByteDance Seed: Seed-2.0-Lite
262K tokens·$0.25/1M
Doubao Seed Code
0
Seed-OSS-36B-Instruct
$0.21/1M

China Mobile

3 modelos
ModeloOS
JT-35B-Flash
0
JT-35B-Flash
0
JT-MINI
0

Cohere

6 modelos
ModeloOS
Cohere: Command R+ (08-2024)
128K tokens·$2.50/1M
Cohere: Command R7B (12-2024)
128K tokens·$0.04/1M
Command A+
0
Command-R (Mar '24)
$0.50/1M
Command-R+ (Apr '24)
$3.00/1M
Tiny Aya Global
0

Databricks

1 modelo
ModeloOS
DBRX Instruct
0

Deep Cogito

2 modelos
ModeloOS
Cogito v2.1 (Reasoning)
$1.25/1M
Deep Cogito: Cogito v2.1 671B
128K tokens·$1.25/1M

DeepSeek

25 modelos
ModeloOS
DeepSeek Coder V2 Lite Instruct
0
DeepSeek LLM 67B Chat (V1)
0
DeepSeek R1 (Jan '25)
$1.68/1M
DeepSeek R1 0528 Qwen3 8B
0
DeepSeek R1 Distill Llama 8B
0
DeepSeek R1 Distill Qwen 1.5B
0
DeepSeek R1 Distill Qwen 14B
0
DeepSeek V3
131K tokens·$0.23/1M
DeepSeek V3 0324
$1.20/1M
DeepSeek V3.1
164K tokens·$0.40/1M
DeepSeek V3.1 Terminus
164K tokens·$1.64/1M
DeepSeek V3.2
131K tokens·$0.50/1M
DeepSeek V3.2 Exp
164K tokens·$0.27/1M
DeepSeek V3.2 Exp (Non-reasoning)
$0.28/1M
DeepSeek V3.2 Exp (Reasoning)
$0.28/1M
DeepSeek V3.2 Speciale
164K tokens00
DeepSeek V4 Flash
1.0M tokens·$0.14/1M
DeepSeek V4 Pro
1.0M tokens·$0.43/1M
DeepSeek-Coder-V2
0
DeepSeek-V2-Chat
0
DeepSeek-V2.5
0
DeepSeek-V2.5 (Dec '24)
0
DeepSeek: R1
164K tokens·$0.70/1M
DeepSeek: R1 Distill Qwen 32B
128K tokens00
R1 Distill Llama 70B
131K tokens·$0.70/1M

EssentialAI

1 modelo
ModeloOS
EssentialAI: Rnj 1 Instruct
33K tokens·$0.15/1M

Goliath 120B

1 modelo
ModeloOS
Goliath 120B
6K tokens·$3.75/1M

Google

61 modelos
ModeloOS
Gemini 1.0 Pro
0
Gemini 1.0 Ultra
0
Gemini 1.5 Flash (May '24)
0
Gemini 1.5 Flash (Sep '24)
0
Gemini 1.5 Flash-8B
0
Gemini 1.5 Pro (May '24)
0
Gemini 1.5 Pro (Sep '24)
0
Gemini 2.0 Flash
1.0M tokens·$0.15/1M
Gemini 2.0 Flash (experimental)
0
Gemini 2.0 Flash Lite
1.0M tokens·$0.07/1M
Gemini 2.0 Flash Thinking Experimental (Dec '24)
0
Gemini 2.0 Flash Thinking Experimental (Jan '25)
0
Gemini 2.0 Flash-Lite (Feb '25)
0
Gemini 2.0 Flash-Lite (Preview)
0
Gemini 2.0 Pro Experimental (Feb '25)
0
Gemini 2.5 Flash
1.0M tokens·$0.30/1M
Gemini 2.5 Flash Lite
1.0M tokens·$0.10/1M
Gemini 2.5 Flash Preview (Non-reasoning)
0
Gemini 2.5 Flash Preview (Reasoning)
$0.30/1M
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
0
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.10/1M
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.10/1M
Gemini 2.5 Pro
1.0M tokens·$1.25/1M
Gemini 2.5 Pro Preview (Mar' 25)
0
Gemini 2.5 Pro Preview (May' 25)
$1.25/1M
Gemini 2.5 Pro Preview 05-06
1.0M tokens·$1.25/1M
Gemini 2.5 Pro Preview 06-05
1.0M tokens·$1.25/1M
Gemini 3 Deep Think
0
Gemini 3 Flash Preview
1.0M tokens·$0.50/1M
Gemini 3 Flash Preview (Non-reasoning)
$0.50/1M
Gemini 3 Flash Preview (Reasoning)
$0.50/1M
Gemini 3 Pro Preview (high)
$2.00/1M
Gemini 3 Pro Preview (low)
$2.00/1M
Gemini 3.1 Flash Lite
1.0M tokens·$0.25/1M
Gemini 3.1 Flash Lite Preview
1.0M tokens·$0.25/1M
Gemini 3.1 Pro Preview
1.0M tokens·$2.00/1M
Gemini 3.1 Pro Preview Custom Tools
1.0M tokens·$2.00/1M
Gemini 3.5 Flash (minimal)
$1.50/1M
Gemma 2 27B
8K tokens·$0.65/1M
Gemma 3 12B
131K tokens·$0.09/1M
Gemma 3 1B Instruct
0
Gemma 3 270M
0
Gemma 3 27B
131K tokens·$0.11/1M
Gemma 3 4B
131K tokens·$0.04/1M
Gemma 3n 4B
33K tokens·$0.06/1M
Gemma 3n E2B Instruct
0
Gemma 3n E4B Instruct
$0.02/1M
Gemma 3n E4B Instruct Preview (May '25)
0
Gemma 4 26B A4B
262K tokens·$0.13/1M
Gemma 4 31B
262K tokens·$0.14/1M
Gemma 4 E2B (Non-reasoning)
0
Gemma 4 E2B (Reasoning)
0
Gemma 4 E4B (Non-reasoning)
0
Gemma 4 E4B (Reasoning)
0
Google: Gemini 3.5 Flash
1.0M tokens·$1.50/1M
Lyria 3 Clip Preview
1.0M tokens
Lyria 3 Pro Preview
1.0M tokens
Nano Banana (Gemini 2.5 Flash Image)
33K tokens·$0.30/1M
Nano Banana 2 (Gemini 3.1 Flash Image Preview)
131K tokens·$0.50/1M
Nano Banana Pro (Gemini 3 Pro Image Preview)
66K tokens·$2.00/1M
PALM-2
0

IBM

10 modelos
ModeloOS
Granite 3.3 8B (Non-reasoning)
$0.03/1M
Granite 4.0 1B
0
Granite 4.0 350M
0
Granite 4.0 H 1B
0
Granite 4.0 H 350M
0
Granite 4.0 H Small
$0.06/1M
Granite 4.0 Micro
131K tokens00
Granite 4.1 30B
0
Granite 4.1 3B
0
Granite 4.1 8B
$0.05/1M

Inception

1 modelo
ModeloOS
Inception: Mercury 2
128K tokens·$0.25/1M

Inclusion AI

2 modelos
ModeloOS
Ling 2.6 Flash
$0.10/1M
Ling-2.6-1T
$0.30/1M

InclusionAI

6 modelos
ModeloOS
Ling-1T
0
Ling-flash-2.0
$0.14/1M
Ling-mini-2.0
0
Ring-1T
0
Ring-2.6-1T
$0.30/1M
Ring-flash-2.0
$0.14/1M

Inflection

2 modelos
ModeloOS
Inflection: Inflection 3 Pi
8K tokens·$2.50/1M
Inflection: Inflection 3 Productivity
8K tokens·$2.50/1M

Kimi

2 modelos
ModeloOS
Kimi K2 Thinking
262K tokens·$0.60/1M
Kimi Linear 48B A3B Instruct
0

Korea Telecom

2 modelos
ModeloOS
Mi:dm K 2.5 Pro
0
Mi:dm K 2.5 Pro Preview
0

Kuaishou

1 modelo
ModeloOS
Kling AI 2.0

KwaiKAT

1 modelo
ModeloOS
KAT-Coder-Pro V1
$0.30/1M

Kwaipilot

1 modelo
ModeloOS
Kwaipilot: KAT-Coder-Pro V2
256K tokens·$0.30/1M

LG AI

2 modelos
ModeloOS
EXAONE 4.5 33B
0
K-EXAONE (Reasoning)
0

LG AI Research

3 modelos
ModeloOS
Exaone 4.0 1.2B (Non-reasoning)
0
EXAONE 4.0 32B (Non-reasoning)
0
EXAONE 4.0 32B (Reasoning)
0

Liquid AI

7 modelos
ModeloOS
LFM 40B
0
LFM2 1.2B
0
LFM2 2.6B
0
LFM2 8B A1B
0
LFM2.5-1.2B-Instruct
0
LFM2.5-1.2B-Thinking
0
LFM2.5-VL-1.6B
0

LiquidAI

1 modelo
ModeloOS
LFM2-24B-A2B
33K tokens·$0.03/1M

LongCat

1 modelo
ModeloOS
LongCat Flash Lite
0

Luma AI

1 modelo
ModeloOS
Luma Dream Machine 1.6

MBZUAI Institute of Foundation Models

3 modelos
ModeloOS
K2 Think V2
0
K2-V2 (high)
0
K2-V2 (medium)
0

Magnum v4 72B

1 modelo
ModeloOS
Magnum v4 72B
16K tokens·$3.00/1M

Mancer

1 modelo
ModeloOS
Mancer: Weaver (alpha)
8K tokens·$0.75/1M

Meta

19 modelos
ModeloOS
Llama 2 Chat 13B
0
Llama 2 Chat 70B
0
Llama 2 Chat 7B
$0.05/1M
Llama 3 70B Instruct
8K tokens·$0.65/1M
Llama 3 8B Instruct
8K tokens·$0.04/1M
Llama 3.1 70B Instruct
131K tokens·$0.56/1M
Llama 3.1 8B Instruct
16K tokens·$0.10/1M
Llama 3.1 Instruct 405B
$2.75/1M
Llama 3.2 11B Vision Instruct
131K tokens·$0.24/1M
Llama 3.2 1B Instruct
60K tokens·$0.05/1M
Llama 3.2 3B Instruct
80K tokens·$0.15/1M
Llama 3.2 Instruct 90B (Vision)
$1.38/1M
Llama 3.3 70B Instruct
131K tokens·$0.58/1M
Llama 4 Maverick
1.0M tokens·$0.35/1M
Llama 4 Scout
10.0M tokens·$0.17/1M
Llama 65B
0
Llama Guard 3 8B
131K tokens·$0.48/1M
Llama Guard 4 12B
164K tokens·$0.18/1M
Muse Spark
0

Microsoft

5 modelos
ModeloOS
Microsoft: Phi 4
16K tokens·$0.13/1M
Phi-3 Mini Instruct 3.8B
0
Phi-4 Mini Instruct
0
Phi-4 Multimodal Instruct
0
WizardLM-2 8x22B
66K tokens·$0.62/1M

MiniMax

10 modelos
ModeloOS
Hailuo MiniMax Video-01
MiniMax M1 40k
0
MiniMax M1 80k
$0.55/1M
MiniMax-M2
205K tokens·$0.30/1M
MiniMax: MiniMax M1
1.0M tokens·$0.40/1M
MiniMax: MiniMax M2-her
66K tokens·$0.30/1M
MiniMax: MiniMax M2.1
197K tokens·$0.30/1M
MiniMax: MiniMax M2.5
197K tokens·$0.30/1M
MiniMax: MiniMax M2.7
197K tokens·$0.30/1M
MiniMax: MiniMax-01
1.0M tokens·$0.20/1M

Mistral

21 modelos
ModeloOS
Devstral 2
0
Devstral Small (Jul '25)
131K tokens·$0.10/1M
Devstral Small (May '25)
0
Devstral Small 2
$0.10/1M
Magistral Medium 1
0
Magistral Small 1
0
Magistral Small 1.2
0
Ministral 3 14B
$0.20/1M
Ministral 3 3B
$0.10/1M
Ministral 3 8B
$0.15/1M
Mistral 7B Instruct
$0.20/1M
Mistral Large 2 (Jul '24)
131K tokens·$2.00/1M
Mistral Large 2 (Nov '24)
$2.00/1M
Mistral Large 3
$4.00/1M
Mistral Medium
$2.75/1M
Mistral Small (Feb '24)
$1.00/1M
Mistral Small (Sep '24)
$0.20/1M
Mistral Small 3
$0.07/1M
Mistral Small 3.1
$0.10/1M
Mistral Small 3.2
$0.09/1M
Mixtral 8x22B Instruct
0

Mistral AI

23 modelos
ModeloOS
Magistral Medium 1.2
0
Mistral Large
128K tokens·$2.00/1M
Mistral: Codestral 2508
256K tokens·$0.30/1M
Mistral: Devstral 2 2512
262K tokens·$0.40/1M
Mistral: Devstral Medium
131K tokens·$0.40/1M
Mistral: Devstral Small 1.1
131K tokens·$0.10/1M
Mistral: Ministral 3 14B 2512
262K tokens·$0.20/1M
Mistral: Ministral 3 3B 2512
131K tokens·$0.10/1M
Mistral: Ministral 3 8B 2512
262K tokens·$0.15/1M
Mistral: Mistral 7B Instruct v0.1
3K tokens·$0.11/1M
Mistral: Mistral Medium 3
131K tokens·$0.40/1M
Mistral: Mistral Medium 3.1
131K tokens·$0.40/1M
Mistral: Mistral Medium 3.5
262K tokens·$1.50/1M
Mistral: Mistral Nemo
131K tokens·$0.02/1M
Mistral: Mistral Small 3.1 24B
128K tokens·$0.35/1M
Mistral: Mistral Small 3.2 24B
128K tokens·$0.07/1M
Mistral: Mistral Small 4
262K tokens·$0.20/1M
Mistral: Mistral Small Creative
33K tokens·$0.10/1M
Mistral: Mixtral 8x22B Instruct
66K tokens·$2.00/1M
Mistral: Mixtral 8x7B Instruct
33K tokens·$0.45/1M
Mistral: Pixtral Large 2411
131K tokens·$2.00/1M
Mistral: Saba
33K tokens00
Mistral: Voxtral Small 24B 2507
32K tokens·$0.10/1M

Moonshot AI

1 modelo
ModeloOS
Kimi K2
131K tokens·$0.58/1M

MoonshotAI

4 modelos
ModeloOS
MoonshotAI: Kimi K2 0711
131K tokens·$0.57/1M
MoonshotAI: Kimi K2 0905
262K tokens·$0.60/1M
MoonshotAI: Kimi K2.5
262K tokens·$0.60/1M
MoonshotAI: Kimi K2.6
262K tokens·$0.95/1M

Morph

2 modelos
ModeloOS
Morph: Morph V3 Fast
82K tokens·$0.80/1M
Morph: Morph V3 Large
262K tokens·$0.90/1M

Motif Technologies

1 modelo
ModeloOS
Motif-2-12.7B-Reasoning
0

MythoMax 13B

1 modelo
ModeloOS
MythoMax 13B
4K tokens·$0.06/1M

NVIDIA

17 modelos
ModeloOS
Llama 3.1 Nemotron 70B Instruct
131K tokens·$1.20/1M
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
0
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
$0.60/1M
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
0
Llama 3.3 Nemotron Super 49B v1 (Reasoning)
0
Llama Nemotron Super 49B v1.5 (Non-reasoning)
$0.10/1M
Llama Nemotron Super 49B v1.5 (Reasoning)
$0.10/1M
Nemotron 3 Nano Omni 30B A3B Reasoning
$0.07/1M
Nemotron Cascade 2 30B A3B
0
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)
262K tokens·$0.05/1M
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)
$0.06/1M
NVIDIA Nemotron 3 Nano 4B
0
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
1.0M tokens·$0.30/1M
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
$0.20/1M
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
$0.20/1M
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
131K tokens·$0.05/1M
NVIDIA Nemotron Nano 9B V2 (Reasoning)
$0.04/1M

Nanbeige

1 modelo
ModeloOS
Nanbeige4.1-3B
0

Naver

1 modelo
ModeloOS
HyperCLOVA X SEED Think (32B)
0

Nex AGI

1 modelo
ModeloOS
Nex AGI: DeepSeek V3.1 Nex N1
131K tokens·$0.14/1M

Nous

4 modelos
ModeloOS
Nous: Hermes 3 405B Instruct
131K tokens·$1.00/1M
Nous: Hermes 3 70B Instruct
131K tokens·$0.30/1M
Nous: Hermes 4 405B
131K tokens·$1.00/1M
Nous: Hermes 4 70B
131K tokens·$0.13/1M

Nous Research

7 modelos
ModeloOS
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
0
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
0
Hermes 3 - Llama-3.1 70B
$0.30/1M
Hermes 4 - Llama-3.1 405B (Non-reasoning)
$1.00/1M
Hermes 4 - Llama-3.1 405B (Reasoning)
$1.00/1M
Hermes 4 - Llama-3.1 70B (Non-reasoning)
$0.13/1M
Hermes 4 - Llama-3.1 70B (Reasoning)
$0.13/1M

NousResearch

1 modelo
ModeloOS
NousResearch: Hermes 2 Pro - Llama-3 8B
8K tokens·$0.14/1M

OpenAI

75 modelos
ModeloOS
GPT Audio
128K tokens·$2.50/1M
GPT Audio Mini
128K tokens·$0.60/1M
GPT Chat Latest
400K tokens·$5.00/1M
GPT-3.5 Turbo
16K tokens·$0.50/1M
GPT-3.5 Turbo
$0.50/1M
GPT-3.5 Turbo (0613)
0
GPT-4 Turbo
128K tokens·$10.00/1M
GPT-4 Turbo Preview
128K tokens·$10.00/1M
GPT-4.1
1.0M tokens·$2.00/1M
GPT-4.1 Mini
1.0M tokens·$0.40/1M
GPT-4.1 Nano
1.0M tokens·$0.10/1M
GPT-4.5 (Preview)
0
GPT-4o (2024-08-06)
128K tokens·$2.50/1M
GPT-4o (2024-11-20)
128K tokens·$2.50/1M
GPT-4o (ChatGPT)
0
GPT-4o (March 2025, chatgpt-4o-latest)
0
GPT-4o Audio
128K tokens·$2.50/1M
GPT-4o mini Realtime (Dec '24)
0
GPT-4o Realtime (Dec '24)
0
GPT-4o Search Preview
128K tokens·$2.50/1M
GPT-4o-mini (2024-07-18)
128K tokens·$0.15/1M
GPT-4o-mini Search Preview
128K tokens·$0.15/1M
GPT-5
400K tokens·$1.25/1M
GPT-5 (ChatGPT)
$1.25/1M
GPT-5 (minimal)
$1.25/1M
GPT-5 Chat
128K tokens·$1.25/1M
GPT-5 Codex
400K tokens·$1.25/1M
GPT-5 Image
400K tokens·$10.00/1M
GPT-5 Image Mini
400K tokens·$2.50/1M
GPT-5 Mini
400K tokens·$0.25/1M
GPT-5 mini (minimal)
$0.25/1M
GPT-5 Nano
400K tokens·$0.05/1M
GPT-5 nano (minimal)
$0.05/1M
GPT-5 Pro
400K tokens·$15.00/1M
GPT-5.1
400K tokens·$1.25/1M
GPT-5.1 Chat
128K tokens·$1.25/1M
GPT-5.1-Codex
400K tokens·$1.25/1M
GPT-5.1-Codex-Max
400K tokens·$1.25/1M
GPT-5.1-Codex-Mini
400K tokens·$0.25/1M
GPT-5.2
400K tokens·$1.75/1M
GPT-5.2 Chat
128K tokens·$1.75/1M
GPT-5.2 Pro
400K tokens·$21.00/1M
GPT-5.2-Codex
400K tokens·$1.75/1M
GPT-5.3 Chat
128K tokens·$1.75/1M
GPT-5.3-Codex
400K tokens·$1.75/1M
GPT-5.4
1.1M tokens·$2.50/1M
GPT-5.4 Image 2
272K tokens·$8.00/1M
GPT-5.4 Mini
400K tokens·$0.75/1M
GPT-5.4 Nano
400K tokens·$0.20/1M
GPT-5.4 Pro
1.1M tokens·$30.00/1M
GPT-5.5
1.1M tokens·$5.00/1M
GPT-5.5 Instant (May 2026)
$5.00/1M
GPT-5.5 Pro
1.1M tokens00
gpt-oss-120b
131K tokens·$0.15/1M
gpt-oss-20b
131K tokens·$0.05/1M
gpt-oss-safeguard-20b
131K tokens·$0.07/1M
o1
200K tokens·$15.00/1M
o1-mini
0
o1-preview
$16.50/1M
o1-pro
200K tokens·$150.00/1M
o3
200K tokens·$2.00/1M
o3 Deep Research
200K tokens·$10.00/1M
o3 Mini
200K tokens·$1.10/1M
o3 Mini High
200K tokens·$1.10/1M
o3 Pro
200K tokens·$20.00/1M
o4 Mini
200K tokens·$1.10/1M
o4 Mini Deep Research
200K tokens·$2.00/1M
o4 Mini High
200K tokens·$1.10/1M
OpenAI: GPT-3.5 Turbo 16k
16K tokens·$3.00/1M
OpenAI: GPT-4
8K tokens·$30.00/1M
OpenAI: GPT-4 Turbo (older v1106)
128K tokens·$10.00/1M
OpenAI: GPT-4o
128K tokens·$2.50/1M
OpenAI: GPT-4o (2024-05-13)
128K tokens·$5.00/1M
OpenAI: GPT-4o-mini
128K tokens·$0.15/1M
Sora

OpenBMB

1 modelo
ModeloOS
MiniCPM-V 4.6 1.3B
0

anthropic

4 modelos
ModeloOS
Claude 3.5
Claude Opus 4.8
Opus 4.7
Opus 4.8

google

1 modelo
ModeloOS
Gemini 3.5

mistral

1 modelo
ModeloOS
Mistral

Guia de Modelos de IA em 2026

O ecossistema de modelos de inteligência artificial em 2026 é dominado por quatro grandes famílias: GPT da OpenAI, Claude da Anthropic, Gemini do Google e Llama da Meta. Cada família tem modelos em diferentes tamanhos e especializações, com preços e capacidades variadas para diferentes casos de uso.

Família GPT (OpenAI)

A OpenAI oferece a linha GPT-4o como modelo principal, com variantes de diferentes custos e velocidades. GPT-4o-mini é a opção mais acessível com excelente custo-benefício. A API da OpenAI é a mais amplamente suportada por ferramentas e integrações de terceiros, tornando-a a escolha default para muitas aplicações.

Família Claude (Anthropic)

Anthropic posiciona Claude com foco em segurança e seguir instruções complexas. Claude Opus é o modelo mais capaz da linha, com context window de 200K tokens — ideal para análise de documentos longos. Claude Haiku é a opção mais rápida e barata. A Anthropic tem forte presença em casos de uso empresariais e compliance-sensíveis.

Família Gemini (Google)

Gemini é notável pelo context window de 1 milhão de tokens — o maior entre modelos comerciais — e pela integração nativa com o ecossistema Google (Search, Workspace, Cloud). Gemini Flash é a opção mais acessível com velocidade excepcional.

Open Source: Llama, Qwen e DeepSeek

O segmento open source avançou significativamente. Meta AI lançou Llama 4 com performance competitiva em certas tarefas. Alibaba mantém a família Qwen com foco em multilingual, incluindo melhor suporte a português. DeepSeek surpreendeu com performance frontier a custo substancialmente menor que modelos proprietários equivalentes.

Perguntas Frequentes

Qual a diferença entre GPT-4o e Claude Opus?

GPT-4o da OpenAI e Claude Opus da Anthropic são ambos modelos frontier com capacidades similares. GPT-4o tem melhor velocidade e integração com o ecossistema OpenAI. Claude Opus se destaca em tarefas com contexto longo e raciocínio complexo.

O que é context window em modelos de IA?

Context window é a quantidade máxima de texto que o modelo pode processar em uma única requisição, medida em tokens (aproximadamente 4 caracteres por token em inglês, 2-3 em português). Modelos com context window maior podem analisar documentos completos e bases de código extensas.

Qual modelo de IA é open source?

Modelos open source incluem Llama (Meta), Qwen (Alibaba), Mistral, DeepSeek e Gemma (Google). São disponibilizados sob licenças que permitem uso, modificação e deploy próprio, sem dependência de API paga.

Como funcionam os preços por token?

LLMs cobram por tokens processados — separado por tokens de input (o que você envia) e output (o que o modelo gera). Preços em US$ por 1 milhão de tokens. Tokens de output geralmente custam 3-5x mais que tokens de input.

Explorar