Lista completa de modelos de IA

2026-07-16

Kimi K3Saiba mais

Kimi / Moonshot AI

Parameters

Context Length

1,048,576 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,048,576 tokens

Max Output:128,000 tokens

Price:$0.003/1K input, $0.015/1K output

2026-07-16

Muse Spark 1.1Saiba mais

Meta AI

Parameters

Context Length

1,048,576 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,048,576 tokens

Max Output:128,000 tokens

Price:$0.00125/1K input, $0.00425/1K output

2026-07-10

KAT Coder Pro V2.5Saiba mais

KwaiPilot

Parameters

Context Length

256,000 tokens

Response Speed

Input Speed

Chat

Código

Max Input:256,000 tokens

Max Output:80,000 tokens

Price:$0.00074/1K input, $0.00296/1K output

2026-07-10

KAT Coder Air V2.5Saiba mais

KwaiPilot

Parameters

Context Length

256,000 tokens

Response Speed

Input Speed

Chat

Código

Max Input:256,000 tokens

Max Output:80,000 tokens

Price:$0.00015/1K input, $0.0006/1K output

2026-07-09

GPT 5.6 SolSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.005/1K input, $0.03/1K output

2026-07-09

GPT 5.6 Sol ProSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.005/1K input, $0.03/1K output

2026-07-09

GPT 5.6 TerraSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.0025/1K input, $0.015/1K output

2026-07-09

GPT 5.6 Terra ProSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.0025/1K input, $0.015/1K output

2026-07-09

GPT 5.6 Luna ProSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.001/1K input, $0.006/1K output

2026-07-09

GPT 5.6 LunaSaiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.001/1K input, $0.006/1K output

2026-07-08

Grok 4.5Saiba mais

xAI

Parameters

Context Length

500,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:500,000 tokens

Max Output:128,000 tokens

Price:$0.002/1K input, $0.006/1K output

2026-07-07

Aion 3.0Saiba mais

AionLabs

Parameters

Context Length

131,072 tokens

Response Speed

Input Speed

Chat

Max Input:131,072 tokens

Max Output:32,768 tokens

Price:$0.003/1K input, $0.006/1K output

2026-07-07

Aion 3.0 MiniSaiba mais

AionLabs

Parameters

Context Length

131,072 tokens

Response Speed

Input Speed

Chat

Max Input:131,072 tokens

Max Output:32,768 tokens

Price:$0.0007/1K input, $0.0014/1K output

2026-07-06

Tencent Hy3Saiba mais

Tencent

Parameters

295B / 21B active

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Código

Max Input:262,144 tokens

Max Output:131,072 tokens

Price:$0.0002/1K input, $0.0008/1K output

2026-07-02

Poolside Laguna XS 2.1Saiba mais

Poolside

Parameters

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Código

Max Input:262,144 tokens

Max Output:32,768 tokens

Price:$0.00006/1K input, $0.00012/1K output

2026-06-30

Claude Sonnet 5Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.002/1K input, $0.01/1K output

2026-06-30

Nano Banana 2 LiteSaiba mais

Google AI

Parameters

Context Length

65,536 tokens

Response Speed

Input Speed

Visão

Max Input:65,536 tokens

Max Output:65,536 tokens

Price:$0.00025/1K input, $0.0015/1K output

2026-06-24

Nex N2 MiniSaiba mais

Nex AGI

Parameters

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:262,144 tokens

Price:$0.000025/1K input, $0.0001/1K output

2026-06-24

Fugu UltraSaiba mais

Sakana AI

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.005/1K input, $0.03/1K output

2026-06-16

GLM 5.2Saiba mais

GLM / Z.ai

Parameters

Context Length

1,048,576 tokens

Response Speed

Input Speed

Chat

Código

Max Input:1,048,576 tokens

Max Output:131,072 tokens

Price:$0.0009016/1K input, $0.0028336/1K output

2026-06-12

Kimi K2.7 CodeSaiba mais

Kimi / Moonshot AI

Parameters

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:262,144 tokens

Price:$0.00075/1K input, $0.0035/1K output

2026-06-09

Claude Fable 5Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.01/1K input, $0.05/1K output

2026-06-08

Nex N2 ProSaiba mais

Nex AGI

Parameters

397B / 17B active

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:262,144 tokens

Price:$0.00025/1K input, $0.001/1K output

2026-06-04

Nemotron 3 UltraSaiba mais

NVIDIA

Parameters

550B / 55B active

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.0006/1K input, $0.0036/1K output

2026-06-03

Qwen3.7 PlusSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:65,536 tokens

Price:$0.00032/1K input, $0.00128/1K output

2026-05-31

MiniMax M3Saiba mais

MiniMax

Parameters

Context Length

1,048,576 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,048,576 tokens

Max Output:512,000 tokens

Price:$0.0003/1K input, $0.0012/1K output

2026-05-28

Claude Opus 4.8Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.005/1K input, $0.025/1K output

2026-05-28

Step 3.7 FlashSaiba mais

StepFun

Parameters

196B MoE

Context Length

256,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:256,000 tokens

Max Output:256,000 tokens

Price:$0.0002/1K input, $0.00115/1K output

2026-05-27

Claude Opus 4.8 FastSaiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.01/1K input, $0.05/1K output

2026-05-21

Qwen3.7 MaxSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Código

Max Input:1,000,000 tokens

Max Output:65,536 tokens

Price:$0.001475/1K input, $0.004425/1K output

2026-05-19

Gemini 3.5 FlashSaiba mais

Google AI

Parameters

Context Length

1,048,576 tokens

Response Speed

200 tokens/s

Input Speed

20000 tokens/s

Chat

Visão

Código

Max Input:1,048,576 tokens

Max Output:65,536 tokens

Price:$0.0015/1K input, $0.009/1K output

2026-05-19

Grok Build 0.1Saiba mais

xAI

Parameters

Context Length

256,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:256,000 tokens

Max Output:65,536 tokens

Price:$0.001/1K input, $0.002/1K output

2026-05-12

Perceptron Mk1Saiba mais

Perceptron

Parameters

Context Length

32,768 tokens

Response Speed

Input Speed

Chat

Visão

Max Input:32,768 tokens

Max Output:8,192 tokens

Price:$0.00015/1K input, $0.0015/1K output

2026-05-08

Ring 2.6 1TSaiba mais

InclusionAI

Parameters

1T / 63B active

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.000075/1K input, $0.000625/1K output

2026-05-07

Gemini 3.1 Flash-LiteSaiba mais

Google AI

Parameters

Context Length

1,048,576 tokens

Response Speed

Input Speed

Chat

Visão

Max Input:1,048,576 tokens

Max Output:65,536 tokens

Price:$0.00025/1K input, $0.0015/1K output

2026-04-30

Mistral Medium 3.5Saiba mais

Mistral

Parameters

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.0015/1K input, $0.0075/1K output

2026-04-30

Grok 4.3Saiba mais

xAI

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:128,000 tokens

Price:$0.00125/1K input, $0.0025/1K output

2026-04-30

Granite 4.1 8BSaiba mais

IBM

Parameters

Context Length

131,072 tokens

Response Speed

Input Speed

Chat

Código

Max Input:131,072 tokens

Max Output:131,072 tokens

Price:$0.00005/1K input, $0.0001/1K output

2026-04-28

Poolside Laguna M.1Saiba mais

Poolside

Parameters

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Código

Max Input:262,144 tokens

Max Output:32,768 tokens

Price:$0.0002/1K input, $0.0004/1K output

2026-04-27

Qwen3.6 FlashSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:65,536 tokens

Price:$0.0001875/1K input, $0.001125/1K output

2026-04-27

Qwen3.6 27BSaiba mais

Qwen / Alibaba Cloud

Parameters

27B

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00045/1K input, $0.0027/1K output

2026-04-25

GPT 5.5Saiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

130 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.0025/1K input, $0.015/1K output

2026-04-24

DeepSeek V4 ProSaiba mais

DeepSeek

Parameters

1.6T / 49B active

Context Length

1,000,000 tokens

Response Speed

130 tokens/s

Input Speed

14000 tokens/s

Chat

Código

Max Input:1,000,000 tokens

Max Output:65,536 tokens

Price:$0.000435/1K input, $0.00087/1K output

2026-04-24

DeepSeek V4 FlashSaiba mais

DeepSeek

Parameters

284B / 13B active

Context Length

1,000,000 tokens

Response Speed

180 tokens/s

Input Speed

18000 tokens/s

Chat

Código

Max Input:1,000,000 tokens

Max Output:65,536 tokens

Price:$0.0001/1K input, $0.0003/1K output

2026-04-22

Claude Opus 4.7Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

105 tokens/s

Input Speed

8000 tokens/s

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:64,000 tokens

Price:$0.005/1K input, $0.025/1K output

2026-04-21

GPT Image 2Saiba mais

OpenAI

Parameters

Context Length

Response Speed

Input Speed

Max Input:-

Max Output:-

Price:$0.01/1K input, $0.04/1K output

2026-04-21

GPT-5.4 Image 2Saiba mais

OpenAI

Parameters

Context Length

272,000 tokens

Response Speed

Input Speed

Visão

Max Input:-

Max Output:-

Price:$0.008/1K input, $0.015/1K output

2026-04-20

Kimi K2.6Saiba mais

Kimi / Moonshot AI

Parameters

15T continued pretraining

Context Length

262,144 tokens

Response Speed

180 tokens/s

Input Speed

19000 tokens/s

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00045/1K input, $0.0022/1K output

2026-04-18

Qwen3.6-35B-A3BSaiba mais

Qwen / Alibaba Cloud

Parameters

35B / 3B active

Context Length

262,144 tokens

Response Speed

200 tokens/s

Input Speed

19000 tokens/s

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.0001625/1K input, $0.0013/1K output

2026-04-15

MiniMax M2.7Saiba mais

MiniMax

Parameters

Context Length

196,608 tokens

Response Speed

180 tokens/s

Input Speed

18000 tokens/s

Chat

Código

Max Input:196,608 tokens

Max Output:65,536 tokens

Price:$0.000295/1K input, $0.0012/1K output

2026-03-31

Grok 4.20Saiba mais

xAI

Parameters

Context Length

2,000,000 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:2,000,000 tokens

Max Output:128,000 tokens

Price:$0.00125/1K input, $0.0025/1K output

2026-03-31

Grok 4.20 Multi-AgentSaiba mais

xAI

Parameters

Context Length

2,000,000 tokens

Response Speed

Input Speed

Chat

Código

Max Input:2,000,000 tokens

Max Output:128,000 tokens

Price:$0.00125/1K input, $0.0025/1K output

2026-03-16

Mistral Small 4Saiba mais

Mistral

Parameters

119B / 6.5B active

Context Length

262,144 tokens

Response Speed

Input Speed

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00015/1K input, $0.0006/1K output

2026-03-15

GLM 5 TurboSaiba mais

GLM / Z.ai

Parameters

Context Length

202,752 tokens

Response Speed

180 tokens/s

Input Speed

16000 tokens/s

Chat

Código

Max Input:202,752 tokens

Max Output:65,536 tokens

Price:$0.00096/1K input, $0.0032/1K output

2026-03-05

GPT 5.4Saiba mais

OpenAI

Parameters

Context Length

1,050,000 tokens

Response Speed

120 tokens/s

Input Speed

11000 tokens/s

Chat

Visão

Código

Max Input:1,050,000 tokens

Max Output:128,000 tokens

Price:$0.0025/1K input, $0.015/1K output

2026-03-05

GPT 5.4 MiniSaiba mais

OpenAI

Parameters

Context Length

400,000 tokens

Response Speed

180 tokens/s

Input Speed

15000 tokens/s

Chat

Visão

Código

Max Input:400,000 tokens

Max Output:128,000 tokens

Price:$0.00075/1K input, $0.0045/1K output

2026-03-05

GPT 5.4 NanoSaiba mais

OpenAI

Parameters

Context Length

400,000 tokens

Response Speed

220 tokens/s

Input Speed

18000 tokens/s

Chat

Código

Max Input:400,000 tokens

Max Output:128,000 tokens

Price:$0.0002/1K input, $0.00125/1K output

2026-02-19

Gemini 3.1 Pro PreviewSaiba mais

Google AI

Parameters

Context Length

1,048,576 tokens

Response Speed

140 tokens/s

Input Speed

14000 tokens/s

Chat

Visão

Código

Max Input:1,048,576 tokens

Max Output:65,536 tokens

Price:$0.002/1K input, $0.012/1K output

2026-02-18

Qwen3.5-35B-A3BSaiba mais

Qwen / Alibaba Cloud

Parameters

35B / 3B active

Context Length

262,144 tokens

Response Speed

190 tokens/s

Input Speed

18000 tokens/s

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.0001625/1K input, $0.0013/1K output

2026-02-18

Qwen3.5-9BSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

262,144 tokens

Response Speed

240 tokens/s

Input Speed

24000 tokens/s

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.0001/1K input, $0.00015/1K output

2026-02-17

Claude Sonnet 4.6Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

120 tokens/s

Input Speed

9000 tokens/s

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:64,000 tokens

Price:$0.003/1K input, $0.015/1K output

2026-02-12

Nano Banana 2Saiba mais

Google AI

Parameters

Context Length

Response Speed

Input Speed

Visão

Max Input:-

Max Output:-

Price:$0.0005/1K input, $0.003/1K output

2026-02-12

MiniMax M2.5Saiba mais

MiniMax

Parameters

Context Length

196,608 tokens

Response Speed

170 tokens/s

Input Speed

17000 tokens/s

Chat

Código

Max Input:196,608 tokens

Max Output:65,536 tokens

Price:$0.000295/1K input, $0.0012/1K output

2026-02-11

GLM 5Saiba mais

GLM / Z.ai

Parameters

Context Length

202,752 tokens

Response Speed

140 tokens/s

Input Speed

13000 tokens/s

Chat

Código

Max Input:202,752 tokens

Max Output:65,536 tokens

Price:$0.00072/1K input, $0.0023/1K output

2026-02-04

Claude Opus 4.6Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

100 tokens/s

Input Speed

8000 tokens/s

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:64,000 tokens

Price:$0.005/1K input, $0.025/1K output

2026-02-04

Qwen3 Coder NextSaiba mais

Qwen / Alibaba Cloud

Parameters

80B / 3B active

Context Length

262,144 tokens

Response Speed

220 tokens/s

Input Speed

22000 tokens/s

Chat

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00012/1K input, $0.00075/1K output

2026-02-01

GPT 5.3 CodexSaiba mais

OpenAI

Parameters

Context Length

400,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Código

Max Input:400,000 tokens

Max Output:128,000 tokens

Price:$0.00175/1K input, $0.014/1K output

2026-01-27

Kimi K2.5Saiba mais

Kimi / Moonshot AI

Parameters

15T continued pretraining

Context Length

262,144 tokens

Response Speed

170 tokens/s

Input Speed

18000 tokens/s

Chat

Visão

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00045/1K input, $0.0022/1K output

2026-01-19

GLM 4.7Saiba mais

GLM / Z.ai

Parameters

Context Length

202,752 tokens

Response Speed

160 tokens/s

Input Speed

15000 tokens/s

Chat

Código

Max Input:202,752 tokens

Max Output:65,536 tokens

Price:$0.0006/1K input, $0.0022/1K output

2026-01-19

GLM 4.7 FlashSaiba mais

GLM / Z.ai

Parameters

Context Length

202,752 tokens

Response Speed

220 tokens/s

Input Speed

22000 tokens/s

Chat

Código

Max Input:202,752 tokens

Max Output:65,536 tokens

Price:$0.00006/1K input, $0.0004/1K output

2025-12-17

Gemini 3 Flash PreviewSaiba mais

Google AI

Parameters

Context Length

1,048,576 tokens

Response Speed

190 tokens/s

Input Speed

18000 tokens/s

Chat

Visão

Max Input:1,048,576 tokens

Max Output:65,536 tokens

Price:$0.0005/1K input, $0.003/1K output

2025-12-11

GPT 5.2Saiba mais

OpenAI

Parameters

280B

Context Length

128,000 tokens

Response Speed

180 tokens/s

Input Speed

13000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:16,384 tokens

Price:$0.00025/1K input, $0.0005/1K output

2025-12-08

GLM 4.6VSaiba mais

GLM / Z.ai

Parameters

Context Length

131,072 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Código

Max Input:131,072 tokens

Max Output:65,536 tokens

Price:$0.0003/1K input, $0.0009/1K output

2025-11-20

Nano Banana ProSaiba mais

Google AI

Parameters

Context Length

65,536 tokens

Response Speed

Input Speed

Visão

Max Input:-

Max Output:-

Price:$0.002/1K input, $0.012/1K output

2025-11-06

Kimi K2 ThinkingSaiba mais

Kimi / Moonshot AI

Parameters

1T MoE / 32B active

Context Length

131,072 tokens

Response Speed

130 tokens/s

Input Speed

13000 tokens/s

Chat

Código

Max Input:131,072 tokens

Max Output:81,920 tokens

Price:$0.00047/1K input, $0.002/1K output

2025-11-01

GPT 5.1Saiba mais

OpenAI

Parameters

250B

Context Length

128,000 tokens

Response Speed

170 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:8,192 tokens

Price:$0.0002/1K input, $0.0004/1K output

2025-10-15

Claude Haiku 4.5Saiba mais

Anthropic

Parameters

Context Length

200,000 tokens

Response Speed

170 tokens/s

Input Speed

14000 tokens/s

Chat

Visão

Código

Max Input:200,000 tokens

Max Output:64,000 tokens

Price:$0.001/1K input, $0.005/1K output

2025-09-29

Claude Sonnet 4.5Saiba mais

Anthropic

Parameters

Context Length

1,000,000 tokens

Response Speed

120 tokens/s

Input Speed

9000 tokens/s

Chat

Visão

Código

Max Input:1,000,000 tokens

Max Output:64,000 tokens

Price:$0.003/1K input, $0.015/1K output

2025-08-26

Nano BananaSaiba mais

Google AI

Parameters

Context Length

Response Speed

Input Speed

Visão

Max Input:-

Max Output:-

Price:$0.0003/1K input, $0.0025/1K output

2025-08-21

DeepSeek V3.1Saiba mais

DeepSeek

Parameters

671B / 37B active

Context Length

131,072 tokens

Response Speed

180 tokens/s

Input Speed

18000 tokens/s

Chat

Código

Max Input:131,072 tokens

Max Output:65,536 tokens

Price:$0.00015/1K input, $0.00075/1K output

2025-08-09

GPT 5Saiba mais

OpenAI

Parameters

220B

Context Length

128,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2025-08-07

GPT 5 MiniSaiba mais

OpenAI

Parameters

Context Length

400,000 tokens

Response Speed

180 tokens/s

Input Speed

15000 tokens/s

Chat

Visão

Código

Max Input:400,000 tokens

Max Output:128,000 tokens

Price:$0.00025/1K input, $0.002/1K output

2025-08-07

GPT 5 NanoSaiba mais

OpenAI

Parameters

Context Length

400,000 tokens

Response Speed

220 tokens/s

Input Speed

18000 tokens/s

Chat

Código

Max Input:400,000 tokens

Max Output:64,000 tokens

Price:$0.00005/1K input, $0.0004/1K output

2025-08-05

gpt-oss-120bSaiba mais

OpenAI

Parameters

117B / 5.1B active

Context Length

131,072 tokens

Response Speed

170 tokens/s

Input Speed

15000 tokens/s

Chat

Código

Max Input:131,072 tokens

Max Output:65,536 tokens

Price:$0.000039/1K input, $0.00019/1K output

2025-07-25

Qwen3 235B A22B Thinking 2507Saiba mais

Qwen / Alibaba Cloud

Parameters

235B / 22B active

Context Length

262,144 tokens

Response Speed

130 tokens/s

Input Speed

13000 tokens/s

Chat

Código

Max Input:262,144 tokens

Max Output:81,920 tokens

Price:$0.00011/1K input, $0.0006/1K output

2025-07-23

Qwen3 Coder 480B A35BSaiba mais

Qwen / Alibaba Cloud

Parameters

480B / 35B active

Context Length

262,144 tokens

Response Speed

170 tokens/s

Input Speed

16000 tokens/s

Chat

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.00022/1K input, $0.001/1K output

2025-07-21

Qwen3 235B A22B Instruct 2507Saiba mais

Qwen / Alibaba Cloud

Parameters

235B / 22B active

Context Length

262,144 tokens

Response Speed

180 tokens/s

Input Speed

17000 tokens/s

Chat

Código

Max Input:262,144 tokens

Max Output:65,536 tokens

Price:$0.000071/1K input, $0.0001/1K output

2025-07-11

Kimi K2Saiba mais

Kimi / Moonshot AI

Parameters

1T MoE / 32B active

Context Length

131,072 tokens

Response Speed

200 tokens/s

Input Speed

20000 tokens/s

Chat

Código

Max Input:131,072 tokens

Max Output:65,536 tokens

Price:$0.00055/1K input, $0.0022/1K output

2025-06-17

Gemini 2.5 FlashSaiba mais

Google AI

Parameters

Context Length

1,048,576 tokens

Response Speed

180 tokens/s

Input Speed

18000 tokens/s

Chat

Visão

Max Input:1,048,576 tokens

Max Output:65,536 tokens

Price:$0.0003/1K input, $0.0025/1K output

2025-05-28

DeepSeek R1 0528Saiba mais

DeepSeek

Parameters

671B / 37B active

Context Length

163,840 tokens

Response Speed

120 tokens/s

Input Speed

12000 tokens/s

Chat

Código

Max Input:163,840 tokens

Max Output:65,536 tokens

Price:$0.00055/1K input, $0.00219/1K output

2025-05-21

Claude 4 SonnetSaiba mais

Anthropic

Parameters

230B

Context Length

200,000 tokens

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:128,000 tokens

Max Output:128,000 tokens

Price:$undefined/1K input, $undefined/1K output

2025-05-21

Claude 4 OpusSaiba mais

Anthropic

Parameters

230B

Context Length

200,000 tokens

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:128,000 tokens

Max Output:128,000 tokens

Price:$undefined/1K input, $undefined/1K output

2025-04-28

Qwen3 235B A22BSaiba mais

Qwen / Alibaba Cloud

Parameters

235B / 22B active

Context Length

131,072 tokens

Response Speed

160 tokens/s

Input Speed

15000 tokens/s

Chat

Código

Max Input:131,072 tokens

Max Output:65,536 tokens

Price:$0.000455/1K input, $0.00182/1K output

2025-04-16

o3Saiba mais

OpenAI

Parameters

Context Length

200,000 tokens

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Visão

Código

Max Input:200,000 tokens

Max Output:100,000 tokens

Price:$0.002/1K input, $0.008/1K output

2025-04-16

O4 MiniSaiba mais

OpenAI

Parameters

Context Length

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:200,000 tokens

Max Output:100,000 tokens

Price:$0.0011/1K input, $0.0044/1K output

2025-04-16

O4 Mini (High)Saiba mais

OpenAI

Parameters

Context Length

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:200,000 tokens

Max Output:100,000 tokens

Price:$0.0011/1K input, $0.0044/1K output

2025-04-10

GPT 4.1Saiba mais

OpenAI

Parameters

220B

Context Length

128,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2025-04-10

GPT 4.1 MiniSaiba mais

OpenAI

Parameters

220B

Context Length

128,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2025-04-10

GPT 4.1 NanoSaiba mais

OpenAI

Parameters

220B

Context Length

128,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2025-04-04

Llama 4 MaverickSaiba mais

Meta AI

Parameters

70B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.0001/1K input, $0.0002/1K output

2025-04-04

Llama 4 ScoutSaiba mais

Meta AI

Parameters

70B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.0001/1K input, $0.0002/1K output

2025-03-20

Gemini 2.5 ProSaiba mais

Google AI

Parameters

120B

Context Length

1,000,000 tokens

Response Speed

Input Speed

Chat

Max Input:1,000,000 tokens

Max Output:1,000,000 tokens

Price:-

2025-03-13

Cohere Command ASaiba mais

Cohere

Parameters

180B

Context Length

256,000 tokens

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:128,000 tokens

Max Output:128,000 tokens

Price:$0.0002/1K input, $0.0004/1K output

2025-01-31

O3 Mini (Medium)Saiba mais

OpenAI

Parameters

Context Length

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:200,000 tokens

Max Output:100,000 tokens

Price:$0.0011/1K input, $0.0044/1K output

2025-01-31

O3 Mini (High)Saiba mais

OpenAI

Parameters

Context Length

Response Speed

140 tokens/s

Input Speed

10000 tokens/s

Chat

Max Input:200,000 tokens

Max Output:100,000 tokens

Price:$0.0011/1K input, $0.0044/1K output

2025-01-30

Mistral Small 3Saiba mais

Mistral

Parameters

24B

Context Length

32,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2025-01-23

Deepseek R1 Distill Llama 70BSaiba mais

DeepSeek

Parameters

70B

Context Length

100,000 tokens

Response Speed

270 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2025-01-20

Deepseek R1Saiba mais

DeepSeek

Parameters

67B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.0001/1K input, $0.0002/1K output

2025-01-14

Minimax 01Saiba mais

MiniMax

Parameters

45.9B

Context Length

131,072 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2024-12-25

Deepseek V3Saiba mais

DeepSeek

Parameters

67B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.0001/1K input, $0.0002/1K output

2024-12-12

Phi 4Saiba mais

Microsoft

Parameters

14.7B

Context Length

131,072 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2024-12-06

Llama 3.3 70BSaiba mais

Meta AI

Parameters

70B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.0001/1K input, $0.0002/1K output

2024-09-19

Qwen 2.5 Coder 32BSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

Response Speed

Input Speed

Chat

Max Input:-

Max Output:-

Price:-

2024-09-19

Qwen 2.5 72BSaiba mais

Qwen / Alibaba Cloud

Parameters

72B

Context Length

131,072 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2024-09-19

Qwen 2.5 7BSaiba mais

Qwen / Alibaba Cloud

Parameters

Context Length

32,000 tokens

Response Speed

200 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:-

2024-08-01

FLUX.1-devSaiba mais

Black Forest Labs

Parameters

Context Length

Response Speed

Input Speed

Max Input:-

Max Output:-

Price:-

2024-07-23

Llama 3.1 8BSaiba mais

Meta AI

Parameters

Context Length

100,000 tokens

Response Speed

180 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.00005/1K input, $0.0001/1K output

2024-07-18

GPT 4o MiniSaiba mais

OpenAI

Parameters

220B

Context Length

128,000 tokens

Response Speed

150 tokens/s

Input Speed

12000 tokens/s

Chat

Visão

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2024-06-18

Mathstral 7BSaiba mais

Mistral

Parameters

Context Length

4,096 tokens

Response Speed

150 tokens/s

Input Speed

16000 tokens/s

Chat

Max Input:4,096 tokens

Max Output:4,096 tokens

Price:-

2024-04-30

Cohere Command R+Saiba mais

Cohere

Parameters

140B

Context Length

128,000 tokens

Response Speed

140 tokens/s

Input Speed

11000 tokens/s

Chat

Max Input:128,000 tokens

Max Output:4,096 tokens

Price:$0.00015/1K input, $0.0003/1K output

2024-03-30

Cohere Command RSaiba mais

Cohere

Parameters

60B

Context Length

32,000 tokens

Response Speed

170 tokens/s

Input Speed

15000 tokens/s

Chat

Max Input:32,000 tokens

Max Output:4,096 tokens

Price:$0.0001/1K input, $0.0002/1K output

2024-03-09

Llama 3.1 70BSaiba mais

Meta AI

Parameters

70B

Context Length

100,000 tokens

Response Speed

150 tokens/s

Input Speed

13000 tokens/s

Chat

Max Input:100,000 tokens

Max Output:100,000 tokens

Price:$0.00015/1K input, $0.0003/1K output