Lista completa de modelos de IA

2026-03-15

GLM 5 TurboSaiba mais

GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Código
Max Input:202,752 tokens
Max Output:65,536 tokens
Price:$0.00096/1K input, $0.0032/1K output
2026-03-12

Grok 4.20 Multi-Agent BetaSaiba mais

xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Código
Max Input:2,000,000 tokens
Max Output:128,000 tokens
Price:$0.002/1K input, $0.006/1K output
2026-03-05

GPT 5.4Saiba mais

OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
120 tokens/s
Input Speed
11000 tokens/s
Chat
Visão
Código
Max Input:1,050,000 tokens
Max Output:128,000 tokens
Price:$0.0025/1K input, $0.015/1K output
2026-03-05

GPT 5.4 ProSaiba mais

OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:1,050,000 tokens
Max Output:128,000 tokens
Price:$0.03/1K input, $0.18/1K output
2026-03-05

GPT 5.4 MiniSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Visão
Código
Max Input:400,000 tokens
Max Output:128,000 tokens
Price:$0.00075/1K input, $0.0045/1K output
2026-03-05

GPT 5.4 NanoSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:400,000 tokens
Max Output:128,000 tokens
Price:$0.0002/1K input, $0.00125/1K output
2026-03-03

Gemini 3.1 Flash Lite PreviewSaiba mais

Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Visão
Max Input:1,048,576 tokens
Max Output:65,536 tokens
Price:$0.00025/1K input, $0.0015/1K output
2026-02-19

Gemini 3.1 Pro PreviewSaiba mais

Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Visão
Código
Max Input:1,048,576 tokens
Max Output:65,536 tokens
Price:$0.002/1K input, $0.012/1K output
2026-02-18

Qwen3.5-35B-A3BSaiba mais

Qwen / Alibaba Cloud
Parameters
35B / 3B active
Context Length
262,144 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.0001625/1K input, $0.0013/1K output
2026-02-18

Qwen3.5-9BSaiba mais

Qwen / Alibaba Cloud
Parameters
9B
Context Length
262,144 tokens
Response Speed
240 tokens/s
Input Speed
24000 tokens/s
Chat
Visão
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.0001/1K input, $0.00015/1K output
2026-02-17

Claude Sonnet 4.6Saiba mais

Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Visão
Código
Max Input:1,000,000 tokens
Max Output:64,000 tokens
Price:$0.003/1K input, $0.015/1K output
2026-02-12

MiniMax M2.5Saiba mais

MiniMax
Parameters
-
Context Length
196,608 tokens
Response Speed
170 tokens/s
Input Speed
17000 tokens/s
Chat
Código
Max Input:196,608 tokens
Max Output:65,536 tokens
Price:$0.000295/1K input, $0.0012/1K output
2026-02-11

GLM 5Saiba mais

GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
140 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:202,752 tokens
Max Output:65,536 tokens
Price:$0.00072/1K input, $0.0023/1K output
2026-02-04

Claude Opus 4.6Saiba mais

Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
100 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:1,000,000 tokens
Max Output:64,000 tokens
Price:$0.005/1K input, $0.025/1K output
2026-02-04

Qwen3 Coder NextSaiba mais

Qwen / Alibaba Cloud
Parameters
80B / 3B active
Context Length
262,144 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.00012/1K input, $0.00075/1K output
2026-02-01

GPT 5.3 InstantSaiba mais

OpenAI
Parameters
-
Context Length
128,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Visão
Código
Max Input:128,000 tokens
Max Output:16,384 tokens
Price:$0.00175/1K input, $0.014/1K output
2026-02-01

GPT 5.3 CodexSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Código
Max Input:400,000 tokens
Max Output:128,000 tokens
Price:$0.00175/1K input, $0.014/1K output
2026-01-27

Kimi K2.5Saiba mais

Kimi / Moonshot AI
Parameters
15T continued pretraining
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.00045/1K input, $0.0022/1K output
2026-01-19

GLM 4.7Saiba mais

GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:202,752 tokens
Max Output:65,536 tokens
Price:$0.0006/1K input, $0.0022/1K output
2026-01-19

GLM 4.7 FlashSaiba mais

GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Código
Max Input:202,752 tokens
Max Output:65,536 tokens
Price:$0.00006/1K input, $0.0004/1K output
2025-12-17

Gemini 3 Flash PreviewSaiba mais

Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Max Input:1,048,576 tokens
Max Output:65,536 tokens
Price:$0.0005/1K input, $0.003/1K output
2025-12-11

GPT 5.2Saiba mais

OpenAI
Parameters
280B
Context Length
128,000 tokens
Response Speed
180 tokens/s
Input Speed
13000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:16,384 tokens
Price:$0.00025/1K input, $0.0005/1K output
2025-12-10

GPT 5.2 ProSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
110 tokens/s
Input Speed
11000 tokens/s
Chat
Visão
Código
Max Input:400,000 tokens
Max Output:128,000 tokens
Price:$0.021/1K input, $0.168/1K output
2025-12-08

GLM 4.6VSaiba mais

GLM / Z.ai
Parameters
-
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Código
Max Input:131,072 tokens
Max Output:65,536 tokens
Price:$0.0003/1K input, $0.0009/1K output
2025-11-06

Kimi K2 ThinkingSaiba mais

Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:131,072 tokens
Max Output:81,920 tokens
Price:$0.00047/1K input, $0.002/1K output
2025-11-01

GPT 5.1Saiba mais

OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:8,192 tokens
Price:$0.0002/1K input, $0.0004/1K output
2025-10-15

Claude Haiku 4.5Saiba mais

Anthropic
Parameters
-
Context Length
200,000 tokens
Response Speed
170 tokens/s
Input Speed
14000 tokens/s
Chat
Visão
Código
Max Input:200,000 tokens
Max Output:64,000 tokens
Price:$0.001/1K input, $0.005/1K output
2025-09-29

Claude Sonnet 4.5Saiba mais

Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Visão
Código
Max Input:1,000,000 tokens
Max Output:64,000 tokens
Price:$0.003/1K input, $0.015/1K output
2025-09-19

Grok 4 FastSaiba mais

xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
220 tokens/s
Input Speed
24000 tokens/s
Chat
Visão
Código
Max Input:2,000,000 tokens
Max Output:128,000 tokens
Price:$0.0002/1K input, $0.0005/1K output
2025-08-21

DeepSeek V3.1Saiba mais

DeepSeek
Parameters
671B / 37B active
Context Length
131,072 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:131,072 tokens
Max Output:65,536 tokens
Price:$0.00015/1K input, $0.00075/1K output
2025-08-09

GPT 5Saiba mais

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2025-08-07

GPT 5 MiniSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Visão
Código
Max Input:400,000 tokens
Max Output:128,000 tokens
Price:$0.00025/1K input, $0.002/1K output
2025-08-07

GPT 5 NanoSaiba mais

OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:400,000 tokens
Max Output:64,000 tokens
Price:$0.00005/1K input, $0.0004/1K output
2025-08-05

gpt-oss-120bSaiba mais

OpenAI
Parameters
117B / 5.1B active
Context Length
131,072 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:131,072 tokens
Max Output:65,536 tokens
Price:$0.000039/1K input, $0.00019/1K output
2025-07-25

Qwen3 235B A22B Thinking 2507Saiba mais

Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:262,144 tokens
Max Output:81,920 tokens
Price:$0.00011/1K input, $0.0006/1K output
2025-07-23

Qwen3 Coder 480B A35BSaiba mais

Qwen / Alibaba Cloud
Parameters
480B / 35B active
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
16000 tokens/s
Chat
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.00022/1K input, $0.001/1K output
2025-07-22

Gemini 2.5 Flash-LiteSaiba mais

Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Visão
Max Input:1,048,576 tokens
Max Output:65,536 tokens
Price:$0.0001/1K input, $0.0004/1K output
2025-07-21

Qwen3 235B A22B Instruct 2507Saiba mais

Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
180 tokens/s
Input Speed
17000 tokens/s
Chat
Código
Max Input:262,144 tokens
Max Output:65,536 tokens
Price:$0.000071/1K input, $0.0001/1K output
2025-07-11

Kimi K2Saiba mais

Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
200 tokens/s
Input Speed
20000 tokens/s
Chat
Código
Max Input:131,072 tokens
Max Output:65,536 tokens
Price:$0.00055/1K input, $0.0022/1K output
2025-07-09

Grok 4Saiba mais

xAI
Parameters
314B
Context Length
256,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Visão
Max Input:256,000 tokens
Max Output:128,000 tokens
Price:$0.00008/1K input, $0.00016/1K output
2025-06-17

Gemini 2.5 FlashSaiba mais

Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Max Input:1,048,576 tokens
Max Output:65,536 tokens
Price:$0.0003/1K input, $0.0025/1K output
2025-06-10

o3 ProSaiba mais

OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.02/1K input, $0.08/1K output
2025-06-10

Grok 3Saiba mais

xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$undefined/1K input, $undefined/1K output
2025-06-10

Grok 3 MiniSaiba mais

xAI
Parameters
100B
Context Length
131,072 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$undefined/1K input, $undefined/1K output
2025-05-28

DeepSeek R1 0528Saiba mais

DeepSeek
Parameters
671B / 37B active
Context Length
163,840 tokens
Response Speed
120 tokens/s
Input Speed
12000 tokens/s
Chat
Código
Max Input:163,840 tokens
Max Output:65,536 tokens
Price:$0.00055/1K input, $0.00219/1K output
2025-05-21

Claude 4 SonnetSaiba mais

Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$undefined/1K input, $undefined/1K output
2025-05-21

Claude 4 OpusSaiba mais

Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$undefined/1K input, $undefined/1K output
2025-04-28

Qwen3 235B A22BSaiba mais

Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
131,072 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:131,072 tokens
Max Output:65,536 tokens
Price:$0.000455/1K input, $0.00182/1K output
2025-04-16

o3Saiba mais

OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Visão
Código
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.002/1K input, $0.008/1K output
2025-04-16

O4 MiniSaiba mais

OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.0011/1K input, $0.0044/1K output
2025-04-16

O4 Mini (High)Saiba mais

OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.0011/1K input, $0.0044/1K output
2025-04-10

GPT 4.1Saiba mais

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2025-04-10

GPT 4.1 MiniSaiba mais

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2025-04-10

GPT 4.1 NanoSaiba mais

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2025-04-04

Llama 4 MaverickSaiba mais

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2025-04-04

Llama 4 ScoutSaiba mais

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2025-03-20

Gemini 2.5 ProSaiba mais

Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
-
Input Speed
-
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:-
2025-03-13

Cohere Command ASaiba mais

Cohere
Parameters
180B
Context Length
256,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2025-02-26

Imagen 3.0Saiba mais

Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Max Input:1,000,000 tokens
Max Output:1 tokens
Price:-
2025-01-31

O3 Mini (Medium)Saiba mais

OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.0011/1K input, $0.0044/1K output
2025-01-31

O3 Mini (High)Saiba mais

OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.0011/1K input, $0.0044/1K output
2025-01-30

Mistral Small 3Saiba mais

Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-28

Qwen 2.5 MaxSaiba mais

Qwen / Alibaba Cloud
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-23

Deepseek R1 Distill Llama 70BSaiba mais

DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-20

Deepseek R1Saiba mais

DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2025-01-14

Minimax 01Saiba mais

MiniMax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-12-25

Deepseek V3Saiba mais

DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-12-12

Grok 2 VisionSaiba mais

xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-12-12

Phi 4Saiba mais

Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-12-11

Gemini 2.0 FlashSaiba mais

Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-12-06

Llama 3.3 70BSaiba mais

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-11-18

Pixtral LargeSaiba mais

Mistral
Parameters
12B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:-
2024-09-19

Qwen 2.5 Coder 32BSaiba mais

Qwen / Alibaba Cloud
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:-
Max Output:-
Price:-
2024-09-19

Qwen 2.5 72BSaiba mais

Qwen / Alibaba Cloud
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-09-19

Qwen 2.5 7BSaiba mais

Qwen / Alibaba Cloud
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-09-05

Reflection 70BSaiba mais

Matt Shumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-08-01

Grok 2Saiba mais

xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-08-01

FLUX.1-devSaiba mais

Black Forest Labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:-
Max Output:-
Price:-
2024-07-23

Llama 3.1 405BSaiba mais

Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-07-23

Llama 3.1 8BSaiba mais

Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-07-18

GPT 4o MiniSaiba mais

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-06-18

Mathstral 7BSaiba mais

Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:4,096 tokens
Max Output:4,096 tokens
Price:-
2024-06-18

Mistral NemoSaiba mais

Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:32,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-04-30

Cohere Command R+Saiba mais

Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-04-18

Llama 3 8BSaiba mais

Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-03-30

Cohere Command RSaiba mais

Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:4,096 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-03-09

Llama 3.1 70BSaiba mais

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-02-27

Llama 3 70BSaiba mais

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-02-25

Mistral LargeSaiba mais

Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-02-07

Cohere CommandSaiba mais

Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0002/1K input, $0.0004/1K output
2023-09-01

DALL-E 3Saiba mais

OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 tokens/s
Input Speed
8000 tokens/s
Max Input:4,000 tokens
Max Output:1 tokens
Price:$0.04/1K input, $0.08/1K output
2023-09-01

Mistral 7BSaiba mais

Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:32,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2023-07-01

Llama 2Saiba mais

Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:4,096 tokens
Max Output:4,096 tokens
Price:$0.0001/1K input, $0.0002/1K output
2022-04-01

DALL-E 2Saiba mais

OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 tokens/s
Input Speed
5000 tokens/s
Max Input:1,000 tokens
Max Output:1 tokens
Price:$0.016/1K input, $0.02/1K output