Lunabot
Português
🇺🇸 English
🇨🇳 简体中文
🇭🇰 繁體中文
🇪🇸 Español
🇯🇵 日本語
🇫🇷 Français
🇰🇷 한국어
🇩🇪 Deutsch
🇮🇹 Italiano
🇦🇪 العربية
🇵🇹 Português
🇷🇺 Русский
Aplicativo
Extensão do Navegador
Extensão do Chrome
Extensão do Edge
Extensão do Firefox
Aplicativo Móvel
Aplicativo para iOS
Aplicativo para Android
Bot do Telegram
Atalho da Siri
Aplicativo de Desktop
Aplicativo para macOS
Aplicativo para Windows
Aplicativo Web
Form GPT
Biblioteca de prompts
Ajuda
Preços
Oferta anual
Iniciar Lunabot Web
Lista completa de modelos de IA
2026-03-15
GLM 5 Turbo
Saiba mais
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Código
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00096/1K input, $0.0032/1K output
2026-03-12
Grok 4.20 Multi-Agent Beta
Saiba mais
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Código
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.002/1K input, $0.006/1K output
2026-03-05
GPT 5.4
Saiba mais
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
120 tokens/s
Input Speed
11000 tokens/s
Chat
Visão
Código
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.0025/1K input, $0.015/1K output
2026-03-05
GPT 5.4 Pro
Saiba mais
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.03/1K input, $0.18/1K output
2026-03-05
GPT 5.4 Mini
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Visão
Código
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00075/1K input, $0.0045/1K output
2026-03-05
GPT 5.4 Nano
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.00125/1K output
2026-03-03
Gemini 3.1 Flash Lite Preview
Saiba mais
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Visão
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.00025/1K input, $0.0015/1K output
2026-02-19
Gemini 3.1 Pro Preview
Saiba mais
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Visão
Código
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.002/1K input, $0.012/1K output
2026-02-18
Qwen3.5-35B-A3B
Saiba mais
Qwen / Alibaba Cloud
Parameters
35B / 3B active
Context Length
262,144 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001625/1K input, $0.0013/1K output
2026-02-18
Qwen3.5-9B
Saiba mais
Qwen / Alibaba Cloud
Parameters
9B
Context Length
262,144 tokens
Response Speed
240 tokens/s
Input Speed
24000 tokens/s
Chat
Visão
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.00015/1K output
2026-02-17
Claude Sonnet 4.6
Saiba mais
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Visão
Código
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2026-02-12
MiniMax M2.5
Saiba mais
MiniMax
Parameters
-
Context Length
196,608 tokens
Response Speed
170 tokens/s
Input Speed
17000 tokens/s
Chat
Código
Max Input:
196,608 tokens
Max Output:
65,536 tokens
Price:
$0.000295/1K input, $0.0012/1K output
2026-02-11
GLM 5
Saiba mais
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
140 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00072/1K input, $0.0023/1K output
2026-02-04
Claude Opus 4.6
Saiba mais
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
100 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.005/1K input, $0.025/1K output
2026-02-04
Qwen3 Coder Next
Saiba mais
Qwen / Alibaba Cloud
Parameters
80B / 3B active
Context Length
262,144 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00012/1K input, $0.00075/1K output
2026-02-01
GPT 5.3 Instant
Saiba mais
OpenAI
Parameters
-
Context Length
128,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Visão
Código
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-02-01
GPT 5.3 Codex
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Código
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-01-27
Kimi K2.5
Saiba mais
Kimi / Moonshot AI
Parameters
15T continued pretraining
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00045/1K input, $0.0022/1K output
2026-01-19
GLM 4.7
Saiba mais
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.0006/1K input, $0.0022/1K output
2026-01-19
GLM 4.7 Flash
Saiba mais
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Código
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00006/1K input, $0.0004/1K output
2025-12-17
Gemini 3 Flash Preview
Saiba mais
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0005/1K input, $0.003/1K output
2025-12-11
GPT 5.2
Saiba mais
OpenAI
Parameters
280B
Context Length
128,000 tokens
Response Speed
180 tokens/s
Input Speed
13000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00025/1K input, $0.0005/1K output
2025-12-10
GPT 5.2 Pro
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
110 tokens/s
Input Speed
11000 tokens/s
Chat
Visão
Código
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.021/1K input, $0.168/1K output
2025-12-08
GLM 4.6V
Saiba mais
GLM / Z.ai
Parameters
-
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Código
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0009/1K output
2025-11-06
Kimi K2 Thinking
Saiba mais
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:
131,072 tokens
Max Output:
81,920 tokens
Price:
$0.00047/1K input, $0.002/1K output
2025-11-01
GPT 5.1
Saiba mais
OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
8,192 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-10-15
Claude Haiku 4.5
Saiba mais
Anthropic
Parameters
-
Context Length
200,000 tokens
Response Speed
170 tokens/s
Input Speed
14000 tokens/s
Chat
Visão
Código
Max Input:
200,000 tokens
Max Output:
64,000 tokens
Price:
$0.001/1K input, $0.005/1K output
2025-09-29
Claude Sonnet 4.5
Saiba mais
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Visão
Código
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2025-09-19
Grok 4 Fast
Saiba mais
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
220 tokens/s
Input Speed
24000 tokens/s
Chat
Visão
Código
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0005/1K output
2025-08-21
DeepSeek V3.1
Saiba mais
DeepSeek
Parameters
671B / 37B active
Context Length
131,072 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00015/1K input, $0.00075/1K output
2025-08-09
GPT 5
Saiba mais
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-08-07
GPT 5 Mini
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Visão
Código
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00025/1K input, $0.002/1K output
2025-08-07
GPT 5 Nano
Saiba mais
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Max Input:
400,000 tokens
Max Output:
64,000 tokens
Price:
$0.00005/1K input, $0.0004/1K output
2025-08-05
gpt-oss-120b
Saiba mais
OpenAI
Parameters
117B / 5.1B active
Context Length
131,072 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000039/1K input, $0.00019/1K output
2025-07-25
Qwen3 235B A22B Thinking 2507
Saiba mais
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Código
Max Input:
262,144 tokens
Max Output:
81,920 tokens
Price:
$0.00011/1K input, $0.0006/1K output
2025-07-23
Qwen3 Coder 480B A35B
Saiba mais
Qwen / Alibaba Cloud
Parameters
480B / 35B active
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
16000 tokens/s
Chat
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00022/1K input, $0.001/1K output
2025-07-22
Gemini 2.5 Flash-Lite
Saiba mais
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Visão
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.0004/1K output
2025-07-21
Qwen3 235B A22B Instruct 2507
Saiba mais
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
180 tokens/s
Input Speed
17000 tokens/s
Chat
Código
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.000071/1K input, $0.0001/1K output
2025-07-11
Kimi K2
Saiba mais
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
200 tokens/s
Input Speed
20000 tokens/s
Chat
Código
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.0022/1K output
2025-07-09
Grok 4
Saiba mais
xAI
Parameters
314B
Context Length
256,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Código
Visão
Max Input:
256,000 tokens
Max Output:
128,000 tokens
Price:
$0.00008/1K input, $0.00016/1K output
2025-06-17
Gemini 2.5 Flash
Saiba mais
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Visão
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0025/1K output
2025-06-10
o3 Pro
Saiba mais
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Visão
Código
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.02/1K input, $0.08/1K output
2025-06-10
Grok 3
Saiba mais
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-06-10
Grok 3 Mini
Saiba mais
xAI
Parameters
100B
Context Length
131,072 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-28
DeepSeek R1 0528
Saiba mais
DeepSeek
Parameters
671B / 37B active
Context Length
163,840 tokens
Response Speed
120 tokens/s
Input Speed
12000 tokens/s
Chat
Código
Max Input:
163,840 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.00219/1K output
2025-05-21
Claude 4 Sonnet
Saiba mais
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-21
Claude 4 Opus
Saiba mais
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-04-28
Qwen3 235B A22B
Saiba mais
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
131,072 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Código
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000455/1K input, $0.00182/1K output
2025-04-16
o3
Saiba mais
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Visão
Código
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.002/1K input, $0.008/1K output
2025-04-16
O4 Mini
Saiba mais
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-16
O4 Mini (High)
Saiba mais
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-10
GPT 4.1
Saiba mais
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Mini
Saiba mais
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Nano
Saiba mais
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-04
Llama 4 Maverick
Saiba mais
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-04-04
Llama 4 Scout
Saiba mais
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-03-20
Gemini 2.5 Pro
Saiba mais
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
-
Input Speed
-
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
-
2025-03-13
Cohere Command A
Saiba mais
Cohere
Parameters
180B
Context Length
256,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-02-26
Imagen 3.0
Saiba mais
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Max Input:
1,000,000 tokens
Max Output:
1 tokens
Price:
-
2025-01-31
O3 Mini (Medium)
Saiba mais
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-31
O3 Mini (High)
Saiba mais
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-30
Mistral Small 3
Saiba mais
Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-28
Qwen 2.5 Max
Saiba mais
Qwen / Alibaba Cloud
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-23
Deepseek R1 Distill Llama 70B
Saiba mais
DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-20
Deepseek R1
Saiba mais
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-01-14
Minimax 01
Saiba mais
MiniMax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-25
Deepseek V3
Saiba mais
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-12-12
Grok 2 Vision
Saiba mais
xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-12-12
Phi 4
Saiba mais
Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-11
Gemini 2.0 Flash
Saiba mais
Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-12-06
Llama 3.3 70B
Saiba mais
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-11-18
Pixtral Large
Saiba mais
Mistral
Parameters
12B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2024-09-19
Qwen 2.5 Coder 32B
Saiba mais
Qwen / Alibaba Cloud
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:
-
Max Output:
-
Price:
-
2024-09-19
Qwen 2.5 72B
Saiba mais
Qwen / Alibaba Cloud
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-19
Qwen 2.5 7B
Saiba mais
Qwen / Alibaba Cloud
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-05
Reflection 70B
Saiba mais
Matt Shumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-08-01
Grok 2
Saiba mais
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-08-01
FLUX.1-dev
Saiba mais
Black Forest Labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:
-
Max Output:
-
Price:
-
2024-07-23
Llama 3.1 405B
Saiba mais
Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-07-23
Llama 3.1 8B
Saiba mais
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-07-18
GPT 4o Mini
Saiba mais
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Visão
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-06-18
Mathstral 7B
Saiba mais
Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
-
2024-06-18
Mistral Nemo
Saiba mais
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-04-30
Cohere Command R+
Saiba mais
Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-04-18
Llama 3 8B
Saiba mais
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-03-30
Cohere Command R
Saiba mais
Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-03-09
Llama 3.1 70B
Saiba mais
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-02-27
Llama 3 70B
Saiba mais
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-02-25
Mistral Large
Saiba mais
Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-02-07
Cohere Command
Saiba mais
Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2023-09-01
DALL-E 3
Saiba mais
OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 tokens/s
Input Speed
8000 tokens/s
Max Input:
4,000 tokens
Max Output:
1 tokens
Price:
$0.04/1K input, $0.08/1K output
2023-09-01
Mistral 7B
Saiba mais
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2023-07-01
Llama 2
Saiba mais
Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2022-04-01
DALL-E 2
Saiba mais
OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 tokens/s
Input Speed
5000 tokens/s
Max Input:
1,000 tokens
Max Output:
1 tokens
Price:
$0.016/1K input, $0.02/1K output