Lunabot
Français
🇺🇸 English
🇨🇳 简体中文
🇭🇰 繁體中文
🇪🇸 Español
🇯🇵 日本語
🇫🇷 Français
🇰🇷 한국어
🇩🇪 Deutsch
🇮🇹 Italiano
🇦🇪 العربية
🇵🇹 Português
🇷🇺 Русский
Application
Extension de Navigateur
Extension Chrome
Extension Edge
Extension Firefox
Application Mobile
Application iOS
Application Android
Bot Telegram
Raccourci Siri
Application de Bureau
Application macOS
Application Windows
Application Web
Form GPT
Bibliothèque de prompts
Centre d'aide
Tarification
Offre annuelle
Lancer Lunabot Web
Liste complète des modèles IA
2026-03-15
GLM 5 Turbo
En savoir plus
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00096/1K input, $0.0032/1K output
2026-03-12
Grok 4.20 Multi-Agent Beta
En savoir plus
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.002/1K input, $0.006/1K output
2026-03-05
GPT 5.4
En savoir plus
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
120 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.0025/1K input, $0.015/1K output
2026-03-05
GPT 5.4 Pro
En savoir plus
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.03/1K input, $0.18/1K output
2026-03-05
GPT 5.4 Mini
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00075/1K input, $0.0045/1K output
2026-03-05
GPT 5.4 Nano
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.00125/1K output
2026-03-03
Gemini 3.1 Flash Lite Preview
En savoir plus
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.00025/1K input, $0.0015/1K output
2026-02-19
Gemini 3.1 Pro Preview
En savoir plus
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.002/1K input, $0.012/1K output
2026-02-18
Qwen3.5-35B-A3B
En savoir plus
Qwen / Alibaba Cloud
Parameters
35B / 3B active
Context Length
262,144 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001625/1K input, $0.0013/1K output
2026-02-18
Qwen3.5-9B
En savoir plus
Qwen / Alibaba Cloud
Parameters
9B
Context Length
262,144 tokens
Response Speed
240 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.00015/1K output
2026-02-17
Claude Sonnet 4.6
En savoir plus
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2026-02-12
MiniMax M2.5
En savoir plus
MiniMax
Parameters
-
Context Length
196,608 tokens
Response Speed
170 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
196,608 tokens
Max Output:
65,536 tokens
Price:
$0.000295/1K input, $0.0012/1K output
2026-02-11
GLM 5
En savoir plus
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
140 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00072/1K input, $0.0023/1K output
2026-02-04
Claude Opus 4.6
En savoir plus
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
100 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.005/1K input, $0.025/1K output
2026-02-04
Qwen3 Coder Next
En savoir plus
Qwen / Alibaba Cloud
Parameters
80B / 3B active
Context Length
262,144 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00012/1K input, $0.00075/1K output
2026-02-01
GPT 5.3 Instant
En savoir plus
OpenAI
Parameters
-
Context Length
128,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Vision
Code
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-02-01
GPT 5.3 Codex
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-01-27
Kimi K2.5
En savoir plus
Kimi / Moonshot AI
Parameters
15T continued pretraining
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00045/1K input, $0.0022/1K output
2026-01-19
GLM 4.7
En savoir plus
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.0006/1K input, $0.0022/1K output
2026-01-19
GLM 4.7 Flash
En savoir plus
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00006/1K input, $0.0004/1K output
2025-12-17
Gemini 3 Flash Preview
En savoir plus
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0005/1K input, $0.003/1K output
2025-12-11
GPT 5.2
En savoir plus
OpenAI
Parameters
280B
Context Length
128,000 tokens
Response Speed
180 tokens/s
Input Speed
13000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00025/1K input, $0.0005/1K output
2025-12-10
GPT 5.2 Pro
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
110 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.021/1K input, $0.168/1K output
2025-12-08
GLM 4.6V
En savoir plus
GLM / Z.ai
Parameters
-
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0009/1K output
2025-11-06
Kimi K2 Thinking
En savoir plus
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
81,920 tokens
Price:
$0.00047/1K input, $0.002/1K output
2025-11-01
GPT 5.1
En savoir plus
OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
8,192 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-10-15
Claude Haiku 4.5
En savoir plus
Anthropic
Parameters
-
Context Length
200,000 tokens
Response Speed
170 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
64,000 tokens
Price:
$0.001/1K input, $0.005/1K output
2025-09-29
Claude Sonnet 4.5
En savoir plus
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2025-09-19
Grok 4 Fast
En savoir plus
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
220 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0005/1K output
2025-08-21
DeepSeek V3.1
En savoir plus
DeepSeek
Parameters
671B / 37B active
Context Length
131,072 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00015/1K input, $0.00075/1K output
2025-08-09
GPT 5
En savoir plus
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-08-07
GPT 5 Mini
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00025/1K input, $0.002/1K output
2025-08-07
GPT 5 Nano
En savoir plus
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
64,000 tokens
Price:
$0.00005/1K input, $0.0004/1K output
2025-08-05
gpt-oss-120b
En savoir plus
OpenAI
Parameters
117B / 5.1B active
Context Length
131,072 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000039/1K input, $0.00019/1K output
2025-07-25
Qwen3 235B A22B Thinking 2507
En savoir plus
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
81,920 tokens
Price:
$0.00011/1K input, $0.0006/1K output
2025-07-23
Qwen3 Coder 480B A35B
En savoir plus
Qwen / Alibaba Cloud
Parameters
480B / 35B active
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00022/1K input, $0.001/1K output
2025-07-22
Gemini 2.5 Flash-Lite
En savoir plus
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.0004/1K output
2025-07-21
Qwen3 235B A22B Instruct 2507
En savoir plus
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
180 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.000071/1K input, $0.0001/1K output
2025-07-11
Kimi K2
En savoir plus
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
200 tokens/s
Input Speed
20000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.0022/1K output
2025-07-09
Grok 4
En savoir plus
xAI
Parameters
314B
Context Length
256,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Vision
Max Input:
256,000 tokens
Max Output:
128,000 tokens
Price:
$0.00008/1K input, $0.00016/1K output
2025-06-17
Gemini 2.5 Flash
En savoir plus
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0025/1K output
2025-06-10
o3 Pro
En savoir plus
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.02/1K input, $0.08/1K output
2025-06-10
Grok 3
En savoir plus
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-06-10
Grok 3 Mini
En savoir plus
xAI
Parameters
100B
Context Length
131,072 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-28
DeepSeek R1 0528
En savoir plus
DeepSeek
Parameters
671B / 37B active
Context Length
163,840 tokens
Response Speed
120 tokens/s
Input Speed
12000 tokens/s
Chat
Code
Max Input:
163,840 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.00219/1K output
2025-05-21
Claude 4 Sonnet
En savoir plus
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-21
Claude 4 Opus
En savoir plus
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-04-28
Qwen3 235B A22B
En savoir plus
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
131,072 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000455/1K input, $0.00182/1K output
2025-04-16
o3
En savoir plus
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.002/1K input, $0.008/1K output
2025-04-16
O4 Mini
En savoir plus
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-16
O4 Mini (High)
En savoir plus
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-10
GPT 4.1
En savoir plus
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Mini
En savoir plus
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Nano
En savoir plus
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-04
Llama 4 Maverick
En savoir plus
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-04-04
Llama 4 Scout
En savoir plus
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-03-20
Gemini 2.5 Pro
En savoir plus
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
-
Input Speed
-
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
-
2025-03-13
Cohere Command A
En savoir plus
Cohere
Parameters
180B
Context Length
256,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-02-26
Imagen 3.0
En savoir plus
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Max Input:
1,000,000 tokens
Max Output:
1 tokens
Price:
-
2025-01-31
O3 Mini (Medium)
En savoir plus
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-31
O3 Mini (High)
En savoir plus
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-30
Mistral Small 3
En savoir plus
Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-28
Qwen 2.5 Max
En savoir plus
Qwen / Alibaba Cloud
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-23
Deepseek R1 Distill Llama 70B
En savoir plus
DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-20
Deepseek R1
En savoir plus
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-01-14
Minimax 01
En savoir plus
MiniMax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-25
Deepseek V3
En savoir plus
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-12-12
Grok 2 Vision
En savoir plus
xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-12-12
Phi 4
En savoir plus
Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-11
Gemini 2.0 Flash
En savoir plus
Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-12-06
Llama 3.3 70B
En savoir plus
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-11-18
Pixtral Large
En savoir plus
Mistral
Parameters
12B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2024-09-19
Qwen 2.5 Coder 32B
En savoir plus
Qwen / Alibaba Cloud
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:
-
Max Output:
-
Price:
-
2024-09-19
Qwen 2.5 72B
En savoir plus
Qwen / Alibaba Cloud
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-19
Qwen 2.5 7B
En savoir plus
Qwen / Alibaba Cloud
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-05
Reflection 70B
En savoir plus
Matt Shumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-08-01
Grok 2
En savoir plus
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-08-01
FLUX.1-dev
En savoir plus
Black Forest Labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:
-
Max Output:
-
Price:
-
2024-07-23
Llama 3.1 405B
En savoir plus
Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-07-23
Llama 3.1 8B
En savoir plus
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-07-18
GPT 4o Mini
En savoir plus
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-06-18
Mathstral 7B
En savoir plus
Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
-
2024-06-18
Mistral Nemo
En savoir plus
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-04-30
Cohere Command R+
En savoir plus
Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-04-18
Llama 3 8B
En savoir plus
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-03-30
Cohere Command R
En savoir plus
Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-03-09
Llama 3.1 70B
En savoir plus
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-02-27
Llama 3 70B
En savoir plus
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-02-25
Mistral Large
En savoir plus
Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-02-07
Cohere Command
En savoir plus
Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2023-09-01
DALL-E 3
En savoir plus
OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 tokens/s
Input Speed
8000 tokens/s
Max Input:
4,000 tokens
Max Output:
1 tokens
Price:
$0.04/1K input, $0.08/1K output
2023-09-01
Mistral 7B
En savoir plus
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2023-07-01
Llama 2
En savoir plus
Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2022-04-01
DALL-E 2
En savoir plus
OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 tokens/s
Input Speed
5000 tokens/s
Max Input:
1,000 tokens
Max Output:
1 tokens
Price:
$0.016/1K input, $0.02/1K output