Lunabot
Deutsch
🇺🇸 English
🇨🇳 简体中文
🇭🇰 繁體中文
🇪🇸 Español
🇯🇵 日本語
🇫🇷 Français
🇰🇷 한국어
🇩🇪 Deutsch
🇮🇹 Italiano
🇦🇪 العربية
🇵🇹 Português
🇷🇺 Русский
App
Browser-Erweiterung
Chrome Erweiterung
Edge Erweiterung
Firefox Erweiterung
Mobile App
iOS App
Android App
Telegram-Bot
Siri-Kurzbefehl
Desktop-App
macOS App
Windows App
Web-App
Form GPT
Prompt-Bibliothek
Hilfe
Preise
Jahresangebot
Lunabot Web starten
KI-Modell-Zeitleiste
2026-03-15
GLM 5 Turbo
Mehr erfahren
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00096/1K input, $0.0032/1K output
2026-03-12
Grok 4.20 Multi-Agent Beta
Mehr erfahren
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.002/1K input, $0.006/1K output
2026-03-05
GPT 5.4
Mehr erfahren
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
120 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.0025/1K input, $0.015/1K output
2026-03-05
GPT 5.4 Pro
Mehr erfahren
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.03/1K input, $0.18/1K output
2026-03-05
GPT 5.4 Mini
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00075/1K input, $0.0045/1K output
2026-03-05
GPT 5.4 Nano
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.00125/1K output
2026-03-03
Gemini 3.1 Flash Lite Preview
Mehr erfahren
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.00025/1K input, $0.0015/1K output
2026-02-19
Gemini 3.1 Pro Preview
Mehr erfahren
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.002/1K input, $0.012/1K output
2026-02-18
Qwen3.5-35B-A3B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
35B / 3B active
Context Length
262,144 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001625/1K input, $0.0013/1K output
2026-02-18
Qwen3.5-9B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
9B
Context Length
262,144 tokens
Response Speed
240 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.00015/1K output
2026-02-17
Claude Sonnet 4.6
Mehr erfahren
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2026-02-12
MiniMax M2.5
Mehr erfahren
MiniMax
Parameters
-
Context Length
196,608 tokens
Response Speed
170 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
196,608 tokens
Max Output:
65,536 tokens
Price:
$0.000295/1K input, $0.0012/1K output
2026-02-11
GLM 5
Mehr erfahren
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
140 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00072/1K input, $0.0023/1K output
2026-02-04
Claude Opus 4.6
Mehr erfahren
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
100 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.005/1K input, $0.025/1K output
2026-02-04
Qwen3 Coder Next
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
80B / 3B active
Context Length
262,144 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00012/1K input, $0.00075/1K output
2026-02-01
GPT 5.3 Instant
Mehr erfahren
OpenAI
Parameters
-
Context Length
128,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Vision
Code
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-02-01
GPT 5.3 Codex
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-01-27
Kimi K2.5
Mehr erfahren
Kimi / Moonshot AI
Parameters
15T continued pretraining
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00045/1K input, $0.0022/1K output
2026-01-19
GLM 4.7
Mehr erfahren
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.0006/1K input, $0.0022/1K output
2026-01-19
GLM 4.7 Flash
Mehr erfahren
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00006/1K input, $0.0004/1K output
2025-12-17
Gemini 3 Flash Preview
Mehr erfahren
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0005/1K input, $0.003/1K output
2025-12-11
GPT 5.2
Mehr erfahren
OpenAI
Parameters
280B
Context Length
128,000 tokens
Response Speed
180 tokens/s
Input Speed
13000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00025/1K input, $0.0005/1K output
2025-12-10
GPT 5.2 Pro
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
110 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.021/1K input, $0.168/1K output
2025-12-08
GLM 4.6V
Mehr erfahren
GLM / Z.ai
Parameters
-
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0009/1K output
2025-11-06
Kimi K2 Thinking
Mehr erfahren
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
81,920 tokens
Price:
$0.00047/1K input, $0.002/1K output
2025-11-01
GPT 5.1
Mehr erfahren
OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
8,192 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-10-15
Claude Haiku 4.5
Mehr erfahren
Anthropic
Parameters
-
Context Length
200,000 tokens
Response Speed
170 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
64,000 tokens
Price:
$0.001/1K input, $0.005/1K output
2025-09-29
Claude Sonnet 4.5
Mehr erfahren
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2025-09-19
Grok 4 Fast
Mehr erfahren
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
220 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0005/1K output
2025-08-21
DeepSeek V3.1
Mehr erfahren
DeepSeek
Parameters
671B / 37B active
Context Length
131,072 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00015/1K input, $0.00075/1K output
2025-08-09
GPT 5
Mehr erfahren
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-08-07
GPT 5 Mini
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00025/1K input, $0.002/1K output
2025-08-07
GPT 5 Nano
Mehr erfahren
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
64,000 tokens
Price:
$0.00005/1K input, $0.0004/1K output
2025-08-05
gpt-oss-120b
Mehr erfahren
OpenAI
Parameters
117B / 5.1B active
Context Length
131,072 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000039/1K input, $0.00019/1K output
2025-07-25
Qwen3 235B A22B Thinking 2507
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
81,920 tokens
Price:
$0.00011/1K input, $0.0006/1K output
2025-07-23
Qwen3 Coder 480B A35B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
480B / 35B active
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00022/1K input, $0.001/1K output
2025-07-22
Gemini 2.5 Flash-Lite
Mehr erfahren
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.0004/1K output
2025-07-21
Qwen3 235B A22B Instruct 2507
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
180 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.000071/1K input, $0.0001/1K output
2025-07-11
Kimi K2
Mehr erfahren
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
200 tokens/s
Input Speed
20000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.0022/1K output
2025-07-09
Grok 4
Mehr erfahren
xAI
Parameters
314B
Context Length
256,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Vision
Max Input:
256,000 tokens
Max Output:
128,000 tokens
Price:
$0.00008/1K input, $0.00016/1K output
2025-06-17
Gemini 2.5 Flash
Mehr erfahren
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0025/1K output
2025-06-10
o3 Pro
Mehr erfahren
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.02/1K input, $0.08/1K output
2025-06-10
Grok 3
Mehr erfahren
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-06-10
Grok 3 Mini
Mehr erfahren
xAI
Parameters
100B
Context Length
131,072 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-28
DeepSeek R1 0528
Mehr erfahren
DeepSeek
Parameters
671B / 37B active
Context Length
163,840 tokens
Response Speed
120 tokens/s
Input Speed
12000 tokens/s
Chat
Code
Max Input:
163,840 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.00219/1K output
2025-05-21
Claude 4 Sonnet
Mehr erfahren
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-05-21
Claude 4 Opus
Mehr erfahren
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$undefined/1K input, $undefined/1K output
2025-04-28
Qwen3 235B A22B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
131,072 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000455/1K input, $0.00182/1K output
2025-04-16
o3
Mehr erfahren
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.002/1K input, $0.008/1K output
2025-04-16
O4 Mini
Mehr erfahren
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-16
O4 Mini (High)
Mehr erfahren
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-10
GPT 4.1
Mehr erfahren
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Mini
Mehr erfahren
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Nano
Mehr erfahren
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-04
Llama 4 Maverick
Mehr erfahren
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-04-04
Llama 4 Scout
Mehr erfahren
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-03-20
Gemini 2.5 Pro
Mehr erfahren
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
-
Input Speed
-
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
-
2025-03-13
Cohere Command A
Mehr erfahren
Cohere
Parameters
180B
Context Length
256,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-02-26
Imagen 3.0
Mehr erfahren
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Max Input:
1,000,000 tokens
Max Output:
1 tokens
Price:
-
2025-01-31
O3 Mini (Medium)
Mehr erfahren
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-31
O3 Mini (High)
Mehr erfahren
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-30
Mistral Small 3
Mehr erfahren
Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-28
Qwen 2.5 Max
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-23
Deepseek R1 Distill Llama 70B
Mehr erfahren
DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-20
Deepseek R1
Mehr erfahren
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-01-14
Minimax 01
Mehr erfahren
MiniMax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-25
Deepseek V3
Mehr erfahren
DeepSeek
Parameters
67B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-12-12
Grok 2 Vision
Mehr erfahren
xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-12-12
Phi 4
Mehr erfahren
Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-11
Gemini 2.0 Flash
Mehr erfahren
Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-12-06
Llama 3.3 70B
Mehr erfahren
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-11-18
Pixtral Large
Mehr erfahren
Mistral
Parameters
12B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2024-09-19
Qwen 2.5 Coder 32B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:
-
Max Output:
-
Price:
-
2024-09-19
Qwen 2.5 72B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-19
Qwen 2.5 7B
Mehr erfahren
Qwen / Alibaba Cloud
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-05
Reflection 70B
Mehr erfahren
Matt Shumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-08-01
Grok 2
Mehr erfahren
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-08-01
FLUX.1-dev
Mehr erfahren
Black Forest Labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:
-
Max Output:
-
Price:
-
2024-07-23
Llama 3.1 405B
Mehr erfahren
Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-07-23
Llama 3.1 8B
Mehr erfahren
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-07-18
GPT 4o Mini
Mehr erfahren
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-06-18
Mathstral 7B
Mehr erfahren
Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
-
2024-06-18
Mistral Nemo
Mehr erfahren
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-04-30
Cohere Command R+
Mehr erfahren
Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-04-18
Llama 3 8B
Mehr erfahren
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-03-30
Cohere Command R
Mehr erfahren
Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-03-09
Llama 3.1 70B
Mehr erfahren
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-02-27
Llama 3 70B
Mehr erfahren
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-02-25
Mistral Large
Mehr erfahren
Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-02-07
Cohere Command
Mehr erfahren
Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2023-09-01
DALL-E 3
Mehr erfahren
OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 tokens/s
Input Speed
8000 tokens/s
Max Input:
4,000 tokens
Max Output:
1 tokens
Price:
$0.04/1K input, $0.08/1K output
2023-09-01
Mistral 7B
Mehr erfahren
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2023-07-01
Llama 2
Mehr erfahren
Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2022-04-01
DALL-E 2
Mehr erfahren
OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 tokens/s
Input Speed
5000 tokens/s
Max Input:
1,000 tokens
Max Output:
1 tokens
Price:
$0.016/1K input, $0.02/1K output