Lunabot
AI Model Timeline
2026-03-15
GLM 5 Turbo
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00096/1K input, $0.0032/1K output
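Each entry quotes its price per 1K input tokens and per 1K output tokens, so the cost of a single request can be estimated from its token counts. The following is a minimal sketch, assuming the listed prices are in USD and apply linearly with no caching or tiered discounts; the function name `estimate_cost` and the sample token counts are hypothetical and not part of the listing.

```python
from decimal import Decimal


def estimate_cost(input_tokens, output_tokens, input_price_per_1k, output_price_per_1k):
    """Return the estimated request cost in USD as a Decimal.

    Prices are passed as strings (e.g. "0.00096") so they convert to Decimal exactly.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Listed prices are per 1K tokens, so scale each token count by 1/1000.
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_1k) / Decimal(1000)
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_1k) / Decimal(1000)
    return input_cost + output_cost


# Hypothetical request: 10,000 prompt tokens and 2,000 generated tokens,
# priced with the GLM 5 Turbo figures listed above.
cost = estimate_cost(10_000, 2_000, "0.00096", "0.0032")
print(f"Estimated cost: ${cost:.4f}")
```

Decimal arithmetic is used here because the per-1K prices are small fractions that binary floats cannot represent exactly.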
2026-03-12
Grok 4.20 Multi-Agent Beta
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.002/1K input, $0.006/1K output
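The Input Speed and Response Speed fields can be combined into a rough end-to-end latency estimate. The sketch below assumes that Input Speed is the rate at which the prompt is processed and Response Speed is the rate at which output tokens are generated, and it ignores network, queueing, and any reasoning overhead; the function name `estimate_latency_seconds` and the sample token counts are hypothetical.

```python
def estimate_latency_seconds(input_tokens, output_tokens, input_speed, response_speed):
    """Return an approximate request duration in seconds.

    input_speed and response_speed are throughputs in tokens per second,
    as listed in the Input Speed and Response Speed fields.
    """
    if input_speed <= 0 or response_speed <= 0:
        raise ValueError("speeds must be positive")
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_time = input_tokens / input_speed          # time to read the prompt
    generation_time = output_tokens / response_speed  # time to write the reply
    return prompt_time + generation_time


# Hypothetical request: 28,000 prompt tokens and 700 generated tokens,
# using the Grok 4.20 Multi-Agent Beta speeds listed above.
seconds = estimate_latency_seconds(28_000, 700, 14_000, 140)
print(f"Estimated latency: {seconds:.1f} s")
```

For most entries generation dominates, since Response Speed is roughly two orders of magnitude lower than Input Speed.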
2026-03-05
GPT 5.4
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
120 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.0025/1K input, $0.015/1K output
2026-03-05
GPT 5.4 Pro
OpenAI
Parameters
-
Context Length
1,050,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,050,000 tokens
Max Output:
128,000 tokens
Price:
$0.03/1K input, $0.18/1K output
2026-03-05
GPT 5.4 Mini
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00075/1K input, $0.0045/1K output
2026-03-05
GPT 5.4 Nano
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.00125/1K output
2026-03-03
Gemini 3.1 Flash Lite Preview
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.00025/1K input, $0.0015/1K output
2026-02-19
Gemini 3.1 Pro Preview
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
140 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.002/1K input, $0.012/1K output
2026-02-18
Qwen3.5-35B-A3B
Qwen / Alibaba Cloud
Parameters
35B / 3B active
Context Length
262,144 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001625/1K input, $0.0013/1K output
2026-02-18
Qwen3.5-9B
Qwen / Alibaba Cloud
Parameters
9B
Context Length
262,144 tokens
Response Speed
240 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.00015/1K output
2026-02-17
Claude Sonnet 4.6
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2026-02-12
MiniMax M2.5
MiniMax
Parameters
-
Context Length
196,608 tokens
Response Speed
170 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
196,608 tokens
Max Output:
65,536 tokens
Price:
$0.000295/1K input, $0.0012/1K output
2026-02-11
GLM 5
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
140 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00072/1K input, $0.0023/1K output
2026-02-04
Claude Opus 4.6
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
100 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.005/1K input, $0.025/1K output
2026-02-04
Qwen3 Coder Next
Qwen / Alibaba Cloud
Parameters
80B / 3B active
Context Length
262,144 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00012/1K input, $0.00075/1K output
2026-02-01
GPT 5.3 Instant
OpenAI
Parameters
-
Context Length
128,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Vision
Code
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-02-01
GPT 5.3 Codex
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00175/1K input, $0.014/1K output
2026-01-27
Kimi K2.5
Kimi / Moonshot AI
Parameters
15T continued pretraining
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00045/1K input, $0.0022/1K output
2026-01-19
GLM 4.7
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.0006/1K input, $0.0022/1K output
2026-01-19
GLM 4.7 Flash
GLM / Z.ai
Parameters
-
Context Length
202,752 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Code
Max Input:
202,752 tokens
Max Output:
65,536 tokens
Price:
$0.00006/1K input, $0.0004/1K output
2025-12-17
Gemini 3 Flash Preview
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
190 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0005/1K input, $0.003/1K output
2025-12-11
GPT 5.2
OpenAI
Parameters
280B
Context Length
128,000 tokens
Response Speed
180 tokens/s
Input Speed
13000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
16,384 tokens
Price:
$0.00025/1K input, $0.0005/1K output
2025-12-10
GPT 5.2 Pro
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
110 tokens/s
Input Speed
11000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.021/1K input, $0.168/1K output
2025-12-08
GLM 4.6V
GLM / Z.ai
Parameters
-
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0009/1K output
2025-11-06
Kimi K2 Thinking
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
81,920 tokens
Price:
$0.00047/1K input, $0.002/1K output
2025-11-01
GPT 5.1
OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
8,192 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-10-15
Claude Haiku 4.5
Anthropic
Parameters
-
Context Length
200,000 tokens
Response Speed
170 tokens/s
Input Speed
14000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
64,000 tokens
Price:
$0.001/1K input, $0.005/1K output
2025-09-29
Claude Sonnet 4.5
Anthropic
Parameters
-
Context Length
1,000,000 tokens
Response Speed
120 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Code
Max Input:
1,000,000 tokens
Max Output:
64,000 tokens
Price:
$0.003/1K input, $0.015/1K output
2025-09-19
Grok 4 Fast
xAI
Parameters
-
Context Length
2,000,000 tokens
Response Speed
220 tokens/s
Input Speed
24000 tokens/s
Chat
Vision
Code
Max Input:
2,000,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0005/1K output
2025-08-21
DeepSeek V3.1
DeepSeek
Parameters
671B / 37B active
Context Length
131,072 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00015/1K input, $0.00075/1K output
2025-08-09
GPT 5
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-08-07
GPT 5 Mini
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Code
Max Input:
400,000 tokens
Max Output:
128,000 tokens
Price:
$0.00025/1K input, $0.002/1K output
2025-08-07
GPT 5 Nano
OpenAI
Parameters
-
Context Length
400,000 tokens
Response Speed
220 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Max Input:
400,000 tokens
Max Output:
64,000 tokens
Price:
$0.00005/1K input, $0.0004/1K output
2025-08-05
gpt-oss-120b
OpenAI
Parameters
117B / 5.1B active
Context Length
131,072 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000039/1K input, $0.00019/1K output
2025-07-25
Qwen3 235B A22B Thinking 2507
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
130 tokens/s
Input Speed
13000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
81,920 tokens
Price:
$0.00011/1K input, $0.0006/1K output
2025-07-23
Qwen3 Coder 480B A35B
Qwen / Alibaba Cloud
Parameters
480B / 35B active
Context Length
262,144 tokens
Response Speed
170 tokens/s
Input Speed
16000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.00022/1K input, $0.001/1K output
2025-07-22
Gemini 2.5 Flash-Lite
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
220 tokens/s
Input Speed
22000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0001/1K input, $0.0004/1K output
2025-07-21
Qwen3 235B A22B Instruct 2507
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
262,144 tokens
Response Speed
180 tokens/s
Input Speed
17000 tokens/s
Chat
Code
Max Input:
262,144 tokens
Max Output:
65,536 tokens
Price:
$0.000071/1K input, $0.0001/1K output
2025-07-11
Kimi K2
Kimi / Moonshot AI
Parameters
1T MoE / 32B active
Context Length
131,072 tokens
Response Speed
200 tokens/s
Input Speed
20000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.0022/1K output
2025-07-09
Grok 4
xAI
Parameters
314B
Context Length
256,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Code
Vision
Max Input:
256,000 tokens
Max Output:
128,000 tokens
Price:
$0.00008/1K input, $0.00016/1K output
2025-06-17
Gemini 2.5 Flash
Google AI
Parameters
-
Context Length
1,048,576 tokens
Response Speed
180 tokens/s
Input Speed
18000 tokens/s
Chat
Vision
Max Input:
1,048,576 tokens
Max Output:
65,536 tokens
Price:
$0.0003/1K input, $0.0025/1K output
2025-06-10
o3 Pro
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
80 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.02/1K input, $0.08/1K output
2025-06-10
Grok 3
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2025-06-10
Grok 3 Mini
xAI
Parameters
100B
Context Length
131,072 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2025-05-28
DeepSeek R1 0528
DeepSeek
Parameters
671B / 37B active
Context Length
163,840 tokens
Response Speed
120 tokens/s
Input Speed
12000 tokens/s
Chat
Code
Max Input:
163,840 tokens
Max Output:
65,536 tokens
Price:
$0.00055/1K input, $0.00219/1K output
2025-05-21
Claude 4 Sonnet
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2025-05-21
Claude 4 Opus
Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2025-04-28
Qwen3 235B A22B
Qwen / Alibaba Cloud
Parameters
235B / 22B active
Context Length
131,072 tokens
Response Speed
160 tokens/s
Input Speed
15000 tokens/s
Chat
Code
Max Input:
131,072 tokens
Max Output:
65,536 tokens
Price:
$0.000455/1K input, $0.00182/1K output
2025-04-16
o3
OpenAI
Parameters
-
Context Length
200,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Code
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.002/1K input, $0.008/1K output
2025-04-16
o4 Mini
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-16
o4 Mini (High)
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-04-10
GPT 4.1
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Mini
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-10
GPT 4.1 Nano
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2025-04-04
Llama 4 Maverick
Meta AI
Parameters
400B / 17B active
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-04-04
Llama 4 Scout
Meta AI
Parameters
109B / 17B active
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-03-20
Gemini 2.5 Pro
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
-
Input Speed
-
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
-
2025-03-13
Cohere Command A
Cohere
Parameters
180B
Context Length
256,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2025-02-26
Imagen 3.0
Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Max Input:
1,000,000 tokens
Max Output:
1 token
Price:
-
2025-01-31
o3 Mini (Medium)
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-31
o3 Mini (High)
OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
200,000 tokens
Max Output:
100,000 tokens
Price:
$0.0011/1K input, $0.0044/1K output
2025-01-30
Mistral Small 3
Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-28
Qwen 2.5 Max
Qwen / Alibaba Cloud
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-23
DeepSeek R1 Distill Llama 70B
DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2025-01-20
DeepSeek R1
DeepSeek
Parameters
671B / 37B active
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2025-01-14
MiniMax 01
MiniMax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-25
DeepSeek V3
DeepSeek
Parameters
671B / 37B active
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-12-12
Grok 2 Vision
xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-12-12
Phi 4
Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-12-11
Gemini 2.0 Flash
Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:
1,000,000 tokens
Max Output:
1,000,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-12-06
Llama 3.3 70B
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-11-18
Pixtral Large
Mistral
Parameters
124B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
128,000 tokens
Price:
-
2024-09-19
Qwen 2.5 Coder 32B
Qwen / Alibaba Cloud
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:
-
Max Output:
-
Price:
-
2024-09-19
Qwen 2.5 72B
Qwen / Alibaba Cloud
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-19
Qwen 2.5 7B
Qwen / Alibaba Cloud
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
-
2024-09-05
Reflection 70B
Matt Shumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-08-01
Grok 2
xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-08-01
FLUX.1-dev
Black Forest Labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:
-
Max Output:
-
Price:
-
2024-07-23
Llama 3.1 405B
Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0003/1K input, $0.0006/1K output
2024-07-23
Llama 3.1 8B
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-07-18
GPT 4o Mini
OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-06-18
Mathstral 7B
Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
-
2024-06-18
Mistral Nemo
Mistral
Parameters
12B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-04-30
Cohere Command R+
Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-04-18
Llama 3 8B
Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2024-03-30
Cohere Command R
Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-03-09
Llama 3.1 70B
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.00015/1K input, $0.0003/1K output
2024-02-27
Llama 3 70B
Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2024-02-25
Mistral Large
Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
100,000 tokens
Max Output:
100,000 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2024-02-07
Cohere Command
Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:
128,000 tokens
Max Output:
4,096 tokens
Price:
$0.0002/1K input, $0.0004/1K output
2023-09-01
DALL-E 3
OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 token/s
Input Speed
8000 tokens/s
Max Input:
4,000 tokens
Max Output:
1 token
Price:
$0.04/1K input, $0.08/1K output
2023-09-01
Mistral 7B
Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:
32,000 tokens
Max Output:
32,000 tokens
Price:
$0.00005/1K input, $0.0001/1K output
2023-07-01
Llama 2
Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:
4,096 tokens
Max Output:
4,096 tokens
Price:
$0.0001/1K input, $0.0002/1K output
2022-04-01
DALL-E 2
OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 token/s
Input Speed
5000 tokens/s
Max Input:
1,000 tokens
Max Output:
1 token
Price:
$0.016/1K input, $0.02/1K output