AI Model Timeline

2025-02-05

Gemini 2.0 Pro

Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:-
2025-02-05

Gemini 2.0 Flash Lite

Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:-
2025-01-31

O3 Mini

OpenAI
Parameters
-
Context Length
-
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:200,000 tokens
Max Output:100,000 tokens
Price:$0.0011/1K input, $0.0044/1K output
2025-01-30

Mistral Small 3

Mistral
Parameters
24B
Context Length
32,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-28

Qwen 2.5 Max

Alibaba
Parameters
100B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-23

DeepSeek R1 Distill Llama 70B

DeepSeek
Parameters
70B
Context Length
100,000 tokens
Response Speed
270 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2025-01-20

DeepSeek R1

DeepSeek
Parameters
671B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2025-01-14

Minimax 01

minimax
Parameters
45.9B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-12-25

DeepSeek V3

DeepSeek
Parameters
671B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-12-19

Gemini 2.0 Flash Thinking

Google AI
Parameters
1.5T
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:$0.0004/1K input, $0.0008/1K output
2024-12-12

Grok 2 Vision

xAI
Parameters
500B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-12-12

Phi 4

Microsoft
Parameters
14.7B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-12-11

Gemini 2.0 Flash

Google AI
Parameters
750B
Context Length
1,000,000 tokens
Response Speed
220 tokens/s
Input Speed
20000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-12-06

Llama 3.3 70B

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-12-05

O1

OpenAI
Parameters
2T
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0005/1K input, $0.001/1K output
2024-11-18

Pixtral Large

Mistral
Parameters
124B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:-
2024-09-19

Qwen 2.5 Coder 32B

Alibaba
Parameters
32B
Context Length
-
Response Speed
-
Input Speed
-
Chat
Max Input:-
Max Output:-
Price:-
2024-09-19

Qwen 2.5 7B

Alibaba
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-09-19

Qwen 2.5 72B

Alibaba
Parameters
72B
Context Length
131,072 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:-
2024-09-12

O1 Mini

OpenAI
Parameters
250B
Context Length
128,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-09-12

O1 Preview

OpenAI
Parameters
2T
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0004/1K input, $0.0008/1K output
2024-09-05

Reflection 70B

mattshumer
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-08-01

Grok 2

xAI
Parameters
300B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-08-01

FLUX.1-dev

black-forest-labs
Parameters
-
Context Length
-
Response Speed
-
Input Speed
-
Max Input:-
Max Output:-
Price:-
2024-07-23

Llama 3.1 405B

Meta AI
Parameters
405B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-07-23

Llama 3.1 8B

Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-07-18

GPT 4o Mini

OpenAI
Parameters
220B
Context Length
128,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-06-27

Gemma 2 9B

Google AI
Parameters
9B
Context Length
8,192 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:8,192 tokens
Max Output:8,192 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-06-18

Mathstral 7B

Mistral
Parameters
7B
Context Length
4,096 tokens
Response Speed
150 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:4,096 tokens
Max Output:4,096 tokens
Price:-
2024-06-18

Mistral Nemo

Mistral
Parameters
12B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:32,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-05-13

GPT-4o

OpenAI
Parameters
1.8T
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-05-13

ChatGPT-4o

OpenAI
Parameters
1.8T
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
10000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-04-30

Cohere Command R+

Cohere
Parameters
140B
Context Length
128,000 tokens
Response Speed
140 tokens/s
Input Speed
11000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-04-23

GPT-4 Vision

OpenAI
Parameters
1.8T
Context Length
128,000 tokens
Response Speed
120 tokens/s
Input Speed
8000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.00035/1K input, $0.0007/1K output
2024-04-18

Llama 3 8B

Meta AI
Parameters
8B
Context Length
100,000 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2024-03-30

Cohere Command R

Cohere
Parameters
60B
Context Length
32,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:4,096 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-03-14

Claude 3.5 Sonnet

Anthropic
Parameters
230B
Context Length
200,000 tokens
Response Speed
180 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Max Input:200,000 tokens
Max Output:200,000 tokens
Price:$0.00025/1K input, $0.0005/1K output
2024-03-14

Claude 3.5 Haiku

Anthropic
Parameters
180B
Context Length
200,000 tokens
Response Speed
190 tokens/s
Input Speed
16000 tokens/s
Chat
Vision
Max Input:200,000 tokens
Max Output:200,000 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-03-13

Claude 3 Haiku

Anthropic
Parameters
140B
Context Length
200,000 tokens
Response Speed
190 tokens/s
Input Speed
16000 tokens/s
Chat
Vision
Max Input:200,000 tokens
Max Output:200,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-03-09

Llama 3.1 70B

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.00015/1K input, $0.0003/1K output
2024-03-03

Claude 3 Sonnet

Anthropic
Parameters
200B
Context Length
200,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Vision
Max Input:200,000 tokens
Max Output:200,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-03-03

Claude 3 Opus

Anthropic
Parameters
400B
Context Length
200,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Vision
Max Input:200,000 tokens
Max Output:200,000 tokens
Price:$0.00035/1K input, $0.0007/1K output
2024-02-27

Llama 3 70B

Meta AI
Parameters
70B
Context Length
100,000 tokens
Response Speed
130 tokens/s
Input Speed
10000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-02-25

Mistral Large

Mistral
Parameters
32B
Context Length
100,000 tokens
Response Speed
160 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0002/1K input, $0.0004/1K output
2024-02-14

Gemini Flash 1.5

Google AI
Parameters
120B
Context Length
1,000,000 tokens
Response Speed
200 tokens/s
Input Speed
18000 tokens/s
Chat
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2024-02-14

Gemini Pro 1.5

Google AI
Parameters
500B
Context Length
1,000,000 tokens
Response Speed
170 tokens/s
Input Speed
15000 tokens/s
Chat
Vision
Max Input:1,000,000 tokens
Max Output:1,000,000 tokens
Price:$0.0003/1K input, $0.0006/1K output
2024-02-07

Cohere Command

Cohere
Parameters
180B
Context Length
128,000 tokens
Response Speed
160 tokens/s
Input Speed
13000 tokens/s
Chat
Max Input:128,000 tokens
Max Output:4,096 tokens
Price:$0.0002/1K input, $0.0004/1K output
2023-12-06

Gemini Pro 1.0

Google AI
Parameters
240B
Context Length
32,000 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:32,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2023-11-05

GPT-4 Turbo

OpenAI
Parameters
1.8T
Context Length
128,000 tokens
Response Speed
130 tokens/s
Input Speed
9000 tokens/s
Chat
Vision
Max Input:128,000 tokens
Max Output:128,000 tokens
Price:$0.00025/1K input, $0.0005/1K output
2023-09-01

DALL-E 3

OpenAI
Parameters
7B
Context Length
4,000 tokens
Response Speed
1 tokens/s
Input Speed
8000 tokens/s
Max Input:4,000 tokens
Max Output:1 tokens
Price:$0.04/1K input, $0.08/1K output
2023-09-01

Mistral 7B

Mistral
Parameters
7B
Context Length
32,000 tokens
Response Speed
200 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:32,000 tokens
Max Output:32,000 tokens
Price:$0.00005/1K input, $0.0001/1K output
2023-08-09

Claude v1.2

Anthropic
Parameters
150B
Context Length
100,000 tokens
Response Speed
150 tokens/s
Input Speed
12000 tokens/s
Chat
Max Input:100,000 tokens
Max Output:100,000 tokens
Price:$0.0001/1K input, $0.0002/1K output
2023-07-01

Llama 2

Meta AI
Parameters
70B
Context Length
4,096 tokens
Response Speed
160 tokens/s
Input Speed
14000 tokens/s
Chat
Max Input:4,096 tokens
Max Output:4,096 tokens
Price:$0.0001/1K input, $0.0002/1K output
2023-03-13

GPT-4

OpenAI
Parameters
1.8T
Context Length
8,192 tokens
Response Speed
120 tokens/s
Input Speed
8000 tokens/s
Chat
Max Input:8,192 tokens
Max Output:8,192 tokens
Price:$0.0003/1K input, $0.0006/1K output
2022-11-29

GPT-3.5 Turbo

OpenAI
Parameters
175B
Context Length
4,096 tokens
Response Speed
180 tokens/s
Input Speed
16000 tokens/s
Chat
Max Input:4,096 tokens
Max Output:4,096 tokens
Price:$0.0001/1K input, $0.0002/1K output
2022-04-01

DALL-E 2

OpenAI
Parameters
3.5B
Context Length
1,000 tokens
Response Speed
1 tokens/s
Input Speed
5000 tokens/s
Max Input:1,000 tokens
Max Output:1 tokens
Price:$0.016/1K input, $0.02/1K output
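
Prices in this timeline are quoted per 1K tokens, separately for input and output, and speeds are quoted in tokens/s. The sketch below shows one way to turn a card's Price line into a rough per-request cost and to estimate request time from the two speed figures. The function names (parse_price, estimate_cost, estimate_seconds) are hypothetical, and the time estimate assumes that "Input Speed" is the prompt-processing rate and "Response Speed" is the generation rate, which the listing does not state. The example values are the O3 Mini figures listed above.

```python
import re

# Matches a price line such as "Price:$0.0011/1K input, $0.0044/1K output".
PRICE_RE = re.compile(r"\$([0-9.]+)/1K input, \$([0-9.]+)/1K output")


def parse_price(line):
    """Return (input_price, output_price) in USD per 1K tokens, or None for 'Price:-'."""
    match = PRICE_RE.search(line)
    if match is None:
        return None
    return float(match.group(1)), float(match.group(2))


def estimate_cost(price_line, input_tokens, output_tokens):
    """Rough request cost in USD; None when the card lists no price."""
    prices = parse_price(price_line)
    if prices is None:
        return None
    input_price, output_price = prices
    return input_tokens / 1000 * input_price + output_tokens / 1000 * output_price


def estimate_seconds(input_tokens, output_tokens, input_speed, response_speed):
    """Rough request time, ASSUMING the speeds are prompt and generation rates in tokens/s."""
    return input_tokens / input_speed + output_tokens / response_speed


if __name__ == "__main__":
    o3_mini_price = "Price:$0.0011/1K input, $0.0044/1K output"
    # 2,000 input tokens and 500 output tokens, using the O3 Mini card values.
    print(estimate_cost(o3_mini_price, 2000, 500))
    print(estimate_seconds(2000, 500, input_speed=10000, response_speed=140))
    # Cards without a listed price yield None rather than a guessed figure.
    print(estimate_cost("Price:-", 2000, 500))
```

Returning None for cards that show "Price:-" keeps missing prices distinct from a price of zero.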