Back to Models

GPT-4 Vision

OpenAIOpenAI

Introduction

GPT-4 Vision is OpenAI's groundbreaking multimodal model, capable of processing both text and image inputs. It allows users to interact with complex visual and textual tasks, such as analyzing images, generating detailed captions, or solving visual puzzles. This model is widely used in education, creative design, and data analysis.

1.8T
128,000 tokens
120 tokens/s
8000 tokens/s

Chat
Vision

Input Tokens128,000 tokens
Output Tokens4,096 tokens
$0.00035/1K input, $0.0007/1K output

Release Information

Release Date2024-04-23
Versiongpt4v