OpenAI Models
Browse models from OpenAI
OpenAI GPT is a series of general-purpose multimodal large language models. GPT-5, released in August 2025, excels at advanced coding and debugging, is cost-efficient, and suits content generation and reasoning applications.
O
GPT-5.2 Pro
Context:
400,000
Input:
$21/M
Output:
$168/M
gpt-5.2-pro is the highest-capability, production-oriented member of OpenAI’s GPT-5.2 family, exposed through the Responses API for workloads that demand maximal fidelity, multi-step reasoning, extensive tool use and the largest context/throughput budgets OpenAI offers.
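The per-million-token prices listed on this page translate into a per-request cost by simple proportion. The following is a minimal sketch, assuming the GPT-5.2 Pro rates shown above ($21 per million input tokens, $168 per million output tokens) and assuming billing is linear in token count. The function name `estimate_cost_usd` and the token counts are hypothetical and serve only as an illustration.

```python
from decimal import Decimal

def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      input_per_million: str, output_per_million: str) -> Decimal:
    """Estimate one request's cost from per-million-token prices (assumed linear billing)."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * Decimal(input_per_million) / million
    output_cost = Decimal(output_tokens) * Decimal(output_per_million) / million
    return input_cost + output_cost

# Hypothetical request: 12,000 input tokens and 1,500 output tokens at GPT-5.2 Pro rates.
cost = estimate_cost_usd(12_000, 1_500, "21", "168")
print(cost)  # 0.504
```

The same function applies to any entry on this page that lists Input and Output prices per million tokens; only the two price arguments change.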
O
GPT-5.2 Chat
Context:
128,000
Input:
$1.75/M
Output:
$14/M
gpt-5.2-chat-latest is the Chat-optimized snapshot of OpenAI’s GPT-5.2 family (branded in ChatGPT as GPT-5.2 Instant). It is the model for interactive/chat use cases that need a blend of speed, long-context handling, multimodal inputs and reliable conversational behaviour.
O
GPT-5.2
Context:
400,000
Input:
$1.75/M
Output:
$14/M
GPT-5.2 is a multi-flavored model suite (Instant, Thinking, Pro) engineered for better long-context understanding, stronger coding and tool use, and materially higher performance on professional “knowledge-work” benchmarks.
O
GPT-5.1 Chat
Context:
400K
Input:
$1.25/M
Output:
$10/M
GPT-5.1 Chat is an instruction-tuned conversational language model for general-purpose chat, reasoning, and writing. It supports multi-turn dialogue, summarization, drafting, knowledge-base QA, and lightweight code assistance for in-app assistants, support automation, and workflow copilots. Technical highlights include chat-optimized alignment, controllable and structured outputs, and integration paths for tool invocation and retrieval workflows when available.
O
GPT-5.1
Input:
$1.25/M
Output:
$10/M
GPT-5.1 is a general-purpose instruction-tuned language model focused on text generation and reasoning across product workflows. It supports multi-turn dialogue, structured output formatting, and code-oriented tasks such as drafting, refactoring, and explanation. Typical uses include chat assistants, retrieval-augmented QA, data transformation, and agent-style automation with tools or APIs when supported. Technical highlights include text-centric modality, instruction following, JSON-style outputs, and compatibility with function calling in common orchestration frameworks.
O
GPT Image 1.5
Input:
$8/M
Output:
$16/M
GPT-Image-1.5 is OpenAI’s image model in the GPT Image family. It is a natively multimodal GPT model designed to generate images from text prompts and to perform high-fidelity edits of input images while following user instructions closely.
O
GPT-5 nano
Context:
400K
Input:
$0.05/M
Output:
$0.4/M
GPT-5 Nano is an artificial intelligence model provided by OpenAI.
O
GPT-5 mini
Context:
400K
Input:
$0.25/M
Output:
$2/M
GPT-5 mini is OpenAI’s cost- and latency-optimized member of the GPT-5 family, intended to deliver much of GPT-5’s multimodal and instruction-following strengths at substantially lower cost for large-scale production use. It targets environments where throughput, predictable per-token pricing, and fast responses are the primary constraints while still providing strong general-purpose capabilities.
O
GPT 5 Chat
Context:
400K
Input:
$1.25/M
Output:
$10/M
GPT-5 Chat (latest) is an artificial intelligence model provided by OpenAI.
O
GPT-5
Context:
400K
Input:
$1.25/M
Output:
$10/M
GPT-5 is OpenAI's most powerful coding model to date. It shows significant improvements in complex front-end generation and in debugging large codebases. From a single prompt, it can create responsive, aesthetically pleasing websites, applications, and games. Early testers have also noted its design choices, which reflect a deeper understanding of elements like spacing, typography, and white space.
O
GPT-4.1 nano
Context:
1.0M
Input:
$0.1/M
Output:
$0.2/M
GPT-4.1 nano is an artificial intelligence model provided by OpenAI. It features a larger context window, supporting up to 1 million context tokens, and makes better use of that context through improved long-context understanding. Its knowledge cutoff is June 2024. This model supports a maximum context length of 1,047,576 tokens.
O
GPT-4.1
Context:
1.0M
Input:
$2/M
Output:
$4/M
GPT-4.1 is an artificial intelligence model provided by OpenAI. It features a larger context window, supporting up to 1 million context tokens, and makes better use of that context through improved long-context understanding. Its knowledge cutoff is June 2024. This model supports a maximum context length of 1,047,576 tokens.
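The stated maximum context length can be used as a pre-flight check before sending a large request. The sketch below assumes the 1,047,576-token limit listed above covers prompt and generated tokens combined, which this page does not confirm. The helper name `fits_in_context` and the example token counts are hypothetical.

```python
GPT_4_1_MAX_CONTEXT_TOKENS = 1_047_576  # maximum context length as listed on this page

def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    context_limit: int = GPT_4_1_MAX_CONTEXT_TOKENS) -> bool:
    """Return True if prompt plus reserved output stays within the limit.

    Assumption: the limit applies to input and output tokens combined.
    """
    return prompt_tokens + max_output_tokens <= context_limit

# Hypothetical token counts, for illustration only.
print(fits_in_context(1_000_000, 32_000))  # True
print(fits_in_context(1_040_000, 32_000))  # False
```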
O
GPT-4o mini
Input:
$0.15/M
Output:
$0.6/M
GPT-4o mini is an artificial intelligence model provided by OpenAI.
O
Whisper-1
Input:
$30/M
Output:
$120/M
Speech-to-text transcription and translation.
O
TTS
Input:
$15/M
Output:
$60/M
OpenAI Text-to-Speech
O
Sora 2 Pro
Per Second:
$0.3
Sora 2 Pro is our most advanced and powerful media generation model, capable of generating videos with synchronized audio. It can create detailed, dynamic video clips from natural language or images.
O
Sora 2
Per Second:
$0.1
A powerful video generation model with sound effects that supports the chat format.
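Video models on this page are priced per second of generated output rather than per token. As a rough sketch, assuming cost scales linearly with clip length and using the per-second prices listed above, the two Sora tiers can be compared as follows. The function name `video_cost_usd` and the 8-second clip length are hypothetical.

```python
from decimal import Decimal

# Per-second prices in USD, as listed on this page.
PRICE_PER_SECOND = {
    "Sora 2": Decimal("0.1"),
    "Sora 2 Pro": Decimal("0.3"),
}

def video_cost_usd(model: str, seconds: int) -> Decimal:
    """Cost of a clip, assuming billing is linear in generated seconds."""
    return PRICE_PER_SECOND[model] * seconds

# Hypothetical 8-second clip on each tier.
for name in PRICE_PER_SECOND:
    print(name, video_cost_usd(name, 8))
```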
O
GPT Image 1 mini
Input:
$8/M
Output:
$16/M
A cost-optimized version of GPT Image 1. It is a natively multimodal language model that accepts both text and image input and generates image output.
O
GPT-4o
Input:
$75/M
Output:
$300/M
GPT-4o is OpenAI's most advanced multimodal model, faster and cheaper than GPT-4 Turbo, with stronger visual capabilities. It has a maximum context length of 128,000 tokens and a knowledge cutoff of October 2023. Models in the 1106 series and above support tool_calls and function_call.
O
GPT 4.1 mini
Context:
1.0M
Input:
$0.4/M
Output:
$0.8/M
GPT-4.1 mini is an artificial intelligence model provided by OpenAI. It represents a significant leap in small-model performance, even beating GPT-4o in many benchmarks. It meets or exceeds GPT-4o in intelligence evaluations while reducing latency by nearly half and cost by 83%. This model supports a maximum context length of 1,047,576 tokens.
O
GPT Image 2
Input:
$75/M
Output:
$150/M
O
o4-mini
Input:
$1.1/M
Output:
$4.4/M
o4-mini is an artificial intelligence model provided by OpenAI.
O
O3 Pro
Context:
200K
Input:
$20/M
Output:
$80/M
OpenAI o3‑pro is a “pro” variant of the o3 reasoning model engineered to think longer and deliver the most dependable responses. It employs private chain‑of‑thought reinforcement learning, sets new state‑of‑the‑art results on benchmarks across domains like science, programming, and business, and autonomously integrates tools such as web search, file analysis, Python execution, and visual reasoning within the API.
O
o3-mini
Input:
$1.1/M
Output:
$4.4/M
o3-mini is an artificial intelligence model provided by OpenAI.
O
o3
Input:
$2/M
Output:
$8/M
o3 is an artificial intelligence model provided by OpenAI.
O
GPT-4o mini Audio
Input:
$0.15/M
Output:
$0.6/M
GPT-4o mini Audio is a multimodal model for speech and text interactions. It performs speech recognition, translation, and text-to-speech, follows instructions, and can call tools for structured actions with streaming responses. Typical uses include real-time voice assistants, live captioning and translation, call summarization, and voice-controlled applications. Technical highlights include audio input and output, streaming responses, function calling, and structured JSON output.
O
codex-mini-latest
Input:
$1.5/M
Output:
$6/M
Codex Mini is an artificial intelligence model provided by OpenAI. It is OpenAI's latest achievement in code generation, a lightweight model specifically optimized for the Codex command-line interface (CLI). As a fine-tuned version of o4-mini, this model inherits the base model's high efficiency and response speed while being specially optimized for code understanding and generation.
O
GPT-4o Audio Preview
Input:
$75/M
Output:
$300/M
This model supports a maximum context length of 128,000 tokens.
O
GPT-4o mini TTS
Input:
$12/M
Output:
$240/M
GPT-4o mini TTS is a neural text-to-speech model designed for natural, low-latency voice generation in user-facing applications. It converts text to natural-sounding speech with selectable voices, multi-format output, and streaming synthesis for responsive experiences. Typical uses include voice assistants, IVR and contact flows, product read-aloud, and media narration. Technical highlights include API-based streaming and export to common audio formats such as MP3 and WAV.
O
GPT-4o mini Search Preview
Input:
$75/M
Output:
$300/M
GPT-4o mini Search Preview is a compact multimodal model in the GPT-4o family geared toward search-oriented interactions and retrieval workflows. It interprets and reformulates queries, synthesizes concise answers, and can ground responses via external search when integrated through tool/function calling. Typical uses include in-product search assistants, knowledge-base QA, e-commerce discovery, and query understanding for ranking and routing. Technical highlights include text-and-image inputs, instruction following, structured output formats, and tool use integration for RAG pipelines.
O
GPT-4o Transcribe
Input:
$75/M
Output:
$300/M
GPT-4o Transcribe is an audio-to-text model for multilingual, low-latency speech recognition. It supports real-time streaming and batch transcription from common audio formats with punctuation and sentence segmentation. Typical uses include live captions, voice assistant input, meeting notes, and media or call recording transcription. Technical highlights include audio modality support, long-form processing, and APIs suited for interactive and server-side workflows.
O
GPT-4o Search
Input:
$75/M
Output:
$300/M
GPT-4o Search is a GPT-4o-based multimodal model configured for search-augmented reasoning and grounded, current answers. It follows instructions and uses web search tools to retrieve, evaluate, and synthesize external information, with source context when available. Typical uses include research assistance, fact-checking, news and trend monitoring, and answering time-sensitive queries. Technical highlights include tool/function calling for browsing and retrieval, long-context handling, and structured outputs suitable for citations and links.
O
GPT-4o Realtime
Input:
$75/M
Output:
$300/M
The Realtime API allows developers to build low-latency, multimodal experiences, including speech-to-speech functionality. Text and audio processed by the Realtime API are priced separately. This model supports a maximum context length of 128,000 tokens.
O
GPT-4o mini Realtime Preview
Input:
$75/M
Output:
$300/M
GPT-4o mini Realtime Preview is a real-time multimodal model for interactive voice and visual experiences. It handles speech, text, and images with streaming input and output, plus tool/function calling for grounded actions. Typical uses include voice assistants, live call handling, real-time captioning, and visual question answering over camera or screen content. Technical highlights include bidirectional audio, vision understanding, streaming responses, and structured outputs via functions.
O
GPT-4o mini Audio Preview
Input:
$75/M
Output:
$300/M
GPT-4o mini Audio Preview is a compact multimodal model for building conversational audio applications. It supports speech input and output alongside text, enabling speech recognition, speech synthesis, and mixed text-audio dialogs with tool/function calling for structured actions. Typical uses include voice assistants, streaming transcription with summarization, IVR and call-bot workflows, and audio-enabled in-app helpers. Technical highlights include audio I/O, streaming responses, instruction following, and integration via chat and tools APIs.
G
Gemini omni fast
Per Second:
$0.4
Omni is the new model that can create anything from any input — starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge. You can also easily edit your videos through conversation.
O
gpt-5.5
Input:
$75/M
Output:
$600/M
O
111
Per Request:
$20
test
O
FLUX.2
Input:
$75/M
Output:
$300/M
O
tts-1-hd-1106
Input:
$30/M
Output:
$120/M
O
tts-1-hd
Input:
$30/M
Output:
$120/M
O
tts-1-1106
Input:
$15/M
Output:
$60/M
O
tts-1
Input:
$15/M
Output:
$60/M
O
text-embedding-ada-002
Input:
$0.1/M
Output:
$0.4/M
An Ada-based text embedding model optimized for various NLP tasks.
O
text-embedding-3-small
Input:
$0.02/M
Output:
$0.08/M
A small text embedding model for efficient processing.
O
text-embedding-3-large
Input:
$0.13/M
Output:
$0.52/M
A large text embedding model for a wide range of natural language processing tasks.
O
omni-moderation-latest
Per Request:
$0.002
O
omni-moderation-2024-09-26
Per Request:
$0.002
O
o1-pro-all
Input:
$150/M
Output:
$300/M
O
o1-pro-2025-03-19
Input:
$150/M
Output:
$600/M
O
o1-pro
Input:
$150/M
Output:
$600/M
o1-pro is an artificial intelligence model provided by OpenAI.
O
o1-preview-all
Per Request:
$0.2
O
o1-preview-2024-09-12
Input:
$15/M
Output:
$60/M
O
o1-preview
Input:
$15/M
Output:
$60/M
o1-preview is an artificial intelligence model provided by OpenAI.
O
o1-mini-all
Per Request:
$0.1
O
o1-mini-2024-09-12
Input:
$1.1/M
Output:
$4.4/M
O
o1-mini
Input:
$1.1/M
Output:
$4.4/M
o1-mini is an artificial intelligence model provided by OpenAI.
O
o1-all
Per Request:
$0.2
O
o1-2024-12-17
Input:
$15/M
Output:
$60/M
O
o1
Input:
$15/M
Output:
$60/M
o1 is an artificial intelligence model provided by OpenAI.
O
gpt-realtime-mini
Input:
$0.6/M
Output:
$1.2/M
An economical version of the real-time GPT model, capable of responding to audio and text input in real time via WebRTC, WebSocket, or SIP connections.
C
gpt-oss-20b
Input:
$0.1/M
Output:
$0.2/M
gpt-oss-20b is an artificial intelligence model provided by cloudflare-workers-ai.
C
gpt-oss-120b
Input:
$0.2/M
Output:
$0.4/M
gpt-oss-120b is an artificial intelligence model provided by cloudflare-workers-ai.
O
gpt-image-1
Input:
$10/M
Output:
$80/M
An advanced AI model for generating images from text descriptions.
O
gpt-4o-all
Input:
$2.5/M
Output:
$5/M
GPT-4o is OpenAI's most advanced multimodal model, faster and cheaper than GPT-4 Turbo, with stronger visual capabilities. It has a maximum context length of 128,000 tokens and a knowledge cutoff of October 2023. Models in the 1106 series and above support tool_calls and function_call.
O
gpt-4-vision-preview
Input:
$10/M
Output:
$20/M
This model supports a maximum context length of 128,000 tokens.
O
gpt-4-vision
Input:
$10/M
Output:
$20/M
This model supports a maximum context length of 128,000 tokens.
O
gpt-4-v
Per Request:
$0.05
O
gpt-4-turbo-preview
Input:
$10/M
Output:
$30/M
gpt-4-turbo-preview is an upgraded version with stronger code generation capabilities, reduced model "laziness", and fixes for non-English UTF-8 generation issues. This model supports a maximum context length of 128,000 tokens.
O
gpt-4-turbo-2024-04-09
Input:
$10/M
Output:
$30/M
gpt-4-turbo-2024-04-09 is an upgraded version with stronger code generation capabilities, reduced model "laziness", and fixes for non-English UTF-8 generation issues. This model supports a maximum context length of 128,000 tokens.
O
gpt-4-turbo
Input:
$10/M
Output:
$30/M
GPT-4 Turbo is an artificial intelligence model provided by OpenAI.
O
gpt-4-search
Per Request:
$0.05
O
gpt-4-gizmo-*
Input:
$30/M
Output:
$60/M
O
gpt-4-gizmo
Input:
$30/M
Output:
$60/M
O
gpt-4-dalle
Per Request:
$0.05
O
gpt-4-all
Input:
$30/M
Output:
$60/M
A
gpt-4-32k
Input:
$60/M
Output:
$120/M
GPT-4 32K is an artificial intelligence model provided by Azure.
O
gpt-4-1106-preview
Input:
$10/M
Output:
$20/M
O
gpt-4-0613
Input:
$30/M
Output:
$60/M
O
gpt-4-0314
Input:
$30/M
Output:
$60/M
O
gpt-4-0125-preview
Input:
$10/M
Output:
$20/M
O
gpt-4
Input:
$30/M
Output:
$60/M
GPT-4 is an artificial intelligence model provided by OpenAI.
O
gpt-3.5-turbo-0125
Input:
$0.5/M
Output:
$1/M
GPT-3.5 Turbo 0125 is an artificial intelligence model provided by OpenAI. It is an official, high-speed model in the GPT-3.5 series that supports tool_calls. This model supports a maximum context length of 4096 tokens.
O
dall-e-3
Per Request:
$0.02
New version of DALL-E for image generation.
O
dall-e-2
Input:
$10/M
Output:
$40/M
An AI model that generates images from text descriptions.