OpenAI ChatGPT API Pricing Calculator - All Text/Chat Models
Calculate OpenAI API costs for all models per token, per request, and per month. Compare 26 text/chat models across Batch, Flex, Standard, and Priority tiers.
Pricing TLDR
- $5 free credits for new users (no credit card required)
- Pay-per-token: GPT-4o Mini ($0.15/$0.60) • GPT-5 ($1.25/$10.00) • O1 ($15/$60) per million tokens (input/output)
- Batch API 50% off • 50-90% prompt caching savings (varies by model)
Official pricing: OpenAI
OpenAI API Cost Calculator - Monthly Pricing
Model | In (per 1M tokens) | Out (per 1M tokens)
GPT-5 (gpt-5) | $1.25 | $10.00
GPT-5 Chat Latest (gpt-5-chat-latest) | $1.25 | $10.00
GPT-5 Codex (gpt-5-codex) | $1.25 | $10.00
GPT-5 Search API (gpt-5-search-api) | $1.25 | $10.00
O3 (o3) | $2.00 | $8.00
GPT-5 Mini (gpt-5-mini) | $0.25 | $2.00
GPT-5 Nano (gpt-5-nano) | $0.05 | $0.40
GPT-4.1 Nano (gpt-4.1-nano) | $0.10 | $0.40
GPT-4o Mini (gpt-4o-mini) | $0.15 | $0.60
GPT-4o Mini Search Preview (gpt-4o-mini-search-preview) | $0.15 | $0.60
GPT-4.1 Mini (gpt-4.1-mini) | $0.40 | $1.60
O1 Mini (o1-mini) | $1.10 | $4.40
O3 Mini (o3-mini) | $1.10 | $4.40
O4 Mini (o4-mini) | $1.10 | $4.40
Codex Mini Latest (codex-mini-latest) | $1.50 | $6.00
GPT-4.1 (gpt-4.1) | $2.00 | $8.00
O4 Mini Deep Research (o4-mini-deep-research) | $2.00 | $8.00
GPT-4o (gpt-4o) | $2.50 | $10.00
GPT-4o Search Preview (gpt-4o-search-preview) | $2.50 | $10.00
Computer Use Preview (computer-use-preview) | $3.00 | $12.00
GPT-4o (2024-05-13) (gpt-4o-2024-05-13) | $5.00 | $15.00
O3 Deep Research (o3-deep-research) | $10.00 | $40.00
GPT-5 Pro (gpt-5-pro) | $15.00 | $120.00
O1 (o1) | $15.00 | $60.00
O3 Pro (o3-pro) | $20.00 | $80.00
O1 Pro (o1-pro) | $150.00 | $600.00
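A monthly cost estimate follows directly from the per-1M-token prices listed above. The sketch below shows one way to do that arithmetic, assuming token counts are given per request and multiplied by the number of API calls per month. The `monthly_cost` helper and the `PRICES` table are hypothetical names for illustration, not part of any OpenAI SDK, and only a handful of models are included.

```python
# Prices in USD per 1M tokens as (input, output), copied from the listing above.
PRICES = {
    "gpt-5": (1.25, 10.00),
    "gpt-5-mini": (0.25, 2.00),
    "gpt-5-nano": (0.05, 0.40),
    "gpt-4o-mini": (0.15, 0.60),
    "o3": (2.00, 8.00),
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int,
                 calls_per_month: int) -> float:
    """Estimate monthly USD cost. Token counts are per request (an assumption)."""
    price_in, price_out = PRICES[model]
    per_request = (input_tokens * price_in + output_tokens * price_out) / 1_000_000
    return per_request * calls_per_month


if __name__ == "__main__":
    # Illustrative workload: 1,000 input + 500 output tokens, 10,000 calls/month.
    for model in PRICES:
        print(f"{model}: ${monthly_cost(model, 1_000, 500, 10_000):.2f}/month")
```

With this workload, GPT-5 comes to about $62.50 per month and GPT-5 Nano to about $2.50, which illustrates how much of the bill is driven by model choice rather than volume.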
About OpenAI API
What is OpenAI API?
The OpenAI API provides access to 26 text/chat models across four main families: GPT-5 (flagship models for coding and agentic tasks), GPT-4.1 (advanced general purpose), GPT-4o (multimodal with vision), and O-series (specialized reasoning for complex problem-solving and STEM). Each family offers multiple sizes (standard, mini, nano, pro) and supports four pricing tiers (Batch, Flex, Standard, Priority) to optimize for your performance and budget requirements.
- Model Families by Use Case: GPT-5 family: Flagship for coding, agents, and production apps (includes Codex for code, Search API for web-augmented tasks). GPT-4.1 family: Cost-effective general purpose. GPT-4o family: Multimodal with vision capabilities. O-series: Deep reasoning for math, science, and multi-step problem solving (includes Deep Research variants).
- Flexible Pricing Tiers: Batch (50% discount, 24hr processing), Flex (low cost, variable latency, select models), Standard (default balanced tier), Priority (2x cost, 2x faster processing). Switch tiers per request to optimize cost vs speed. Not all models available in all tiers.
- Advanced Cost Features: Prompt caching saves 50-90% on repeated content (GPT-5 family: 90%, GPT-4.1 family: 75%, GPT-4o/O-series: 50%). Fine-tuning available for GPT-4.1, GPT-4o, and o4-mini. Function calling for structured outputs. Embeddings for semantic search. Built-in tools: Code Interpreter, File Search, Web Search.
When to Use OpenAI API
Start with GPT-5 Nano or GPT-4o Mini for cost-sensitive applications, upgrade to GPT-5 Mini for balanced performance, and use GPT-5 or O-series models for complex reasoning and agentic workflows. Use Batch tier for non-urgent workloads (50% savings) and Priority for user-facing applications.
Ideal for
- Coding and agentic applications with GPT-5 or GPT-5 Codex
- Complex reasoning and STEM problems with O-series models (O3, O4-mini)
- Chatbots and conversational AI with GPT-5 Mini or GPT-4o Mini
- Multimodal applications (vision) with GPT-4o family
- Document analysis and summarization with GPT-5 Nano or GPT-4.1 Nano + Batch tier
Not ideal for
- Real-time applications requiring <50ms latency (even Priority tier)
- Simple text parsing tasks (regex is cheaper)
- Applications needing guaranteed deterministic outputs
- O-series for simple tasks (reasoning tokens add hidden costs)
OpenAI API Pricing Breakdown
Free Tier
New users receive $5 in free credits with no credit card required. These credits expire after 3 months and are automatically applied to your usage across all models and tiers.
- Sign up at platform.openai.com - no credit card required
- Receive $5 free credits instantly upon registration
- Credits expire after 3 months from grant date
- Works across all models (GPT-5, GPT-4.1, GPT-4o, O-series)
- Generate API key and start building immediately
Pricing Tiers Explained
Batch Tier (50% Discount)
Process non-urgent workloads within 24 hours at half price. Example: GPT-5 drops to $0.625/$5.00 per M tokens (vs $1.25/$10.00 standard). Perfect for data processing, content generation, and analysis tasks that aren't time-sensitive.
Flex Tier (Variable Latency)
Lower cost with variable latency. Available for select models (GPT-5 family, O3, O4-mini). Automatically routes requests during lower-demand periods. Good balance between Batch and Standard for non-critical applications.
Standard Tier (Default)
Default tier with balanced cost and performance. Most common choice for production applications. Consistent latency and throughput. All models available. Auto-scales based on usage tier (increases with spend).
Priority Tier (2x Faster)
Higher cost (typically 2x standard) for guaranteed faster processing. Best for user-facing applications requiring quick responses. Available for select models (GPT-5 family, GPT-4.1 family, GPT-4o family, O3/O4-mini).
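The tier descriptions above can be read as multipliers on the standard price. The following is a rough sketch under two assumptions: Batch is exactly 0.5x, and Priority is exactly 2x (the text only says "typically 2x"). Flex is left out because no discount figure is given for it. `TIER_MULTIPLIER` and `tier_price` are hypothetical names.

```python
# Multipliers relative to the standard tier (assumed exact for illustration).
TIER_MULTIPLIER = {
    "batch": 0.5,     # 50% discount, processed within 24 hours
    "standard": 1.0,  # default tier
    "priority": 2.0,  # "typically 2x standard"; actual rates may differ per model
}


def tier_price(standard_in: float, standard_out: float, tier: str) -> tuple[float, float]:
    """Scale standard per-1M-token prices (input, output) to the given tier."""
    multiplier = TIER_MULTIPLIER[tier]
    return standard_in * multiplier, standard_out * multiplier


if __name__ == "__main__":
    # GPT-5 standard prices: $1.25 in, $10.00 out per 1M tokens.
    print(tier_price(1.25, 10.00, "batch"))     # (0.625, 5.0)
    print(tier_price(1.25, 10.00, "priority"))  # (2.5, 20.0)
```

The Batch result matches the $0.625/$5.00 figure quoted for GPT-5 above. Since not every model is offered in every tier, a real lookup would also need an availability check.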
Cost Optimization Features
Prompt Caching (50-90% Savings)
Cache frequently used prompts, system messages, or documents. Savings vary by model: GPT-5 family (90% off), GPT-4.1 family (75% off), GPT-4o/O-series (50% off). Cache persists 5-10 minutes. Example: GPT-5 standard input $1.25/M → cached $0.125/M (90% savings).
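To see how the caching discount affects a bill, it helps to compute a blended input cost for a given cache hit rate. The sketch below uses the per-family discounts quoted above and assumes that a fixed fraction of input tokens is served from cache and billed at the discounted rate. The names `CACHE_DISCOUNT`, `cached_input_price`, and `blended_input_cost` are illustrative only.

```python
# Cache discounts by model family, as quoted in the text above.
CACHE_DISCOUNT = {
    "gpt-5": 0.90,
    "gpt-4.1": 0.75,
    "gpt-4o": 0.50,
    "o-series": 0.50,
}


def cached_input_price(price_per_m: float, family: str) -> float:
    """Per-1M-token price for input tokens served from cache."""
    return price_per_m * (1.0 - CACHE_DISCOUNT[family])


def blended_input_cost(input_tokens: int, price_per_m: float, family: str,
                       cached_fraction: float) -> float:
    """USD cost of input tokens when `cached_fraction` of them hit the cache."""
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = fresh * price_per_m + cached * cached_input_price(price_per_m, family)
    return total / 1_000_000


if __name__ == "__main__":
    # 1M GPT-5 input tokens at $1.25/M with 80% served from cache.
    cost = blended_input_cost(1_000_000, 1.25, "gpt-5", 0.8)
    print(f"${cost:.3f} instead of $1.250")
```

In this illustration the input cost drops to roughly $0.35 per 1M tokens. Real savings depend on how much of each prompt is actually cache-eligible and on whether requests arrive within the cache lifetime.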
Fine-Tuning
Train custom models on your data for improved performance and potentially lower per-request costs. Training costs vary by model ($1.50-$100/hour). Inference costs 1.5-3x base model rates but can reduce prompt length. Available for GPT-4.1, GPT-4o, and o4-mini models.
Understanding O-Series Reasoning Tokens
O-series models (O1, O3, O4-mini) use 'reasoning tokens' for internal thinking steps that are billed as output tokens but not visible in API responses. These tokens occupy context window space and can significantly increase costs. A 500-token visible response may consume 2000+ total tokens. Monitor carefully and use only for tasks requiring deep reasoning.
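The effect of reasoning tokens on cost can be illustrated with a short calculation. The sketch below assumes the reasoning token count is known (for example, from usage data returned with a response) and simply adds it to the visible output before pricing. The function name and the 1,500-token figure are illustrative assumptions chosen to match the 500-visible / 2000-total example above.

```python
def o_series_output_cost(visible_tokens: int, reasoning_tokens: int,
                         price_out_per_m: float) -> tuple[int, float]:
    """Return (billed output tokens, USD cost) including hidden reasoning tokens."""
    billed = visible_tokens + reasoning_tokens
    return billed, billed * price_out_per_m / 1_000_000


if __name__ == "__main__":
    # O3 output price from the listing: $8.00 per 1M tokens.
    billed, cost = o_series_output_cost(500, 1_500, 8.00)
    naive = 500 * 8.00 / 1_000_000  # estimate based on visible tokens only
    print(f"billed tokens: {billed}, cost: ${cost:.4f}, naive estimate: ${naive:.4f}")
```

Here the actual output cost is four times the estimate based on visible tokens alone, which is why budgeting for O-series models from visible output length tends to undershoot.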
Prepaid Credits System
Buy credits upfront with no monthly fees. Credits expire after 12 months. Set auto-reload thresholds to never run out. Volume discounts available for enterprise. No setup fees or subscription costs.
Usage Tracking & Limits
Real-time usage monitoring in dashboard. Set spending limits and alerts. Failed requests aren't charged. Rate limits increase automatically with usage tier (based on cumulative spend).
OpenAI API Monthly Cost Estimates
Light Use
$5-30/mo
• Personal projects
• <1K requests/day
• GPT-4o Mini or GPT-5 Nano
Medium Use
$30-150/mo
• Small apps
• 1-5K requests/day
• Mix of Mini models + Batch tier
Heavy Use
$150-1K/mo
• Production apps
• 5-20K requests/day
• Standard tier, mixed models
Enterprise
$1K+/mo
• Large scale
• 20K+ requests/day
• Priority tier, volume discounts
7 OpenAI API Cost Optimization Tips
Understand O-Series Reasoning Token Costs
O-series models (O1, O3, O4-mini) use 'reasoning tokens' for internal thinking that are billed as output tokens but not returned in the response. This means actual costs can be significantly higher than estimated based on visible output. A response showing 500 output tokens may actually consume 2000+ tokens. Monitor O-series usage carefully and avoid using for simple tasks where GPT-5/GPT-4.1 models would suffice.
Use Prompt Caching for Maximum Savings
Save 50-90% on repeated content by caching frequently used prompts, system messages, or documents. Discount varies by model: GPT-5 family (90% off), GPT-4.1 family (75% off), GPT-4o/O-series (50% off). Cache persists 5-10 minutes. Example: 10K requests with 80% cache hits on GPT-5 saves $900/month. Works best for chatbots with consistent system prompts or RAG applications.
Leverage Batch Tier for Non-Urgent Tasks
Get 50% discount by processing non-urgent workloads within 24 hours using Batch API. Perfect for data processing, content generation, analytics, and background jobs. GPT-5 batch: $0.625/$5.00 per M tokens vs $1.25/$10.00 standard. Can save $1000s/month on high-volume applications.
Start with Smaller Models
Use the cheapest models (GPT-5 Nano at $0.05/$0.40 per M tokens or GPT-4o Mini at $0.15/$0.60) for classification, routing, and simple tasks. Only escalate to expensive models (GPT-5, O-series) when necessary for complex reasoning or generation.
Implement Smart Model Routing
Create cascading escalation: GPT-5 Nano → GPT-5 Mini for moderate complexity → GPT-5 for advanced tasks → O-series only for deep reasoning. Route by task complexity to avoid overpaying. Can reduce costs by 60-80% vs using premium models for all requests.
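The cascade described above could be sketched as a simple lookup plus an escalation step. This is a minimal illustration, not a prescribed design: how a request's complexity gets classified is left out, the `Complexity` levels are hypothetical, and `o3` is an assumed stand-in for "O-series".

```python
from enum import IntEnum
from typing import Optional


class Complexity(IntEnum):
    """Hypothetical complexity levels; classifying a request is out of scope here."""
    SIMPLE = 0
    MODERATE = 1
    ADVANCED = 2
    DEEP_REASONING = 3


# Cheapest to most expensive, following the cascade in the text.
CASCADE = ["gpt-5-nano", "gpt-5-mini", "gpt-5", "o3"]


def pick_model(complexity: Complexity) -> str:
    """Choose the starting model for a task of the given complexity."""
    return CASCADE[int(complexity)]


def escalate(model: str) -> Optional[str]:
    """Return the next model in the cascade, or None if already at the top."""
    index = CASCADE.index(model)
    return CASCADE[index + 1] if index + 1 < len(CASCADE) else None


if __name__ == "__main__":
    model = pick_model(Complexity.SIMPLE)
    print(model)            # gpt-5-nano
    print(escalate(model))  # gpt-5-mini
```

A caller would typically start with `pick_model`, check the answer against some quality criterion of its own, and call `escalate` only when that check fails, so that the expensive models see a small share of traffic.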
Optimize Token Usage
Minimize prompt tokens without sacrificing quality. Use concise system messages. Remove unnecessary examples after fine-tuning. Set max_tokens limits on outputs. Use function calling instead of verbose JSON in prompts. Each 1000-token reduction saves $0.0015-$0.015 per request on standard tier for models priced at $1.50-$15 per 1M tokens; cheaper models save proportionally less per request.
Monitor OpenAI API Usage in Real-Time
Track OpenAI spending per model with CostGoat's token-level visibility. Get instant alerts when switching from Mini to premium models, when prompt caching savings drop unexpectedly, or when batch processing opportunities are missed. Prevent budget overruns before they happen.
OpenAI Model Selection Guide
Use Case | Recommended Model | Monthly Cost (Est.) | Why This Model?
Customer Support Chat | GPT-5 Nano or GPT-4o Mini (Fast & Affordable) | ~$10-50 | Lowest cost for conversational tasks with prompt caching
Code Generation & Review | GPT-5 Codex (Code Specialized) | ~$125-600 | Specialized model with best code understanding
Complex Reasoning & STEM | O3 or O4-mini (Deep Reasoning) | ~$110-800 | Specialized for multi-step reasoning (note: reasoning tokens add cost)
Web-Augmented Tasks | GPT-5 Search API (Search Optimized) | ~$125-500 + search | Built-in web search for current information retrieval
Content Writing | GPT-5 Mini (Balanced) | ~$50-250 | Great quality-to-cost ratio for creative content
Data Extraction | GPT-5 Nano + Batch (Budget-Friendly) | ~$5-60 | Lowest cost with 50% batch discount for structured extraction
Image Understanding | GPT-4o (Multimodal) | ~$100-400 | Native vision capabilities for image analysis
OpenAI API Rate Limits & Usage Tiers
Limits per usage tier, lowest tier first:
GPT-5 Limits | GPT-4o Limits
200K TPM | 200K TPM
2M TPM | 2M TPM
10M TPM | 10M TPM
30M TPM | 30M TPM
100M+ TPM | 100M+ TPM
RPM: Requests Per Minute | TPM: Tokens Per Minute. Limits vary by model and tier. Contact sales for enterprise-level rate limits.
Track Your OpenAI API Costs in Real-Time
Monitor your OpenAI API usage and spending across all models - GPT, DALL-E, Whisper, and more. CostGoat runs on your desktop with privacy-first local monitoring. 7-day free trial, then $9/month.
OpenAI API Pricing FAQ
Common questions about OpenAI API costs, billing, models, and optimization
