LAST UPDATED: NOVEMBER 5, 2025

OpenAI ChatGPT API Pricing Calculator - All Text/Chat Models

Calculate OpenAI API costs for all models per token, per request, and per month. Compare 26 text/chat models across Batch, Flex, Standard, and Priority tiers.


Pricing TLDR

  • $5 free credits for new users (no credit card required)
  • Pay-per-token (input/output per 1M tokens): GPT-4o Mini ($0.15/$0.60), GPT-5 ($1.25/$10.00), O1 ($15/$60)
  • Batch API: 50% off. Prompt caching: 50-90% savings (varies by model)

Official pricing: OpenAI

OpenAI API Cost Calculator - Monthly Pricing


| Model | Model ID | Context | Intelligence | Input (per 1M tokens) | Output (per 1M tokens) | Monthly Cost |
|---|---|---|---|---|---|---|
| GPT-5 | gpt-5 | 128K | 68 | $1.25 | $10.00 | $6.25 |
| GPT-5 Chat Latest | gpt-5-chat-latest | 128K | 68 | $1.25 | $10.00 | $6.25 |
| GPT-5 Codex | gpt-5-codex | 128K | 68 | $1.25 | $10.00 | $6.25 |
| GPT-5 Search API | gpt-5-search-api | 128K | 68 | $1.25 | $10.00 | $6.25 |
| O3 | o3 | 200K | 65 | $2.00 | $8.00 | $6.00 |
| GPT-5 Mini | gpt-5-mini | 128K | 64 | $0.25 | $2.00 | $1.25 |
| GPT-5 Nano | gpt-5-nano | 128K | 51 | $0.05 | $0.40 | $0.25 |
| GPT-4.1 Nano | gpt-4.1-nano | 128K | - | $0.10 | $0.40 | $0.30 |
| GPT-4o Mini | gpt-4o-mini | 128K | - | $0.15 | $0.60 | $0.45 |
| GPT-4o Mini Search Preview | gpt-4o-mini-search-preview | 128K | - | $0.15 | $0.60 | $0.45 |
| GPT-4.1 Mini | gpt-4.1-mini | 128K | - | $0.40 | $1.60 | $1.20 |
| O1 Mini | o1-mini | 128K | - | $1.10 | $4.40 | $3.30 |
| O3 Mini | o3-mini | 128K | - | $1.10 | $4.40 | $3.30 |
| O4 Mini | o4-mini | 128K | - | $1.10 | $4.40 | $3.30 |
| Codex Mini Latest | codex-mini-latest | 128K | - | $1.50 | $6.00 | $4.50 |
| GPT-4.1 | gpt-4.1 | 128K | - | $2.00 | $8.00 | $6.00 |
| O4 Mini Deep Research | o4-mini-deep-research | 128K | - | $2.00 | $8.00 | $6.00 |
| GPT-4o | gpt-4o | 128K | - | $2.50 | $10.00 | $7.50 |
| GPT-4o Search Preview | gpt-4o-search-preview | 128K | - | $2.50 | $10.00 | $7.50 |
| Computer Use Preview | computer-use-preview | 128K | - | $3.00 | $12.00 | $9.00 |
| GPT-4o (2024-05-13) | gpt-4o-2024-05-13 | 128K | - | $5.00 | $15.00 | $12.50 |
| O3 Deep Research | o3-deep-research | 200K | - | $10.00 | $40.00 | $30.00 |
| GPT-5 Pro | gpt-5-pro | 128K | - | $15.00 | $120.00 | $75.00 |
| O1 | o1 | 200K | - | $15.00 | $60.00 | $45.00 |
| O3 Pro | o3-pro | 200K | - | $20.00 | $80.00 | $60.00 |
| O1 Pro | o1-pro | 200K | - | $150.00 | $600.00 | $450.00 |
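
The Monthly Cost figures in this listing are consistent with a workload of 1M input tokens and 500K output tokens per month. That workload is inferred from the listed prices rather than stated on the page, so treat it as an assumption. A minimal sketch of the per-token arithmetic, using a handful of the listed prices:

```python
# Prices in USD per 1M tokens as (input, output), copied from the listing.
PRICES_PER_1M = {
    "gpt-5": (1.25, 10.00),
    "o3": (2.00, 8.00),
    "gpt-5-mini": (0.25, 2.00),
    "gpt-4o-mini": (0.15, 0.60),
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the Standard-tier cost in USD for the given monthly token totals."""
    in_price, out_price = PRICES_PER_1M[model]
    return (input_tokens / 1_000_000) * in_price + (output_tokens / 1_000_000) * out_price


# Assumed workload: 1M input tokens and 500K output tokens per month.
for name in PRICES_PER_1M:
    print(f"{name}: ${monthly_cost(name, 1_000_000, 500_000):.2f}")
```

With that assumed workload the sketch gives $6.25 for gpt-5 and $6.00 for o3, matching the listed Monthly Cost values. Swap in your own token totals to estimate a different workload.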

About OpenAI API

What is OpenAI API?

The OpenAI API provides access to 26 text/chat models across four main families: GPT-5 (flagship models for coding and agentic tasks), GPT-4.1 (advanced general purpose), GPT-4o (multimodal with vision), and O-series (specialized reasoning for complex problem-solving and STEM). Each family offers multiple sizes (standard, mini, nano, pro) and supports four pricing tiers (Batch, Flex, Standard, Priority) to optimize for your performance and budget requirements.

  • Model Families by Use Case: GPT-5 family: Flagship for coding, agents, and production apps (includes Codex for code, Search API for web-augmented tasks). GPT-4.1 family: Cost-effective general purpose. GPT-4o family: Multimodal with vision capabilities. O-series: Deep reasoning for math, science, and multi-step problem solving (includes Deep Research variants).
  • Flexible Pricing Tiers: Batch (50% discount, 24hr processing), Flex (low cost, variable latency, select models), Standard (default balanced tier), Priority (2x cost, 2x faster processing). Switch tiers per request to optimize cost vs speed. Not all models available in all tiers.
  • Advanced Cost Features: Prompt caching saves 50-90% on repeated content (GPT-5 family: 90%, GPT-4.1 family: 75%, GPT-4o/O-series: 50%). Fine-tuning available for GPT-4.1, GPT-4o, and o4-mini. Function calling for structured outputs. Embeddings for semantic search. Built-in tools: Code Interpreter, File Search, Web Search.

When to Use OpenAI API

Start with GPT-5 Nano or GPT-4o Mini for cost-sensitive applications, upgrade to GPT-5 Mini for balanced performance, and use GPT-5 or O-series models for complex reasoning and agentic workflows. Use Batch tier for non-urgent workloads (50% savings) and Priority for user-facing applications.

Ideal for

  • Coding and agentic applications with GPT-5 or GPT-5 Codex
  • Complex reasoning and STEM problems with O-series models (O3, O4-mini)
  • Chatbots and conversational AI with GPT-5 Mini or GPT-4o Mini
  • Multimodal applications (vision) with GPT-4o family
  • Document analysis and summarization with GPT-5 Nano or GPT-4.1 Nano + Batch tier

Not ideal for

  • Real-time applications requiring <50ms latency (even Priority tier)
  • Simple text parsing tasks (regex is cheaper)
  • Applications needing guaranteed deterministic outputs
  • O-series for simple tasks (reasoning tokens add hidden costs)

OpenAI API Pricing Breakdown

Free Tier

New users receive $5 in free credits with no credit card required. These credits expire after 3 months and are automatically applied to your usage across all models and tiers.

  • Sign up at platform.openai.com - no credit card required
  • Receive $5 free credits instantly upon registration
  • Credits expire after 3 months from grant date
  • Works across all models (GPT-5, GPT-4.1, GPT-4o, O-series)
  • Generate API key and start building immediately

Pricing Tiers Explained

Batch Tier (50% Discount)

Process non-urgent workloads within 24 hours at half price. Example: GPT-5 drops to $0.625/$5.00 per 1M tokens (input/output), versus $1.25/$10.00 on the Standard tier. Perfect for data processing, content generation, and analysis tasks that aren't time-sensitive.

Flex Tier (Variable Latency)

Lower cost with variable latency. Available for select models (GPT-5 family, O3, O4-mini). Automatically routes requests during lower-demand periods. Good balance between Batch and Standard for non-critical applications.

Standard Tier (Default)

Default tier with balanced cost and performance. Most common choice for production applications. Consistent latency and throughput. All models available. Auto-scales based on usage tier (increases with spend).

Priority Tier (2x Faster)

Higher cost (typically 2x standard) for guaranteed faster processing. Best for user-facing applications requiring quick responses. Available for select models (GPT-5 family, GPT-4.1 family, GPT-4o family, O3/O4-mini).
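
The tier descriptions above can be read as multipliers on the Standard price. The sketch below assumes exactly 0.5x for Batch and 2x for Priority (the text says "typically 2x", so the Priority figure is approximate). Flex is left out because no specific discount is given for it.

```python
# Multipliers relative to the Standard tier.
# Batch (0.5x) is stated in the text; Priority (2.0x) is "typically 2x", so approximate.
# Flex is omitted: the text gives no figure for it.
TIER_MULTIPLIER = {"batch": 0.5, "standard": 1.0, "priority": 2.0}


def tier_price(standard_price_per_1m: float, tier: str) -> float:
    """Scale a Standard-tier per-1M-token price to another tier."""
    return standard_price_per_1m * TIER_MULTIPLIER[tier]


# GPT-5 Standard prices per 1M tokens (input, output).
gpt5_in, gpt5_out = 1.25, 10.00
for tier in TIER_MULTIPLIER:
    print(tier, tier_price(gpt5_in, tier), tier_price(gpt5_out, tier))
```

For GPT-5 this reproduces the Batch example of $0.625/$5.00 per 1M tokens and gives roughly $2.50/$20.00 for Priority under the 2x assumption.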

Cost Optimization Features

Prompt Caching (50-90% Savings)

Cache frequently used prompts, system messages, or documents. Savings vary by model: GPT-5 family (90% off), GPT-4.1 family (75% off), GPT-4o/O-series (50% off). Cache persists 5-10 minutes. Example: GPT-5 standard input $1.25/M → cached $0.125/M (90% savings).
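
To see how the caching discount affects the average input price, the sketch below blends cached and uncached rates. The `hit_rate` parameter (the fraction of input tokens served from cache) is a hypothetical input for illustration; real hit rates depend on your traffic and the cache lifetime.

```python
# Discount applied to cached input tokens, per the figures quoted above.
CACHE_DISCOUNT = {
    "gpt-5": 0.90,
    "gpt-4.1": 0.75,
    "gpt-4o": 0.50,
    "o-series": 0.50,
}


def blended_input_price(base_price_per_1m: float, family: str, hit_rate: float) -> float:
    """Average input price per 1M tokens when `hit_rate` of input tokens hit the cache."""
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    cached_price = base_price_per_1m * (1.0 - CACHE_DISCOUNT[family])
    return hit_rate * cached_price + (1.0 - hit_rate) * base_price_per_1m


# GPT-5 Standard input price is $1.25 per 1M tokens.
print(round(blended_input_price(1.25, "gpt-5", 1.0), 4))  # every input token cached
print(round(blended_input_price(1.25, "gpt-5", 0.8), 4))  # assumed 80% hit rate
```

With every token cached, the GPT-5 input price comes out at the $0.125 per 1M quoted above. At an assumed 80% hit rate the blended price is about $0.35 per 1M.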

Fine-Tuning

Train custom models on your data for improved performance and potentially lower per-request costs. Training costs vary by model ($1.50-$100/hour). Inference costs 1.5-3x base model rates but can reduce prompt length. Available for GPT-4.1, GPT-4o, and o4-mini models.

Understanding O-Series Reasoning Tokens

O-series models (O1, O3, O4-mini) use 'reasoning tokens' for internal thinking steps that are billed as output tokens but not visible in API responses. These tokens occupy context window space and can significantly increase costs. A 500-token visible response may consume 2000+ total tokens. Monitor carefully and use only for tasks requiring deep reasoning.
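
A small sketch of the billing effect described above. The split of 500 visible tokens plus 1,500 reasoning tokens is an assumed example chosen to match the "2000+ total tokens" figure; actual reasoning token counts vary by request.

```python
def billed_output(visible_tokens: int, reasoning_tokens: int, output_price_per_1m: float):
    """Return (billed output tokens, cost in USD) when reasoning tokens bill as output."""
    billed_tokens = visible_tokens + reasoning_tokens
    cost = billed_tokens / 1_000_000 * output_price_per_1m
    return billed_tokens, cost


# O3 output price is $8.00 per 1M tokens.
tokens, cost = billed_output(visible_tokens=500, reasoning_tokens=1500, output_price_per_1m=8.00)
naive_cost = 500 / 1_000_000 * 8.00  # estimate based on visible output only

print(f"billed tokens: {tokens}")
print(f"actual cost:   ${cost:.4f}")
print(f"naive cost:    ${naive_cost:.4f}")
```

In this assumed case the billed cost is four times the estimate based on visible output alone, which is why budgeting from visible tokens underestimates O-series spend.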

Prepaid Credits System

Buy credits upfront with no monthly fees. Credits expire after 12 months. Set auto-reload thresholds to never run out. Volume discounts available for enterprise. No setup fees or subscription costs.

Usage Tracking & Limits

Real-time usage monitoring in dashboard. Set spending limits and alerts. Failed requests aren't charged. Rate limits increase automatically with usage tier (based on cumulative spend).

OpenAI API Monthly Cost Estimates

| Usage Level | Est. Monthly Cost | Typical Workload | Request Volume | Typical Setup |
|---|---|---|---|---|
| Light Use | $5-30/mo | Personal projects | <1K requests/day | GPT-4o Mini or GPT-5 Nano |
| Medium Use | $30-150/mo | Small apps | 1-5K requests/day | Mix of Mini models + Batch tier |
| Heavy Use | $150-1K/mo | Production apps | 5-20K requests/day | Standard tier, mixed models |
| Enterprise | $1K+/mo | Large scale | 20K+ requests/day | Priority tier, volume discounts |
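
The usage levels above can be expressed as a simple lookup by request volume. The source ranges share their endpoints (for example, 5K appears in both Medium and Heavy), so this sketch assumes a boundary value belongs to the higher level.

```python
# (exclusive upper bound in requests/day, usage level, estimated monthly cost)
BANDS = [
    (1_000, "Light Use", "$5-30/mo"),
    (5_000, "Medium Use", "$30-150/mo"),
    (20_000, "Heavy Use", "$150-1K/mo"),
]


def usage_band(requests_per_day: int):
    """Map a daily request volume to the usage level and cost range listed above."""
    if requests_per_day < 0:
        raise ValueError("requests_per_day must be non-negative")
    for upper_bound, level, cost_range in BANDS:
        if requests_per_day < upper_bound:
            return level, cost_range
    return "Enterprise", "$1K+/mo"


for volume in (200, 3_000, 12_000, 50_000):
    print(volume, usage_band(volume))
```

The cost ranges are the page's rough estimates. Actual spend depends on model choice and tokens per request, not on request count alone.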

7 OpenAI API Cost Optimization Tips

1. Understand O-Series Reasoning Token Costs

O-series models (O1, O3, O4-mini) use 'reasoning tokens' for internal thinking that are billed as output tokens but not returned in the response. This means actual costs can be significantly higher than estimated based on visible output. A response showing 500 output tokens may actually consume 2000+ tokens. Monitor O-series usage carefully and avoid using for simple tasks where GPT-5/GPT-4.1 models would suffice.

2. Use Prompt Caching for Maximum Savings

Save 50-90% on repeated content by caching frequently used prompts, system messages, or documents. The discount varies by model: GPT-5 family (90% off), GPT-4.1 family (75% off), GPT-4o/O-series (50% off). The cache persists for 5-10 minutes. Example: 10K requests with 80% cache hits on GPT-5 can save roughly $900/month, depending on how many prompt tokens each request reuses. Caching works best for chatbots with consistent system prompts or RAG applications.

3. Leverage Batch Tier for Non-Urgent Tasks

Get a 50% discount by processing non-urgent workloads within 24 hours using the Batch API. Perfect for data processing, content generation, analytics, and background jobs. GPT-5 batch: $0.625/$5.00 per 1M tokens (input/output) vs $1.25/$10.00 standard. Can save thousands of dollars per month on high-volume applications.

4. Start with Smaller Models

Use the cheapest models (GPT-5 Nano at $0.05/$0.40 per M tokens or GPT-4o Mini at $0.15/$0.60) for classification, routing, and simple tasks. Only escalate to expensive models (GPT-5, O-series) when necessary for complex reasoning or generation.

5. Implement Smart Model Routing

Create cascading escalation: GPT-5 Nano → GPT-5 Mini for moderate complexity → GPT-5 for advanced tasks → O-series only for deep reasoning. Route by task complexity to avoid overpaying. Can reduce costs by 60-80% vs using premium models for all requests.
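
A minimal sketch of the cascade described above. The complexity score in [0, 1] and the threshold values are hypothetical; in practice the score would come from your own classifier or heuristic, and `o3` stands in for "O-series" here.

```python
# Cascade order from the text above; the numeric thresholds are hypothetical.
ROUTES = [
    (0.25, "gpt-5-nano"),   # simple tasks
    (0.60, "gpt-5-mini"),   # moderate complexity
    (0.90, "gpt-5"),        # advanced tasks
]
DEEP_REASONING_MODEL = "o3"  # O-series, reserved for deep reasoning


def route(complexity: float) -> str:
    """Return the cheapest model whose threshold covers the complexity score."""
    if not 0.0 <= complexity <= 1.0:
        raise ValueError("complexity must be between 0 and 1")
    for threshold, model in ROUTES:
        if complexity <= threshold:
            return model
    return DEEP_REASONING_MODEL


for score in (0.1, 0.5, 0.8, 0.95):
    print(score, route(score))
```

Keeping the thresholds in a data table makes them easy to tune against observed quality without touching the routing logic.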

6. Optimize Token Usage

Minimize prompt tokens without sacrificing quality. Use concise system messages. Remove unnecessary examples after fine-tuning. Set max_tokens limits on outputs. Use function calling instead of verbose JSON in prompts. Each 1,000-token reduction saves $0.0015-$0.015 per request on the standard tier, which corresponds to input prices of $1.50-$15.00 per 1M tokens.
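
The per-request saving from trimming a prompt is simple arithmetic on the input price. The sketch below works through it for three Standard-tier input prices from the pricing listing. The volume of 10,000 requests per day is an assumed figure for illustration.

```python
def savings_per_request(tokens_removed: int, input_price_per_1m: float) -> float:
    """USD saved per request by removing `tokens_removed` input tokens."""
    return tokens_removed / 1_000_000 * input_price_per_1m


# Standard-tier input prices per 1M tokens, taken from the pricing listing.
INPUT_PRICES = {"gpt-5-nano": 0.05, "gpt-5": 1.25, "o1": 15.00}

REQUESTS_PER_DAY = 10_000  # assumed volume, for illustration only
for model, price in INPUT_PRICES.items():
    per_request = savings_per_request(1_000, price)
    per_month = per_request * REQUESTS_PER_DAY * 30
    print(f"{model}: ${per_request:.5f}/request, ${per_month:.2f}/month")
```

The saving scales linearly with the input price, so trimming prompts matters most on the premium models.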

7. Monitor OpenAI API Usage in Real-Time

Track OpenAI spending per model with CostGoat's token-level visibility. Get instant alerts when switching from Mini to premium models, when prompt caching savings drop unexpectedly, or when batch processing opportunities are missed. Prevent budget overruns before they happen.

OpenAI Model Selection Guide

| Use Case | Recommended Model | Monthly Cost (Est.) | Why This Model? |
|---|---|---|---|
| Customer Support Chat | GPT-5 Nano or GPT-4o Mini (Fast & Affordable) | ~$10-50 | Lowest cost for conversational tasks with prompt caching |
| Code Generation & Review | GPT-5 Codex (Code Specialized) | ~$125-600 | Specialized model with best code understanding |
| Complex Reasoning & STEM | O3 or O4-mini (Deep Reasoning) | ~$110-800 | Specialized for multi-step reasoning (note: reasoning tokens add cost) |
| Web-Augmented Tasks | GPT-5 Search API (Search Optimized) | ~$125-500 + search | Built-in web search for current information retrieval |
| Content Writing | GPT-5 Mini (Balanced) | ~$50-250 | Great quality-to-cost ratio for creative content |
| Data Extraction | GPT-5 Nano + Batch (Budget-Friendly) | ~$5-60 | Lowest cost with 50% batch discount for structured extraction |
| Image Understanding | GPT-4o (Multimodal) | ~$100-400 | Native vision capabilities for image analysis |

OpenAI API Rate Limits & Usage Tiers

| Usage Tier | Spend Requirement | GPT-5 Limits | GPT-4o Limits |
|---|---|---|---|
| Free Tier | $0 spent | 500 RPM, 200K TPM | 500 RPM, 200K TPM |
| Tier 1 | $100 spent | 2K RPM, 2M TPM | 2K RPM, 2M TPM |
| Tier 2 | $1,000 spent | 5K RPM, 10M TPM | 5K RPM, 10M TPM |
| Tier 3 | $10,000 spent | 10K RPM, 30M TPM | 10K RPM, 30M TPM |
| Tier 4+ | $50,000+ spent | 20K+ RPM, 100M+ TPM | 20K+ RPM, 100M+ TPM |

RPM: Requests Per Minute | TPM: Tokens Per Minute. Limits vary by model and tier. Contact sales for enterprise-level rate limits.
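
To check which usage tier a workload needs, you can compare its peak request and token rates against the limits listed above. The sketch below uses those figures as given. The "Tier 4+" values are listed as lower bounds, so a workload above them returns `None` to signal that it needs a conversation with sales.

```python
from typing import Optional

# (tier name, RPM limit, TPM limit), lowest tier first, from the limits listed above.
# "Tier 4+" is listed as 20K+ RPM and 100M+ TPM, so these are lower bounds.
TIERS = [
    ("Free Tier", 500, 200_000),
    ("Tier 1", 2_000, 2_000_000),
    ("Tier 2", 5_000, 10_000_000),
    ("Tier 3", 10_000, 30_000_000),
    ("Tier 4+", 20_000, 100_000_000),
]


def minimum_tier(peak_rpm: int, peak_tpm: int) -> Optional[str]:
    """Return the lowest tier whose RPM and TPM limits both cover the workload."""
    for name, rpm_limit, tpm_limit in TIERS:
        if peak_rpm <= rpm_limit and peak_tpm <= tpm_limit:
            return name
    return None  # exceeds the listed limits


print(minimum_tier(300, 150_000))
print(minimum_tier(300, 1_000_000))
print(minimum_tier(8_000, 5_000_000))
```

Both limits must be satisfied at once: a workload with few requests but very long prompts can still need a higher tier because of its TPM.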



© 2025 CostGoat. All rights reserved.