NEW: Real-Time Usage Tracking for AI Agents — track Claude Code, Kimi, Codex & more. Try it free →

CostGoat Logo

CostGoat

Try For Free
LAST UPDATED: FEBRUARY 14, 2026

LLM API Pricing Comparison

Compare pricing across 289+ LLM APIs from OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI, and more. Sorted by quality, price, or value score.

ComparisonValue RankingsPricing GuideSave MoneyFAQ

Pricing TLDR

  • Budget models from $0.07/M input tokens — premium models up to $75/M output tokens
  • Quality scores from 0-100 based on independent benchmarks (Theozard)
  • Value score = quality per dollar of output cost — find the best bang for your buck

Official pricing:

OpenRouter API (live pricing)

Quality Scores: Theozard

LLM API Cost Comparison — Monthly Pricing

Calculate by

Input Tokens

Output Tokens

API Calls / Month

Quick Examples:

Sort:

(anthropic/claude-opus-4.6)

Context

1.0M

Quality

100

Per 1M Tokens

In: $5.00

Out: $25.00

Value

4.0

Monthly Cost

$17.50

(openai/gpt-5.2-chat)

Context

128K

Quality

96

Per 1M Tokens

In: $1.75

Out: $14.00

Value

6.9

Monthly Cost

$8.75

(openai/gpt-5.2)

Context

400K

Quality

96

Per 1M Tokens

In: $1.75

Out: $14.00

Value

6.9

Monthly Cost

$8.75

(openai/gpt-5.2-pro)

Context

400K

Quality

96

Per 1M Tokens

In: $21.00

Out: $168.00

Value

0.6

Monthly Cost

$105.00

(z-ai/glm-5)

Context

203K

Quality

94

Per 1M Tokens

In: $0.80

Out: $2.56

Value

36.7

Monthly Cost

$2.08

(anthropic/claude-opus-4.5)

Context

200K

Quality

94

Per 1M Tokens

In: $5.00

Out: $25.00

Value

3.8

Monthly Cost

$17.50

(openai/gpt-5.2-codex)

Context

400K

Quality

92

Per 1M Tokens

In: $1.75

Out: $14.00

Value

6.6

Monthly Cost

$8.75

(openai/gpt-5.1)

Context

400K

Quality

91

Per 1M Tokens

In: $1.25

Out: $10.00

Value

9.1

Monthly Cost

$6.25

(openai/gpt-5.1-chat)

Context

128K

Quality

91

Per 1M Tokens

In: $1.25

Out: $10.00

Value

9.1

Monthly Cost

$6.25

(google/gemini-3-pro-image-preview)

Context

66K

Quality

91

Per 1M Tokens

In: $2.00

Out: $12.00

Value

7.6

Monthly Cost

$8.00

(google/gemini-3-pro-preview)

Context

1.0M

Quality

91

Per 1M Tokens

In: $2.00

Out: $12.00

Value

7.6

Monthly Cost

$8.00

(moonshotai/kimi-k2.5)

Context

262K

Quality

89

Per 1M Tokens

In: $0.45

Out: $2.25

Value

39.6

Monthly Cost

$1.58

(google/gemini-3-flash-preview)

Context

1.0M

Quality

87

Per 1M Tokens

In: $0.50

Out: $3.00

Value

29.0

Monthly Cost

$2.00

(openai/gpt-5-codex)

Context

400K

Quality

85

Per 1M Tokens

In: $1.25

Out: $10.00

Value

8.5

Monthly Cost

$6.25

(openai/gpt-5-chat)

Context

128K

Quality

85

Per 1M Tokens

In: $1.25

Out: $10.00

Value

8.5

Monthly Cost

$6.25

(openai/gpt-5)

Context

400K

Quality

85

Per 1M Tokens

In: $1.25

Out: $10.00

Value

8.5

Monthly Cost

$6.25

(openai/gpt-5-pro)

Context

400K

Quality

85

Per 1M Tokens

In: $15.00

Out: $120.00

Value

0.7

Monthly Cost

$75.00

(anthropic/claude-sonnet-4.5)

Context

1.0M

Quality

81

Per 1M Tokens

In: $3.00

Out: $15.00

Value

5.4

Monthly Cost

$10.50

(deepseek/deepseek-v3.2)

Context

164K

Quality

79

Per 1M Tokens

In: $0.25

Out: $0.38

Value

207.9

Monthly Cost

$0.44

(minimax/minimax-m2.5)

Context

205K

Quality

79

Per 1M Tokens

In: $0.30

Out: $1.20

Value

65.8

Monthly Cost

$0.90

(z-ai/glm-4.7)

Context

203K

Quality

79

Per 1M Tokens

In: $0.40

Out: $1.50

Value

52.7

Monthly Cost

$1.15

(openai/gpt-5.1-codex-max)

Context

400K

Quality

79

Per 1M Tokens

In: $1.25

Out: $10.00

Value

7.9

Monthly Cost

$6.25

(openai/gpt-5.1-codex)

Context

400K

Quality

79

Per 1M Tokens

In: $1.25

Out: $10.00

Value

7.9

Monthly Cost

$6.25

(xiaomi/mimo-v2-flash)

Context

262K

Quality

77

Per 1M Tokens

In: $0.09

Out: $0.29

Value

265.5

Monthly Cost

$0.24

(moonshotai/kimi-k2-thinking)

Context

262K

Quality

77

Per 1M Tokens

In: $0.40

Out: $1.75

Value

44.0

Monthly Cost

$1.28

(openai/gpt-5-mini)

Context

400K

Quality

77

Per 1M Tokens

In: $0.25

Out: $2.00

Value

38.5

Monthly Cost

$1.25

(x-ai/grok-4)

Context

256K

Quality

77

Per 1M Tokens

In: $3.00

Out: $15.00

Value

5.1

Monthly Cost

$10.50

(openai/o3-pro)

Context

200K

Quality

77

Per 1M Tokens

In: $20.00

Out: $80.00

Value

1.0

Monthly Cost

$60.00

(minimax/minimax-m2.1)

Context

197K

Quality

75

Per 1M Tokens

In: $0.27

Out: $0.95

Value

79.0

Monthly Cost

$0.75

(qwen/qwen3-max-thinking)

Context

262K

Quality

75

Per 1M Tokens

In: $1.20

Out: $6.00

Value

12.5

Monthly Cost

$4.20
AI credit balance monitoring for OpenAI, Anthropic, Elevenlabs, and OpenRouter services

Tired of manually checking your API credits?

Monitor your credit balance and spending in real-time. Get alerts before you run out.

Privacy-first desktop app. No sign-up required.

Try free for 7 daysLearn more →

Best Value LLM APIs — Quality Per Dollar

Value score = quality points per $1 of output cost (per 1M tokens). Higher is better. These models deliver the most capability per dollar spent.

#

1

Model

Meta: Llama 3.2 3B Instruct

Provider

Meta

Quality

19

Output / 1M

$0.02

Value Score

950.0

#

2

Model

LiquidAI: LFM2-2.6B

Provider

Liquid

Quality

15

Output / 1M

$0.02

Value Score

750.0

#

3

Model

LiquidAI: LFM2-8B-A1B

Provider

Liquid

Quality

13

Output / 1M

$0.02

Value Score

650.0

#

4

Model

Qwen: Qwen3 235B A22B Instruct 2507

Provider

Qwen

Quality

55

Output / 1M

$0.10

Value Score

550.0

#

5

Model

Meta: Llama 3.1 8B Instruct

Provider

Meta

Quality

23

Output / 1M

$0.05

Value Score

460.0

#

6

Model

Meta: Llama 3.2 11B Vision Instruct

Provider

Meta

Quality

21

Output / 1M

$0.05

Value Score

428.6

#

7

Model

Meta: Llama 3 8B Instruct

Provider

Meta

Quality

17

Output / 1M

$0.04

Value Score

425.0

#

8

Model

OpenAI: gpt-oss-120b

Provider

OpenAI

Quality

62

Output / 1M

$0.19

Value Score

326.3

#

9

Model

OpenAI: gpt-oss-20b

Provider

OpenAI

Quality

45

Output / 1M

$0.14

Value Score

321.4

#

10

Model

Mistral: Mistral Small 3

Provider

Mistral

Quality

25

Output / 1M

$0.08

Value Score

312.5

About LLM API Pricing

What is LLM API Pricing?

LLM APIs let you integrate large language models into your applications via HTTP requests. Every major AI provider — OpenAI, Anthropic, Google, DeepSeek, Mistral, xAI — offers API access to their models with per-token pricing. You pay separately for input tokens (your prompts) and output tokens (model responses), quoted per million tokens.

  • Input vs Output Token Pricing: Input tokens (prompts, context) are cheaper because they only need to be processed once. Output tokens (completions) cost 2-5x more because each token requires a full forward pass through the model. Optimizing prompt length has the biggest impact on cost.
  • Quality-Price Tradeoff: More expensive models generally deliver higher quality responses. Our quality scores (0-100) let you compare: Claude Opus 4.6 scores 100 at $25/1M output, while DeepSeek V3.2 scores 79 at $0.28/1M. The right choice depends on your quality requirements.
  • Context Window Costs: Larger context windows let you send more data per request but increase token costs. A 200K context model processing long documents costs proportionally more in input tokens than a short chatbot interaction. Choose context size based on your actual needs.

When to Use LLM API Pricing

Different use cases call for different models. Match your quality requirements to your budget using the value score to find the optimal model.

Ideal for

  • Chatbots and conversational AI — mid-tier models like Sonnet or GPT-4.1 offer the best quality/cost balance
  • Code generation — specialized models like DeepSeek Coder or Codex variants optimize for code tasks
  • Bulk content processing — budget models like Gemini Flash or DeepSeek V3 handle volume at minimal cost
  • Complex reasoning tasks — premium models like Opus 4.6 or GPT-5 justify their cost for hard problems
  • Prototyping — free tier models let you build without spending anything

Not ideal for

  • Real-time applications needing sub-100ms latency (consider edge-deployed models)
  • Tasks that don't need language understanding (use traditional algorithms instead)
  • Processing sensitive data with compliance requirements (check each provider's data policies)

LLM API Monthly Cost Estimates

Hobby / Prototyping

$0-10/mo

Free tier models

< 1K requests/day

Testing & development

Startup / MVP

$50-300/mo

Mid-tier models (Sonnet, GPT-4.1)

5-20K requests/day

Single product

Growth

$300-2,000/mo

Mix of premium & budget models

20-100K requests/day

Multiple use cases

Enterprise

$2,000+/mo

Premium models for quality-critical tasks

100K+ requests/day

Model fallback chains

5 LLM API Cost Optimization Tips

1

Use a Model Cascade

Route easy queries to cheap models (Haiku, Flash, GPT-5 Nano) and only escalate to expensive ones (Opus, GPT-5) when needed. A classifier model can decide the routing. This typically saves 60-80% vs using premium models for everything.

2

Optimize Prompt Length

Input tokens cost money. Strip unnecessary context, use concise system prompts, and avoid sending full documents when a summary suffices. A 50% reduction in prompt length = 50% savings on input costs.

3

Cache Frequent Requests

If you make similar API calls repeatedly, cache responses. Many providers also offer prompt caching features that reduce costs for repeated system prompts. Anthropic's prompt caching can save up to 90% on cached tokens.

4

Compare Value Scores, Not Just Prices

The cheapest model isn't always the best value. A model at $0.50/1M output with quality score 30 delivers less value than one at $2/1M with quality score 70. Use the value score column to find the sweet spot for your needs.

5

Monitor Per-Model Spending

Track costs per model and per use case with CostGoat. Identify which models consume the most budget, find opportunities to downgrade specific workflows, and catch cost spikes early before they become expensive surprises.

AI credit balance monitoring for OpenAI, Anthropic, Elevenlabs, and OpenRouter services

Track Your LLM API Costs in Real-Time

Monitor spending across OpenAI, Anthropic, Google, and other LLM providers. Track credit balances and get alerts when usage spikes.

Privacy-first desktop app. 7-day free trial, no sign-up required.

Try Free for 7 DaysLearn more →

LLM API Pricing FAQ

Common questions about LLM API costs, pricing models, and how to save money

AI Pricing

Gemini API PricingClaude API PricingGoogle Veo PricingAI Cost CalculatorsReplicate API PricingOpenRouter API PricingOpenRouter Free Models
DownloadsPricingDashboardContactAffiliatesTermsPrivacy

© 2026 CostGoat. All rights reserved.

Made by Functioncraft: Redis GUI Client · SSH GUI Client