Question 1

Is GPT or Claude cheaper?

Accepted Answer

It depends on the selected model and the input-to-output ratio. Smaller GPT-family models can be cheaper for input-heavy, high-volume workloads, while Claude pricing differs widely across the Haiku, Sonnet, and Opus tiers.
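To make the ratio dependence concrete, here is a minimal sketch that compares two of the smaller models at different input-to-output mixes. The prices are the per-million-token figures from this page's snapshot table and may be out of date; the function names `workload_cost` and `cheapest` are illustrative, not part of any provider SDK.

```python
# Prices in USD per 1M tokens, copied from this page's snapshot table.
# Treat them as an assumption and check the providers' current price lists.
PRICES = {
    "GPT-5.4 Mini": {"input": 0.75, "output": 4.50},
    "Claude Haiku 3.5": {"input": 0.80, "output": 4.00},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload, ignoring caching and surcharges."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Return the name of the model with the lowest cost for this token mix."""
    return min(PRICES, key=lambda m: workload_cost(m, input_tokens, output_tokens))


# Input-heavy mix (20:1) versus a balanced mix (1:1).
print(cheapest(20_000_000, 1_000_000))
print(cheapest(1_000_000, 1_000_000))
```

With these snapshot prices the two models break even at roughly 10 input tokens per output token: above that ratio the lower input price matters more, below it the lower output price does.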

Question 2

Why compare per million tokens?

Accepted Answer

Providers publish token pricing per million tokens, which makes it the cleanest unit for comparing models even when actual requests are much smaller.
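Converting a per-million price into a per-request cost is a single division. The sketch below assumes the Claude Sonnet 4 snapshot prices listed on this page ($3 input, $15 output per million tokens); the request size and monthly volume are made-up example values.

```python
def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Return the USD cost of one request given prices per 1M tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical request: 1,200 input tokens and 300 output tokens.
per_request = request_cost(1_200, 300, input_price_per_m=3.0, output_price_per_m=15.0)

# Hypothetical volume: 100,000 such requests per month.
per_month = per_request * 100_000

print(f"per request: ${per_request:.4f}")
print(f"per month:   ${per_month:.2f}")
```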

Question 3

What costs are not included?

Accepted Answer

This comparison excludes taxes, committed-use discounts, priority processing, regional uplifts, tool calls, web search, file search, and long-context premiums unless those are added as explicit JSON rules.

Question 4

Which model is best for RAG cost?

Accepted Answer

RAG cost depends heavily on input tokens and cache hits because retrieved context can be much longer than the user question. Compare input and cached-input pricing first.
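The effect of cache hits on RAG input cost can be sketched as follows. This is a simplified model, not any provider's billing formula: it assumes cached context tokens are billed at the cached-input rate, everything else at the normal input rate, and it ignores cache-write surcharges and output tokens. The prices are the GPT-5.4 snapshot values from this page; the token counts and hit rate are hypothetical.

```python
def rag_input_cost(
    context_tokens: int,
    question_tokens: int,
    cache_hit_rate: float,
    input_price_per_m: float,
    cached_price_per_m: float,
) -> float:
    """Return the USD input cost of one RAG request under a simple cache model."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    # Assumption: only retrieved context can be served from cache.
    cached = context_tokens * cache_hit_rate
    uncached = context_tokens * (1.0 - cache_hit_rate) + question_tokens
    total = uncached * input_price_per_m + cached * cached_price_per_m
    return total / 1_000_000


# Hypothetical request: 8,000 context tokens and a 50-token question.
no_cache = rag_input_cost(8_000, 50, 0.0, input_price_per_m=2.5, cached_price_per_m=0.25)
with_cache = rag_input_cost(8_000, 50, 0.75, input_price_per_m=2.5, cached_price_per_m=0.25)

print(f"no cache hits:  ${no_cache:.6f}")
print(f"75% cache hits: ${with_cache:.6f}")
```

In this example the question itself contributes very little; nearly all of the input cost comes from the retrieved context, which is why the cached-input price matters so much.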

Question 5

Which model is best for agent cost?

Accepted Answer

Agent cost depends on the number of model calls per task, not only on the per-token price. A cheaper model wins only if it completes tasks without many extra calls or retries, while a premium model can cost less overall when it avoids failed loops.
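One way to compare agents is expected cost per completed task rather than cost per call. The sketch below assumes each attempt succeeds independently with a fixed probability and that failed attempts are retried until one succeeds, so the expected number of attempts is 1 / success rate. All numbers are hypothetical and chosen only to illustrate the trade-off.

```python
def cost_per_completed_task(
    cost_per_call: float,
    calls_per_attempt: float,
    success_rate: float,
) -> float:
    """Return the expected USD cost to get one successful task completion."""
    if not 0.0 < success_rate <= 1.0:
        raise ValueError("success_rate must be in (0, 1]")
    # Retrying until success gives 1 / success_rate attempts on average.
    expected_attempts = 1.0 / success_rate
    return cost_per_call * calls_per_attempt * expected_attempts


# Hypothetical budget model: cheap calls, longer loops, frequent failures.
budget = cost_per_completed_task(cost_per_call=0.002, calls_per_attempt=12, success_rate=0.4)

# Hypothetical premium model: pricier calls, shorter loops, rare failures.
premium = cost_per_completed_task(cost_per_call=0.01, calls_per_attempt=5, success_rate=0.95)

print(f"budget:  ${budget:.4f} per completed task")
print(f"premium: ${premium:.4f} per completed task")
```

With these made-up inputs the premium model ends up cheaper per completed task even though each of its calls costs five times as much.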

Model	Input ($/1M tokens)	Cached input ($/1M tokens)	Output ($/1M tokens)	Context window
GPT-5.5	$5	$0.5	$30	Provider dependent
GPT-5.4	$2.5	$0.25	$15	Provider dependent
GPT-5.4 Mini	$0.75	$0.075	$4.5	Provider dependent
Claude Sonnet 4	$3	$0.3	$15	200,000 tokens
Claude Opus 4	$15	$1.5	$75	200,000 tokens
Claude Haiku 3.5	$0.8	$0.08	$4	200,000 tokens
