LLM Price Calculator

Compare LLM API pricing across Claude, GPT, Gemini, and GLM. Calculate per-call costs, set budgets, and estimate multi-turn chain costs with prompt caching.

||
0%
1,000 in + 500 out × 1,000 calls32 models
Provider
Input
31thinking
Anthropic
200K / 128K|In: $5/M · Out: $25/M
Per call
$0.018
Total
$17.50
30thinking
Anthropic
200K / 64K|In: $3/M · Out: $15/M
Per call
$0.010
Total
$10.50
18
Anthropic
200K / 64K|In: $1/M · Out: $5/M
Per call
$0.0035
Total
$3.50
29thinking
OpenAI
1.1M / 128K|In: $2.5/M · Out: $15/M
Per call
$0.010
Total
$10.00
16thinking
OpenAI
400K / 128K|In: $0.75/M · Out: $4.5/M
Per call
$0.0030
Total
$3.00
7thinking
OpenAI
400K / 128K|In: $0.2/M · Out: $1.25/M
Per call
$0.0008
Total
$0.825
28thinking
OpenAI
400K / 128K|In: $1.75/M · Out: $14/M
Per call
$0.0088
Total
$8.75
22thinking
OpenAI
400K / 128K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
23thinking
OpenAI
400K / 128K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
10thinking
OpenAI
400K / 128K|In: $0.25/M · Out: $2/M
Per call
$0.0013
Total
$1.25
1thinking
OpenAI
400K / 128K|In: $0.05/M · Out: $0.4/M
Per call
$0.0003
Total
$0.250
20
OpenAI
1M / 32K|In: $2/M · Out: $8/M
Per call
$0.0060
Total
$6.00
9
OpenAI
1M / 32K|In: $0.4/M · Out: $1.6/M
Per call
$0.0012
Total
$1.20
3
OpenAI
1M / 32K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
26
OpenAI
128K / 16K|In: $2.5/M · Out: $10/M
Per call
$0.0075
Total
$7.50
6
OpenAI
128K / 16K|In: $0.15/M · Out: $0.6/M
Per call
$0.0005
Total
$0.450
32thinking
OpenAI
200K / 100K|In: $20/M · Out: $80/M
Per call
$0.060
Total
$60.00
21thinking
OpenAI
200K / 100K|In: $2/M · Out: $8/M
Per call
$0.0060
Total
$6.00
17thinking
OpenAI
200K / 100K|In: $1.1/M · Out: $4.4/M
Per call
$0.0033
Total
$3.30
27
Google
1M / 65K|In: $2/M · Out: $12/M
Per call
$0.0080
Total
$8.00
8
Google
1M / 65K|In: $0.25/M · Out: $1.5/M
Per call
$0.0010
Total
$1.00
14thinking
Google
1M / 65K|In: $0.5/M · Out: $3/M
Per call
$0.0020
Total
$2.00
24thinking
Google
1M / 65K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
11thinking
Google
1M / 65K|In: $0.3/M · Out: $2.5/M
Per call
$0.0016
Total
$1.55
4
Google
1M / 65K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
5
Google
1M / 8.2K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
15
Zhipu AI
200K / 128K|In: $1/M · Out: $3.2/M
Per call
$0.0026
Total
$2.60
19
Zhipu AI
200K / 128K|In: $1.2/M · Out: $5/M
Per call
$0.0037
Total
$3.70
12
Zhipu AI
200K / 128K|In: $0.6/M · Out: $2.2/M
Per call
$0.0017
Total
$1.70
2
Zhipu AI
200K / 128K|In: $0.07/M · Out: $0.4/M
Per call
$0.0003
Total
$0.270
13
Zhipu AI
128K / 128K|In: $0.6/M · Out: $2.2/M
Per call
$0.0017
Total
$1.70
25
Zhipu AI
128K / 128K|In: $2.2/M · Out: $8.9/M
Per call
$0.0067
Total
$6.65
Prices per million tokens. Last updated March 2026.

What this calculator does

This calculator compares API pricing across 30+ large language models from Anthropic, OpenAI, Google, and Zhipu AI. It covers input tokens, output tokens, prompt caching discounts, and reasoning token costs - all the variables that affect your actual bill.

There are three modes. Calculate cost shows per-call and total costs for a given workload, with presets for common scenarios. Set budget flips the question - enter a dollar amount and see how many API calls each model can handle. Chain models multi-turn conversations where context accumulates. All settings are saved in the URL for bookmarking and sharing.

How LLM API pricing works

LLM APIs charge per token - roughly 0.75 English words each. Type something below to see tokens in action.

~16 tokens
9 words
GPT-5-mini
<$0.0001
Claude Haiku 4.5
<$0.0001
Claude Sonnet 4.6
<$0.0001
Claude Opus 4.6
<$0.0001

Input vs. output cost

Input and output tokens are priced separately. Output typically costs 3-5x more - a task generating long responses is significantly more expensive than one processing long inputs.

GPT-5-mini$0.0012
Claude Haiku 4.5$0.0090
Claude Sonnet 4.6$0.0270
Claude Opus 4.6$0.0450
Input Output

Prompt caching

If your application sends the same prefix on every call (system prompts, few-shot examples), caching lets you reuse it at 75-90% off. It's the single biggest cost lever for most applications.

GPT-5-mini
$0.0004
$0.0006
-40%
Claude Haiku 4.5
$0.0011
$0.0040
-72%
Claude Sonnet 4.6
$0.0034
$0.0120
-72%
Claude Opus 4.6
$0.0056
$0.0200
-72%

Based on 4K input tokens per call. Cached tokens use each provider's discounted rate.

Multi-turn conversations

Each API call sends the full conversation history as input, so costs grow with every turn. This is why a 10-turn conversation costs more than 10 independent calls.

1
1.5K
2
2.5K
3
3.5K
4
4.5K
5
5.5K
$0.09005 turns with Claude Sonnet 4.61.5x vs independent calls

Pricing is pulled from OpenRouter across 30+ models from Anthropic, OpenAI, Google, and Zhipu AI, and updated regularly. All calculator settings save to the URL - bookmark a comparison or share it with your team.