LLM Price Calculator

Compare LLM API pricing across Claude, GPT, Gemini, and GLM. Calculate per-call costs, set budgets, and estimate multi-turn chain costs with prompt caching.

||
0%
1,000 in + 500 out × 1,000 calls61 models
Provider
Input
58thinking
Anthropic
1M / 128K|In: $5/M · Out: $25/M
Per call
$0.018
Total
$17.50
59thinking
Anthropic
200K / 128K|In: $5/M · Out: $25/M
Per call
$0.018
Total
$17.50
56thinking
Anthropic
1M / 64K|In: $3/M · Out: $15/M
Per call
$0.010
Total
$10.50
57thinking
Anthropic
200K / 64K|In: $3/M · Out: $15/M
Per call
$0.010
Total
$10.50
36
Anthropic
200K / 64K|In: $1/M · Out: $5/M
Per call
$0.0035
Total
$3.50
60thinking
OpenAI
1.1M / 128K|In: $5/M · Out: $30/M
Per call
$0.020
Total
$20.00
55thinking
OpenAI
1.1M / 128K|In: $2.5/M · Out: $15/M
Per call
$0.010
Total
$10.00
34thinking
OpenAI
400K / 128K|In: $0.75/M · Out: $4.5/M
Per call
$0.0030
Total
$3.00
15thinking
OpenAI
400K / 128K|In: $0.2/M · Out: $1.25/M
Per call
$0.0008
Total
$0.825
53thinking
OpenAI
400K / 128K|In: $1.75/M · Out: $14/M
Per call
$0.0088
Total
$8.75
46thinking
OpenAI
400K / 128K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
47thinking
OpenAI
400K / 128K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
20thinking
OpenAI
400K / 128K|In: $0.25/M · Out: $2/M
Per call
$0.0013
Total
$1.25
3thinking
OpenAI
400K / 128K|In: $0.05/M · Out: $0.4/M
Per call
$0.0003
Total
$0.250
43
OpenAI
1M / 32K|In: $2/M · Out: $8/M
Per call
$0.0060
Total
$6.00
19
OpenAI
1M / 32K|In: $0.4/M · Out: $1.6/M
Per call
$0.0012
Total
$1.20
7
OpenAI
1M / 32K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
50
OpenAI
128K / 16K|In: $2.5/M · Out: $10/M
Per call
$0.0075
Total
$7.50
10
OpenAI
128K / 16K|In: $0.15/M · Out: $0.6/M
Per call
$0.0005
Total
$0.450
61thinking
OpenAI
200K / 100K|In: $20/M · Out: $80/M
Per call
$0.060
Total
$60.00
44thinking
OpenAI
200K / 100K|In: $2/M · Out: $8/M
Per call
$0.0060
Total
$6.00
35thinking
OpenAI
200K / 100K|In: $1.1/M · Out: $4.4/M
Per call
$0.0033
Total
$3.30
45thinking
Google
1M / 65K|In: $1.5/M · Out: $9/M
Per call
$0.0060
Total
$6.00
52
Google
1M / 65K|In: $2/M · Out: $12/M
Per call
$0.0080
Total
$8.00
18
Google
1M / 65K|In: $0.25/M · Out: $1.5/M
Per call
$0.0010
Total
$1.00
27thinking
Google
1M / 65K|In: $0.5/M · Out: $3/M
Per call
$0.0020
Total
$2.00
48thinking
Google
1M / 65K|In: $1.25/M · Out: $10/M
Per call
$0.0063
Total
$6.25
24thinking
Google
1M / 65K|In: $0.3/M · Out: $2.5/M
Per call
$0.0016
Total
$1.55
8
Google
1M / 65K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
9
Google
1M / 8.2K|In: $0.1/M · Out: $0.4/M
Per call
$0.0003
Total
$0.300
16thinking
DeepSeek
1M / 384K|In: $0.43/M · Out: $0.87/M
Per call
$0.0009
Total
$0.865
5thinking
DeepSeek
1M / 384K|In: $0.14/M · Out: $0.28/M
Per call
$0.0003
Total
$0.280
31thinking
xAI
1M / 30K|In: $1.25/M · Out: $2.5/M
Per call
$0.0025
Total
$2.50
32thinking
xAI
2M / 30K|In: $1.25/M · Out: $2.5/M
Per call
$0.0025
Total
$2.50
28
xAI
256K / 256K|In: $1/M · Out: $2/M
Per call
$0.0020
Total
$2.00
42thinking
Mistral
262K / 262K|In: $1.5/M · Out: $7.5/M
Per call
$0.0052
Total
$5.25
21
Mistral
262K / 262K|In: $0.5/M · Out: $1.5/M
Per call
$0.0013
Total
$1.25
22
Mistral
262K / 262K|In: $0.4/M · Out: $2/M
Per call
$0.0014
Total
$1.40
11thinking
Mistral
256K / 256K|In: $0.15/M · Out: $0.6/M
Per call
$0.0005
Total
$0.450
40thinking
Mistral
128K / 16K|In: $2/M · Out: $5/M
Per call
$0.0045
Total
$4.50
23
Mistral
262K / 262K|In: $0.4/M · Out: $2/M
Per call
$0.0014
Total
$1.40
13
Mistral
256K / 4K|In: $0.3/M · Out: $0.9/M
Per call
$0.0008
Total
$0.750
12
Meta
1M / 16K|In: $0.15/M · Out: $0.6/M
Per call
$0.0005
Total
$0.450
2
Meta
328K / 16K|In: $0.08/M · Out: $0.3/M
Per call
$0.0002
Total
$0.230
41thinking
Qwen
262K / 65K|In: $1.3/M · Out: $7.8/M
Per call
$0.0052
Total
$5.20
39
Qwen
262K / 65K|In: $1.2/M · Out: $6/M
Per call
$0.0042
Total
$4.20
37
Qwen
1M / 65K|In: $1/M · Out: $5/M
Per call
$0.0035
Total
$3.50
29thinking
Qwen
1M / 65K|In: $0.5/M · Out: $3/M
Per call
$0.0020
Total
$2.00
14thinking
Qwen
1M / 65K|In: $0.19/M · Out: $1.12/M
Per call
$0.0008
Total
$0.750
17thinking
Xiaomi
1M / 131K|In: $0.435/M · Out: $0.87/M
Per call
$0.0009
Total
$0.870
6thinking
Xiaomi
1M / 131K|In: $0.14/M · Out: $0.28/M
Per call
$0.0003
Total
$0.280
54
Amazon
1M / 32K|In: $2.5/M · Out: $12.5/M
Per call
$0.0088
Total
$8.75
30
Amazon
300K / 8K|In: $0.8/M · Out: $3.2/M
Per call
$0.0024
Total
$2.40
1
Amazon
300K / 8K|In: $0.06/M · Out: $0.24/M
Per call
$0.0002
Total
$0.180
51
Cohere
256K / 8K|In: $2.5/M · Out: $10/M
Per call
$0.0075
Total
$7.50
33
Zhipu AI
200K / 128K|In: $1/M · Out: $3.2/M
Per call
$0.0026
Total
$2.60
38
Zhipu AI
200K / 128K|In: $1.2/M · Out: $5/M
Per call
$0.0037
Total
$3.70
25
Zhipu AI
200K / 128K|In: $0.6/M · Out: $2.2/M
Per call
$0.0017
Total
$1.70
4
Zhipu AI
200K / 128K|In: $0.07/M · Out: $0.4/M
Per call
$0.0003
Total
$0.270
26
Zhipu AI
128K / 128K|In: $0.6/M · Out: $2.2/M
Per call
$0.0017
Total
$1.70
49
Zhipu AI
128K / 128K|In: $2.2/M · Out: $8.9/M
Per call
$0.0067
Total
$6.65
Prices per million tokens. Last updated March 2026.

What this calculator does

This calculator compares API pricing across 30+ large language models from Anthropic, OpenAI, Google, and Zhipu AI. It covers input tokens, output tokens, prompt caching discounts, and reasoning token costs - all the variables that affect your actual bill.

There are three modes. Calculate cost shows per-call and total costs for a given workload, with presets for common scenarios. Set budget flips the question - enter a dollar amount and see how many API calls each model can handle. Chain models multi-turn conversations where context accumulates. All settings are saved in the URL for bookmarking and sharing.

How LLM API pricing works

LLM APIs charge per token - roughly 0.75 English words each. Type something below to see tokens in action.

~16 tokens
9 words
GPT-5-mini
<$0.0001
Claude Haiku 4.5
<$0.0001
Claude Sonnet 4.6
<$0.0001
Claude Opus 4.6
<$0.0001

Input vs. output cost

Input and output tokens are priced separately. Output typically costs 3-5x more - a task generating long responses is significantly more expensive than one processing long inputs.

GPT-5-mini$0.0012
Claude Haiku 4.5$0.0090
Claude Sonnet 4.6$0.0270
Claude Opus 4.6$0.0450
Input Output

Prompt caching

If your application sends the same prefix on every call (system prompts, few-shot examples), caching lets you reuse it at 75-90% off. It's the single biggest cost lever for most applications.

GPT-5-mini
$0.0004
$0.0006
-40%
Claude Haiku 4.5
$0.0011
$0.0040
-72%
Claude Sonnet 4.6
$0.0034
$0.0120
-72%
Claude Opus 4.6
$0.0056
$0.0200
-72%

Based on 4K input tokens per call. Cached tokens use each provider's discounted rate.

Multi-turn conversations

Each API call sends the full conversation history as input, so costs grow with every turn. This is why a 10-turn conversation costs more than 10 independent calls.

1
1.5K
2
2.5K
3
3.5K
4
4.5K
5
5.5K
$0.09005 turns with Claude Sonnet 4.61.5x vs independent calls

Pricing is pulled from OpenRouter across 30+ models from Anthropic, OpenAI, Google, and Zhipu AI, and updated regularly. All calculator settings save to the URL - bookmark a comparison or share it with your team.