Estimate and compare API costs across AI models. Enter your expected usage and see what each provider will cost per request and per month.
| Model | Per Request | Daily | Monthly |
|---|---|---|---|
AI API pricing is confusing — different providers charge different rates for input vs output tokens, with varying pricing tiers and caching discounts. This calculator lets you plug in your actual usage pattern and instantly see what it would cost across every major model, sorted from cheapest to most expensive.
AI APIs charge per token, where a token is roughly 0.75 words (or about 4 characters). You pay separately for input tokens (the prompt you send) and output tokens (the model's response). Most providers quote prices per million tokens. Some offer caching discounts, batch pricing, or free tiers — this calculator uses standard on-demand pricing for fair comparison.
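The arithmetic behind the calculator is simple enough to sketch. The rates below are illustrative placeholders (roughly in the range providers quote per million tokens), not live prices — always check the provider's pricing page:

```python
# Illustrative on-demand rates in USD per 1M tokens (not real quotes).
INPUT_PRICE_PER_M = 2.50
OUTPUT_PRICE_PER_M = 10.00

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request: input and output are billed separately."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

def monthly_cost(input_tokens: int, output_tokens: int,
                 requests_per_day: int, days: int = 30) -> float:
    """Scale the per-request cost up to a monthly bill."""
    return request_cost(input_tokens, output_tokens) * requests_per_day * days

# A 1,000-token prompt with a 500-token response at 2,000 requests/day:
per_request = request_cost(1_000, 500)          # $0.0075
per_month = monthly_cost(1_000, 500, 2_000)     # $450.00
```

Note how output tokens dominate at these rates: 500 output tokens cost twice as much as the 1,000-token prompt, which is why trimming responses often saves more than trimming prompts.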
The most expensive model isn't always the best choice. For simple tasks like classification or extraction, cheaper models like GPT-4o-mini or Gemini Flash perform nearly as well at a fraction of the cost. Reserve premium models like Claude Opus or GPT-4.1 for complex reasoning, creative writing, or tasks where quality directly impacts results. Run a quick A/B test with your actual data before committing.
Prompt caching (reusing the same system prompt prefix) can cut input costs by 50-90%. Shorter prompts mean fewer input tokens. Requesting structured output (JSON) often uses fewer output tokens than prose. Batch APIs from OpenAI and Anthropic offer 50% discounts for non-real-time workloads. And many providers offer free tiers for low-volume testing.
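A rough sketch of how caching savings compound, assuming a 90% discount on cached input tokens (the exact discount and cache rules vary by provider):

```python
def effective_input_cost(input_tokens: int, cached_fraction: float,
                         price_per_m: float, cache_discount: float = 0.90) -> float:
    """Input cost in dollars when part of the prompt prefix hits the cache.

    cache_discount is the price reduction on cached tokens
    (0.90 means cached tokens bill at 10% of the normal rate).
    Assumption: discount size varies by provider; 90% is illustrative.
    """
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    return (fresh + cached * (1 - cache_discount)) * price_per_m / 1_000_000

full = effective_input_cost(1_000, 0.0, 2.50)    # no caching: $0.00250
cached = effective_input_cost(1_000, 0.8, 2.50)  # 80% of prompt cached: $0.00070
batch = full * 0.5                               # 50% batch-API discount instead

savings = 1 - cached / full                      # 72% off input costs
```

With 80% of a prompt cached at a 90% discount, input spend drops 72% — and caching stacks with shorter prompts and structured output, since all three shrink the billable token count independently.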