LLM Token Counter & API Cost Calculator
Paste your prompt or completion text to count tokens and estimate API costs across Claude, GPT, and Gemini models.
About This Tool
If you have ever shipped a feature that calls an LLM API and then watched your invoice triple overnight, you already know why token counting matters. A single misplaced system prompt or an untruncated document dump can turn a $50/month prototype into a $500 surprise. This tool lets you catch those problems before they hit production.
Paste any text — a system prompt, a few-shot example block, a full document — and the counter breaks down token counts and estimated costs for multiple models side by side. Supported providers include Claude (Opus, Sonnet, Haiku), GPT-4o, GPT-4o mini, and Gemini 1.5 Pro / Flash. Input and output prices are shown separately so you can model real request/response pairs accurately.
A quick note on accuracy: this tool uses a character-based heuristic to approximate token counts rather than running each provider's exact tokenizer. For standard English text the estimate is typically within 10-15% of the actual count. CJK text (Chinese, Japanese, Korean) tends to tokenize less efficiently, so the tool applies a higher token-per-character ratio for those scripts. The numbers are good enough for budgeting and comparison, but if you need exact counts for billing reconciliation, use an official tokenizer library such as tiktoken for OpenAI, or Anthropic's token counting API.
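To make the heuristic concrete, here is a minimal sketch of how a character-based estimator with a CJK adjustment might work. The ratios (4 Latin characters per token, 1.5 tokens per CJK character) and the exact CJK ranges are illustrative assumptions, not the tool's actual constants.

```python
def estimate_tokens(text: str) -> int:
    """Estimate token count from character counts (illustrative ratios)."""
    cjk = 0
    other = 0
    for ch in text:
        code = ord(ch)
        # Rough CJK ranges: unified ideographs, hiragana/katakana, hangul
        if (0x4E00 <= code <= 0x9FFF or 0x3040 <= code <= 0x30FF
                or 0xAC00 <= code <= 0xD7AF):
            cjk += 1
        else:
            other += 1
    # Assumed ratios: ~4 Latin chars per token, ~1.5 tokens per CJK char
    return round(other / 4 + cjk * 1.5)

print(estimate_tokens("Hello, world!"))  # 13 chars -> 3
print(estimate_tokens("你好"))            # 2 CJK chars -> 3
```

A real implementation would tune the ratios per provider and handle mixed scripts more carefully, but the shape is the same: count characters by class, divide by an empirical ratio.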
You can pair this with the Word Counter to track both human-readable length and machine cost at the same time. If you are building prompts that include structured data, the JSON Formatter can help you minify payloads before pasting — fewer characters usually mean fewer tokens. For checking raw byte length of your prompts, the String Length Calculator is useful when you are close to context window limits.
Some practical tips for keeping costs down: enable prompt caching if your provider supports it — Anthropic and OpenAI both offer cached prompt reads at up to 90% off the standard input price, which adds up fast when your system prompt stays the same across requests. For non-latency-sensitive workloads, both providers offer batch APIs at roughly 50% off standard pricing. Beyond that, strip unnecessary whitespace from context documents and pick the cheapest model that meets your quality bar. Haiku and GPT-4o mini handle classification and extraction tasks well at a fraction of the cost of larger models. Keep an eye on each model's context window limit too — Claude supports up to 200K tokens, GPT-4o up to 128K, and Gemini up to 1M — exceeding these limits means you need to truncate or summarize input.
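The caching and batch discounts compound quickly. Here is a minimal sketch of the arithmetic, using made-up prices (USD per million tokens) — check your provider's pricing page for real rates.

```python
INPUT_PRICE = 3.00           # hypothetical standard input price per 1M tokens
CACHED_READ_DISCOUNT = 0.90  # "up to 90% off" cached prompt reads
BATCH_DISCOUNT = 0.50        # "roughly 50% off" batch pricing

def request_cost(prompt_tokens: int, cached_tokens: int = 0,
                 batch: bool = False) -> float:
    """Input-side cost of one request, in USD (illustrative prices)."""
    price = INPUT_PRICE * (1 - BATCH_DISCOUNT) if batch else INPUT_PRICE
    fresh = prompt_tokens - cached_tokens
    cached_price = price * (1 - CACHED_READ_DISCOUNT)
    return (fresh * price + cached_tokens * cached_price) / 1_000_000

# 10K-token prompt where an 8K system prompt is served from cache:
print(round(request_cost(10_000, cached_tokens=8_000), 4))  # 0.0084
# Same prompt, no cache: 0.03 -- caching cuts this request by ~72%
print(round(request_cost(10_000), 4))
```

With a stable system prompt making up most of each request, the cached rate dominates, which is why caching pays off fastest for long, repeated prefixes.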
All processing runs entirely in your browser. Your text is never sent to any server — no API calls, no logging, no telemetry. Close the tab and the data is gone.
How to Use
- Paste or type your text into the Input area. This can be a system prompt, user message, document, or any content you plan to send to an LLM.
- The token count and cost estimates update instantly as you type. No button press needed.
- Review the model comparison table to see estimated token counts and per-request costs for each provider.
- Toggle between Input and Output pricing modes to estimate costs for prompts vs. completions (output tokens are more expensive on most models).
- Adjust the token count slider or type a number manually if you want to model a specific response length for the output side.
- Click Copy or press Ctrl+Shift+C to copy the cost breakdown to your clipboard for sharing or documentation.
- Use the Clear button to reset the input and start a new estimate.
FAQ
Is my data safe?
Yes. Token counting is done entirely in your browser using a character-based estimation algorithm. No API calls are made to OpenAI, Anthropic, Google, or any other service. Your text never leaves your machine. You can verify this by opening your browser's DevTools Network tab — you will see zero outbound requests while using the tool.
How accurate is the token count?
The tool uses a character-to-token heuristic rather than running each provider's actual tokenizer (like tiktoken for OpenAI). For standard English prose, estimates are typically within 10-15% of the real count. Code, URLs, or JSON-heavy content may deviate more — up to 20-30% in edge cases — because special characters and punctuation tokenize unpredictably. The estimates are reliable for cost budgeting and model comparison, but not suitable for exact billing reconciliation.
Why do different models show different token counts for the same text?
Each LLM provider uses a different tokenizer with its own vocabulary. GPT-4o uses the o200k_base tokenizer, Claude uses its own proprietary tokenizer, and Gemini uses SentencePiece. A word that is one token in one system might be split into two tokens in another. This tool applies provider-specific ratios to approximate those differences.
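The provider-specific ratios mentioned above can be pictured like this — the ratio values here are placeholders for illustration, not the tool's real calibration:

```python
# Hypothetical chars-per-token ratios per provider. Same input text,
# different estimated counts, because each tokenizer splits differently.
RATIOS = {"gpt-4o": 4.0, "claude": 3.8, "gemini": 4.2}
text_len = 1000  # characters of English prose

for model, chars_per_token in RATIOS.items():
    print(model, round(text_len / chars_per_token))
# gpt-4o 250, claude 263, gemini 238
```

The spread is small for plain English but widens for code, URLs, and non-Latin scripts, which is where the estimates diverge most from real tokenizer output.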
How does the tool handle CJK (Chinese, Japanese, Korean) text?
CJK characters are tokenized less efficiently than Latin characters in most LLM tokenizers — a single Chinese character might become 2-3 tokens depending on the model. The tool detects CJK character ranges and applies a higher token-per-character ratio for those segments, giving you a more realistic estimate than a simple word-count division would.
Are the pricing numbers up to date?
Pricing is hardcoded based on the latest published rates at the time the tool was last updated. LLM providers change their pricing periodically — for example, OpenAI has cut GPT-4o pricing multiple times since launch. Check the provider's official pricing page if you need exact current rates. The relative cost comparisons between models remain useful even if absolute numbers shift.
Can I estimate costs for a full conversation with multiple turns?
The tool estimates cost for a single block of text at a time. For multi-turn conversations, remember that each API call sends the entire conversation history as input tokens. Paste your full conversation context (system prompt + all previous turns) to get the cumulative input cost. Then estimate the output cost separately for the expected response length.
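The way history accumulates is easy to underestimate. This sketch uses made-up token counts and an illustrative price to show how each call re-bills the entire conversation so far:

```python
SYSTEM_TOKENS = 500
TURNS = [(200, 300), (150, 250), (100, 400)]  # (user, assistant) per turn
INPUT_PRICE = 3.00 / 1_000_000  # USD per input token (illustrative)

history = SYSTEM_TOKENS
total_input = 0
for user, assistant in TURNS:
    history += user         # user message joins the context
    total_input += history  # the whole history is billed as input
    history += assistant    # the model's reply joins the context too

print(total_input)  # 3350 input tokens billed across just 3 calls
print(round(total_input * INPUT_PRICE, 6))
```

Note that only about 950 of those tokens are new user content; the rest is the same prefix billed again and again — exactly the pattern prompt caching is designed to cheapen.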
What is the difference between input and output token pricing?
Most LLM providers charge different rates for input tokens (your prompt) and output tokens (the model's response). Output tokens are typically 3-5x more expensive than input tokens because generation requires more compute than reading. Check your provider's pricing page for current rates — they change frequently. This tool lets you toggle between input and output pricing to model both sides of a request.
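Modeling both sides of a request is a two-term sum. A minimal sketch, with illustrative rates (USD per million tokens) where output is priced at 5x input:

```python
RATES = {"input": 3.00, "output": 15.00}  # hypothetical, not real rates

def pair_cost(input_tokens: int, output_tokens: int) -> float:
    """Total cost of one request/response pair, in USD."""
    return (input_tokens * RATES["input"]
            + output_tokens * RATES["output"]) / 1_000_000

# A 2K-token prompt with a 500-token completion:
print(round(pair_cost(2_000, 500), 4))  # 0.0135
```

Even though the completion is a quarter the length of the prompt here, it accounts for more than half the cost — which is why capping max output tokens is one of the cheapest optimizations available.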
Related Tools
Word & Character Counter
Count words, characters, sentences, paragraphs, and estimate reading time with keyword frequency analysis.
String Length Calculator
Calculate string length in characters, code points, grapheme clusters, and byte sizes for UTF-8, UTF-16, and UTF-32.
JSON Formatter
Format, validate, and beautify JSON with syntax highlighting and tree view.
Markdown Preview
Write and preview Markdown in real time with GFM support, tables, task lists, and HTML export.