Encoding Detector

Detect character encoding of text and files by analyzing byte patterns. Supports UTF-8, ASCII, ISO-8859-1, Shift_JIS, and more.

About This Tool

The Encoding Detector analyzes text and file byte patterns to identify the character encoding being used. It supports detection of a wide range of encodings including UTF-8, ASCII, ISO-8859-1 (Latin-1), Windows-1252 (CP-1252), Shift_JIS, EUC-JP, and GB2312, along with BOM-based detection for UTF-16 and UTF-32 variants.

Character encoding determines how bytes are mapped to characters. When a file is opened with the wrong encoding, you see garbled text known as mojibake. This is a common problem when transferring files between different operating systems, reading legacy databases, or processing text from international sources. Identifying the correct encoding is the first step to fixing these issues.

The detector works by examining byte-level patterns. It first checks for a Byte Order Mark (BOM), which is a special sequence of bytes at the beginning of a file that unambiguously identifies the encoding. If no BOM is found, the tool uses heuristic analysis: it validates whether the byte sequences conform to UTF-8 multi-byte rules, checks for byte ranges characteristic of ISO-8859-1 or Windows-1252, and tests for valid lead-byte / trail-byte pairs used in Japanese (Shift_JIS, EUC-JP) and Chinese (GB2312) encodings.
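The UTF-8 part of that heuristic can be sketched as a simple byte walk: every multi-byte character must start with a valid lead byte and be followed by the right number of continuation bytes (`10xxxxxx`). This is an illustrative sketch, not the tool's actual code; the function name `looksLikeUtf8` is invented for the example.

```javascript
// Minimal sketch of the UTF-8 heuristic: walk the bytes and verify that
// every multi-byte sequence follows the lead-byte / continuation-byte rules.
function looksLikeUtf8(bytes) {
  let i = 0;
  while (i < bytes.length) {
    const b = bytes[i];
    let extra;
    if (b < 0x80) extra = 0;                 // ASCII, single byte
    else if ((b & 0xe0) === 0xc0) extra = 1; // 110xxxxx: 2-byte sequence
    else if ((b & 0xf0) === 0xe0) extra = 2; // 1110xxxx: 3-byte sequence
    else if ((b & 0xf8) === 0xf0) extra = 3; // 11110xxx: 4-byte sequence
    else return false;                       // stray continuation or invalid lead byte
    for (let k = 1; k <= extra; k++) {
      // each continuation byte must match 10xxxxxx
      if (i + k >= bytes.length || (bytes[i + k] & 0xc0) !== 0x80) return false;
    }
    i += extra + 1;
  }
  return true;
}
```

For example, "é" in UTF-8 is the pair C3 A9 and passes this check, while the single Latin-1 byte E9 followed by a space fails it — which is exactly the kind of evidence the detector weighs when UTF-8 and ISO-8859-1 are both candidates.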

Each candidate encoding is assigned a confidence percentage based on how well the byte data matches the expected patterns. Results are sorted by confidence, making it easy to identify the most likely encoding. The tool also provides a hex dump of the first bytes for manual inspection.
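The ranking step amounts to sorting candidates by their confidence score, descending. The object shapes below are illustrative only, not the tool's actual data model:

```javascript
// Sketch of ranking candidate encodings by confidence (highest first).
// Encodings and percentages here are made-up example data.
const candidates = [
  { encoding: "ISO-8859-1",   confidence: 60 },
  { encoding: "UTF-8",        confidence: 95 },
  { encoding: "Windows-1252", confidence: 65 },
];
const ranked = [...candidates].sort((a, b) => b.confidence - a.confidence);
// ranked[0] is now the most likely encoding
```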

All processing happens entirely in your browser. Your text and files are never uploaded to any server. The file is read as an ArrayBuffer using the File API and analyzed byte-by-byte in JavaScript.

How to Use

  1. Choose Text mode to paste text directly, or File mode to analyze a file.
  2. In Text mode, paste your content into the text area. The encoding analysis runs automatically as you type or paste.
  3. In File mode, drag and drop a file onto the drop zone, or click "browse" to select a file from your computer.
  4. Review the results table showing each detected encoding with its confidence score and description.
  5. Scroll down to the hex dump to inspect the raw values of the first bytes of your data.
  6. Click Copy (or press Ctrl+Shift+C) to copy the detection results to your clipboard.

FAQ

Is my data safe?

Yes. All encoding detection runs entirely in your browser using JavaScript. No data is sent to any server. Your text and files stay on your machine.

Why does pasted text always show as UTF-8?

When you paste text into a browser text area, the browser converts it to its internal string representation (UTF-16). When the tool encodes this string to bytes for analysis, it uses the TextEncoder API which always produces UTF-8. To detect the original encoding of a file, use the File mode instead.
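You can verify this in a browser console or Node: `TextEncoder` supports UTF-8 and nothing else, so byte-level analysis of pasted text can only ever see UTF-8 bytes.

```javascript
// TextEncoder only ever emits UTF-8 bytes, regardless of the encoding
// the pasted text originally came from.
const encoder = new TextEncoder();
// encoder.encoding is always "utf-8"; no other encoding can be requested.
const bytes = encoder.encode("café");
// "é" comes out as the UTF-8 pair C3 A9 here, even if the source file
// stored it as the single Latin-1 byte E9.
```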

What is a Byte Order Mark (BOM)?

A BOM is a special Unicode character (U+FEFF) placed at the beginning of a file to signal its encoding and byte order. For example, a UTF-8 BOM is the three-byte sequence EF BB BF, while a UTF-16 LE BOM is FF FE. When a BOM is present, encoding detection is 100% certain.
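BOM sniffing can be sketched as a prefix match against known signatures. One subtlety: the UTF-32 LE BOM (FF FE 00 00) begins with the UTF-16 LE BOM (FF FE), so longer signatures must be tested first. The function name `sniffBom` is invented for this example:

```javascript
// Sketch of BOM-based detection: match known byte signatures at offset 0,
// longest signatures first so UTF-32 LE is not misread as UTF-16 LE.
function sniffBom(bytes) {
  const boms = [
    { name: "UTF-8",     sig: [0xef, 0xbb, 0xbf] },
    { name: "UTF-32 LE", sig: [0xff, 0xfe, 0x00, 0x00] },
    { name: "UTF-32 BE", sig: [0x00, 0x00, 0xfe, 0xff] },
    { name: "UTF-16 LE", sig: [0xff, 0xfe] },
    { name: "UTF-16 BE", sig: [0xfe, 0xff] },
  ];
  for (const { name, sig } of boms) {
    if (sig.length <= bytes.length && sig.every((b, i) => bytes[i] === b)) {
      return name;
    }
  }
  return null; // no BOM: fall back to heuristic analysis
}
```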

How accurate is the detection?

Detection accuracy depends on the data. Files with a BOM are detected with 100% confidence. For UTF-8 text with multi-byte characters (e.g., accented letters, CJK), accuracy is very high. Short ASCII-only strings are ambiguous since ASCII is a valid subset of many encodings. The confidence percentage reflects the strength of the heuristic match.

What is the difference between ISO-8859-1 and Windows-1252?

ISO-8859-1 (Latin-1) and Windows-1252 are both single-byte Western European encodings. They are identical for byte values 0xA0-0xFF, but differ in the 0x80-0x9F range. ISO-8859-1 maps these to control characters, while Windows-1252 maps them to printable characters like curly quotes, em dashes, and the euro sign. In practice, many files labeled as ISO-8859-1 are actually Windows-1252.
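The 0x80-0x9F difference is easy to see in code. True ISO-8859-1 maps every byte to the same code point (so 0x93 becomes the invisible control character U+0093), while Windows-1252 reassigns that range to printable characters (0x93 becomes a left curly quote). Note a related gotcha: per the WHATWG Encoding Standard, the label "iso-8859-1" is treated as an alias for windows-1252 in browsers, so `TextDecoder` cannot produce true Latin-1 for this range.

```javascript
// The same byte, 0x93, under the two encodings:
const byte = 0x93;
// True ISO-8859-1 is an identity mapping from byte to code point,
// so 0x93 decodes to the control character U+0093.
const latin1 = String.fromCharCode(byte);
// Windows-1252 maps 0x93 to a printable left double quotation mark.
const cp1252 = new TextDecoder("windows-1252").decode(Uint8Array.of(byte));
```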

Can I analyze large files?

Yes. The file is read into memory as an ArrayBuffer and analyzed in JavaScript. Files up to several hundred megabytes work well. Very large files may be limited by your browser's available memory. The hex dump shows only the first 160 bytes regardless of file size.
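A hex dump like the one the tool shows can be sketched in a few lines: print offsets, hex byte values, and a printable-ASCII column, 16 bytes per row, capped at the first 160 bytes. The function name `hexDump` and the exact layout are illustrative, not necessarily what the tool renders.

```javascript
// Sketch of a 160-byte hex dump preview: offset, hex bytes, ASCII column.
function hexDump(bytes, limit = 160) {
  const rows = [];
  const n = Math.min(bytes.length, limit);
  for (let off = 0; off < n; off += 16) {
    const slice = Array.from(bytes.subarray(off, Math.min(off + 16, n)));
    const hex = slice.map(b => b.toString(16).padStart(2, "0")).join(" ");
    // show printable ASCII (0x20-0x7E); everything else becomes "."
    const ascii = slice
      .map(b => (b >= 0x20 && b < 0x7f ? String.fromCharCode(b) : "."))
      .join("");
    rows.push(off.toString(16).padStart(8, "0") + "  " + hex + "  " + ascii);
  }
  return rows.join("\n");
}
```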

Related Tools