Question 1

What information does the Unicode Inspector show?

Accepted Answer

For each character it displays: the rendered character, Unicode code point (U+XXXX), UTF-8 byte sequence in hex, UTF-16 code units in hex, the Unicode character name, general category (Letter, Number, Punctuation, Symbol, Separator, Control, Other), Unicode block name, and UTF-8 byte count. Summary statistics include totals for characters, code points, UTF-8 bytes, UTF-16 bytes, and unique characters.

Question 2

How does it handle emoji and supplementary characters?

Accepted Answer

The tool uses JavaScript's Unicode-aware string iteration (Symbol.iterator) to correctly split text into individual Unicode code points, even when a character requires a surrogate pair in UTF-16. For example, the globe emoji (U+1F30D) is shown as a single character with its 4-byte UTF-8 encoding and 2 UTF-16 code units.

Question 3

Can I search for a specific code point?

Accepted Answer

Yes. Type a code point in U+XXXX format (e.g. U+00E9 for e with acute accent), a hex value with 0x prefix, or a decimal number into the search bar. You can also search by character name, category, or Unicode block name.

Question 4

What is the difference between UTF-8 bytes and UTF-16 code units?

Accepted Answer

UTF-8 uses 1 to 4 bytes per character — ASCII characters use 1 byte, most European accented characters use 2 bytes, CJK ideographs use 3 bytes, and emoji use 4 bytes. UTF-16 uses 2 or 4 bytes (1 or 2 code units of 16 bits each). Characters in the Basic Multilingual Plane (U+0000 to U+FFFF) use 1 code unit; supplementary characters above U+FFFF use a surrogate pair of 2 code units.

Question 5

How accurate are the character names?

Accepted Answer

The tool includes a built-in lookup table covering ASCII characters, common punctuation, currency symbols, special Unicode characters (zero-width spaces, BOM, etc.), and names generated from Unicode block ranges for CJK, Hiragana, Katakana, Hangul, and emoji. For less common characters, a descriptive name based on the code point and block is provided.

Question 6

Is my data safe?

Accepted Answer

Yes. Character analysis relies on JavaScript's String.prototype[Symbol.iterator](), TextEncoder for UTF-8 byte calculations, and a bundled Unicode name lookup table. No data is sent to any server, no external APIs are called, and your text never leaves the browser tab.

Question 7

Can I use this to debug encoding issues?

Accepted Answer

Absolutely. The tool is ideal for identifying invisible characters (zero-width spaces, byte order marks, non-breaking spaces), mojibake (incorrectly decoded text), and unexpected characters in data files. The UTF-8 byte display helps you verify whether characters are encoded as expected.

Unicode Inspector

About This Tool

How to Use

Popular Unicode Inspector Examples

FAQ

Related Tools

Word & Character Counter

String Escape/Unescape

Text Case Converter

ASCII / Unicode Table

Base64 Encode/Decode

Whitespace Visualizer

String Length Calculator

Unicode Normalizer

Locale String Tester

Language Code Reference