Question 1

Combining Characters and Diacritical Marks

Accepted Answer

## Combining Characters: Multiple Code Points, One Visual Character

Unicode allows characters to be composed from a base character plus one or more combining marks. The result looks like a single character but consists of multiple code points.

### Example: é Two Ways

Precomposed (NFC):

é  →  U+00E9 (1 code point, 2 UTF-8 bytes)

Decomposed (NFD):

é  →  U+0065 + U+0301 (2 code points, 3 UTF-8 bytes)

Both render identically as é, but:

| Metric | Precomposed | Decomposed |
|--------|-----

Question 2

When is this useful?

Accepted Answer

When building text editors, input validators, or search functionality that handles international text, understanding combining characters prevents bugs like broken truncation, inconsistent search results, and incorrect character counting for user-facing limits.

Combining Characters and Diacritical Marks

Detailed Explanation

Combining Characters: Multiple Code Points, One Visual Character

Example: é Two Ways

Stacked Combining Marks

Zalgo Text

Practical Impact

Use Case

Try It — String Length Calculator

Related Topics

Metric	Precomposed	Decomposed
`.length`	1	2
Code points	1	2
Grapheme clusters	1	1
UTF-8 bytes	2	3