Question 1

CJK Character Length: Chinese, Japanese, Korean Text

Accepted Answer

## CJK Characters: 3 Bytes in UTF-8

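Most Chinese, Japanese, and Korean characters in everyday use sit in the Basic Multilingual Plane between U+0800 and U+FFFF: CJK Unified Ideographs (U+4E00–U+9FFF, covering Chinese hanzi, Japanese kanji, and Korean hanja), Hiragana (U+3040–U+309F), Katakana (U+30A0–U+30FF), and Hangul Syllables (U+AC00–U+D7AF). Every code point in that range takes 3 bytes in UTF-8. Rarer ideographs in the supplementary planes, such as CJK Extension B (starting at U+20000), take 4 bytes.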
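As a quick check of the byte counts, the sketch below encodes a few single characters with the standard `TextEncoder` API, which always produces UTF-8. The helper name `utf8ByteLength` is made up for illustration, and the code assumes a runtime where `TextEncoder` is a global (browsers and current Node.js).

```javascript
// TextEncoder always encodes to UTF-8.
const encoder = new TextEncoder();

// Hypothetical helper: number of bytes a string occupies in UTF-8.
function utf8ByteLength(text) {
  return encoder.encode(text).length;
}

console.log(utf8ByteLength("A"));  // 1 (ASCII, U+0041)
console.log(utf8ByteLength("東")); // 3 (CJK Unified Ideograph, U+6771)
console.log(utf8ByteLength("あ")); // 3 (Hiragana, U+3042)
console.log(utf8ByteLength("한")); // 3 (Hangul syllable, U+D55C)
console.log(utf8ByteLength("𠮷")); // 4 (CJK Extension B, U+20BB7)
```

The last line shows the exception: an ideograph outside the Basic Multilingual Plane needs 4 bytes, so "3 bytes per CJK character" is a rule for common text, not a guarantee.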

### Example String

東京都渋谷区 (Tokyo Shibuya-ku)

### Length Measurements

| Metric | Value |
|--------|-------|
| JavaScript `.length` (UTF-16 code units) | 6 |
| Code points | 6 |
| Grapheme clusters | 6 |
| UTF-8 bytes | 18 |
| UTF-16 bytes | 12 |
| UTF-32 bytes | 24 |
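These measurements can be reproduced for the six-character example string 東京都渋谷区 with a short script. This is a minimal sketch: `measureString` is a hypothetical helper name, and the grapheme count assumes a runtime that supports `Intl.Segmenter` (recent browsers and Node.js 16 or later).

```javascript
// Hypothetical helper: reports the metrics from the table for any string.
function measureString(text, locale = "ja") {
  // .length counts UTF-16 code units, not characters.
  const utf16Units = text.length;

  // The string iterator walks code points, so surrogate pairs count once.
  const codePoints = [...text].length;

  // Intl.Segmenter groups code points into user-perceived characters.
  const segmenter = new Intl.Segmenter(locale, { granularity: "grapheme" });
  const graphemes = [...segmenter.segment(text)].length;

  return {
    utf16Units,
    codePoints,
    graphemes,
    utf8Bytes: new TextEncoder().encode(text).length,
    utf16Bytes: utf16Units * 2, // 2 bytes per UTF-16 code unit
    utf32Bytes: codePoints * 4, // 4 bytes per code point
  };
}

const metrics = measureString("東京都渋谷区");
console.log(metrics);
// utf16Units: 6, codePoints: 6, graphemes: 6,
// utf8Bytes: 18, utf16Bytes: 12, utf32Bytes: 24
```

All three character counts agree here because every character is a single Basic Multilingual Plane code point with no combining marks; they diverge for emoji, supplementary-plane ideographs, and combining sequences.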


Question 2

When is this useful?

Accepted Answer

When building applications for Asian markets or handling multilingual content, knowing that most CJK characters take 3 bytes each in UTF-8 (compared with 2 in UTF-16 and 1 for ASCII) matters for storage planning, API payload size estimation, and database column sizing, especially where a limit is defined in bytes rather than characters.
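For byte-limited fields, one possible approach is to measure the encoded size before writing and, if needed, truncate on a character boundary so no multi-byte sequence is cut in half. The sketch below is illustrative only: `fitsInUtf8Bytes` and `truncateToUtf8Bytes` are hypothetical helper names, and the byte limits are arbitrary values rather than limits of any particular database or API.

```javascript
const encoder = new TextEncoder();

// Hypothetical helper: does `text` fit in a field limited to `maxBytes` UTF-8 bytes?
function fitsInUtf8Bytes(text, maxBytes) {
  return encoder.encode(text).length <= maxBytes;
}

// Hypothetical helper: keep whole code points until the byte budget runs out.
function truncateToUtf8Bytes(text, maxBytes) {
  let used = 0;
  let result = "";
  for (const ch of text) { // iterates by code point, so surrogate pairs stay intact
    const size = encoder.encode(ch).length;
    if (used + size > maxBytes) break;
    used += size;
    result += ch;
  }
  return result;
}

const address = "東京都渋谷区"; // 6 characters, 18 UTF-8 bytes

console.log(fitsInUtf8Bytes(address, 20));     // true
console.log(fitsInUtf8Bytes(address, 15));     // false
console.log(truncateToUtf8Bytes(address, 10)); // "東京都" (9 bytes; a fourth character would need 12)
```

Truncating by code point avoids invalid UTF-8 but can still split a grapheme cluster made of several code points; text with combining marks or emoji sequences would need segmentation by grapheme instead.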

