Question 1

Regex to Extract Emoji from Text — Unicode Property Escapes

Accepted Answer

## Extracting Emoji from Text

Emoji detection looks deceptively simple but requires Unicode property escapes to handle multi-codepoint sequences correctly. Modern JavaScript (ES2018+) supports \p{Extended_Pictographic} with the u flag.

### Single Emoji

\p{Extended_Pictographic}

(Use the u flag.) Matches single-codepoint emoji like 😀 and 🎉.

### Including Skin-Tone Modifiers and ZWJ Sequences

Emoji like 👩‍💻 (woman technologist) are sequences joined by zero-width joiners (\u200D). To ca

Question 2

When is this useful?

Accepted Answer

Detecting emoji-only messages in chat support to escalate them appropriately, stripping emoji for legacy systems that mishandle wide characters, or counting emoji frequency in user reviews for sentiment analysis.

Input	Single	Sequence-Aware
`"hi 😀"`	😀	😀
`"👩‍💻 is a developer"`	👩, 💻	👩‍💻
`"🇯🇵 Japan"`	🇯, 🇵	🇯🇵
`"thumbs up: 👍🏽"`	👍, 🏽	👍🏽

Regex to Extract Emoji from Text — Unicode Property Escapes

Detailed Explanation

Extracting Emoji from Text

Single Emoji

Including Skin-Tone Modifiers and ZWJ Sequences

Tested Examples

Counting Visible Emoji

Stripping Emoji

Practical Recommendation

Use Case

Try It — Regex Cheat Sheet

Related Topics