๐Ÿš€ Big News: Socket Acquires Coana to Bring Reachability Analysis to Every Appsec Team.Learn more โ†’
Socket
Book a DemoInstallSign in
Socket

split-graphemes

Package Overview
Dependencies
Maintainers
2
Versions
13
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

split-graphemes

Divide the string into graphemes.

0.5.0
latest
Source
npm
Version published
Weekly downloads
11K
5.97%
Maintainers
2
Weekly downloads
ย 
Created
Source

split-graphemes

Divide ligature letters such as Thai, Khmer letters and complex emoji into array of graphemes. You can simply use this library instead of Array.from to get graphemes.

CircleCI

Installation

$ npm install split-graphemes

Examples

Emoji

// An emoji '๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ฆโ€๐Ÿ‘ฆ' consists of 4 people face emoji joined by Zero Width Joiners (ZWJ).
const chars = Array.from('๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ฆโ€๐Ÿ‘ฆ') // ['๐Ÿ‘จ', ZWJ, '๐Ÿ‘ฉ', ZWJ, '๐Ÿ‘ฆ', ZWJ, '๐Ÿ‘ฆ']
// It is interpreted exactly as one character!
const chars = splitGraphemes('๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ฆโ€๐Ÿ‘ฆ') // ['๐Ÿ‘จโ€๐Ÿ‘ฉโ€๐Ÿ‘ฆโ€๐Ÿ‘ฆ']

Khmer characters

Array.from('แž”แŸ‰แžปแžŸแŸ’แžŠแžทแŸ') // ['แž”', 'แŸ‰', 'แžป', 'แžŸ', 'แŸ’', 'แžŠ', 'แžท', 'แŸ']
splitGraphemes('แž”แŸ‰แžปแžŸแŸ’แžŠแžทแŸ') // ['แž”แŸ‰แžป', 'แžŸแŸ’แžŠแžทแŸ']

Japanese NFD

splitGraphemes('ใ“ใ‚™ใ‚“ใ‚™ใซใ‚™ใกใ‚™ใฏใ‚™') // ['ใ“ใ‚™', 'ใ‚“ใ‚™', 'ใซใ‚™', 'ใกใ‚™', 'ใฏใ‚™']
splitGraphemes('ใƒใ‚šใƒ’ใ‚šใƒ•ใ‚šใƒ˜ใ‚šใƒ›ใ‚š') // ['ใƒใ‚š', 'ใƒ’ใ‚š', 'ใƒ•ใ‚š', 'ใƒ˜ใ‚š', 'ใƒ›ใ‚š']

English

splitGraphemes('Hello') // ['H', 'e', 'l', 'l', 'o']

Supported ligature characters

The list of characters is at here.

Keywords

grapheme

FAQs

Package last updated on 27 Jul 2021

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts