cjk-length

Returns string length with wide characters counting as two



CJK Length


In CJK (Chinese, Japanese and Korean) text, "wide" or "fullwidth" characters are Unicode characters that are rendered two cells wide instead of one in a fixed-width font. Examples include the Japanese kana (あいうえお), fullwidth romaji (ＡＢＣＤＥ), and kanji/hanzi ideographs (一所懸命).

These characters occupy two cells but count as one in a string's length, which makes it hard to measure a string accurately for a fixed-width text environment such as the terminal. A string containing one fullwidth character appears one cell longer than its length value indicates, and this breaks tabulated layouts, for example.

This function scans a given string for occurrences of characters from the relevant Unicode ranges to correctly determine the string's visual length.
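The idea can be illustrated with a minimal sketch. This is not the package's actual source: the helper name visualLength and the constant WIDE_CHARS are hypothetical, and the ranges listed are only an assumed subset chosen for the example (the real list lives in characters.js).

```javascript
// Assumed subset of wide-character ranges, for illustration only.
// The package's own list (characters.js) may differ.
const WIDE_CHARS = /[\u1100-\u11F9\u3000-\u303F\u3040-\u309F\u30A0-\u30FF\u4E00-\u9FFF\uFF01-\uFF60\uFFE0-\uFFE6]/g

// Hypothetical helper: every wide character adds one extra cell on top of .length.
function visualLength(str) {
  const wide = str.match(WIDE_CHARS) // null when there are no matches
  return str.length + (wide ? wide.length : 0)
}

console.log(visualLength('abcde'))      // 5
console.log(visualLength('あいうえお'))  // 10
```

Counting matches and adding them to .length keeps the logic to a single pass over the string, at the cost of depending entirely on how complete the range list is.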

For a full list of the character ranges used, see the characters.js source.

Usage

To use, replace property accesses such as myString.length with function calls to cjkLength(myString):

const cjkLength = require('cjk-length').default

// Using cjkLength() to get a visually correct string length for fixed-width fonts:
// In this case, 'abcdeＡＢＣＤＥ' has length 10 but is displayed as though its length were 15.
const myString = 'abcdeＡＢＣＤＥ'
console.log(myString.length)      // 10
console.log(cjkLength(myString))  // 15

// Verifying that this longer string width value looks correct (in a terminal):
console.log(`.${myString}.`)                         // .abcdeＡＢＣＤＥ.
console.log(`.${'a'.repeat(myString.length)}.`)      // .aaaaaaaaaa.
console.log(`.${'a'.repeat(cjkLength(myString))}.`)  // .aaaaaaaaaaaaaaa.
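For the tabulated-layout case, a visual length function can drive column padding. The sketch below is an illustration under assumptions: padEndVisual is a hypothetical helper, not part of this package, and measure is a stand-in with a small assumed set of ranges so the snippet runs on its own. In practice you would pass cjkLength instead.

```javascript
// Stand-in measure with a small, assumed set of ranges; in practice pass cjkLength.
const measure = (s) =>
  s.length + (s.match(/[\u3040-\u30FF\u4E00-\u9FFF\uFF01-\uFF60]/g) || []).length

// Hypothetical helper: pad with spaces until the visual width reaches `width`.
// Strings that are already wider than `width` are returned unchanged.
function padEndVisual(str, width, measureFn) {
  const missing = Math.max(0, width - measureFn(str))
  return str + ' '.repeat(missing)
}

const rows = [['name', 'city'], ['Aiko', '東京'], ['Bob', 'Oslo']]
for (const [a, b] of rows) {
  console.log(`|${padEndVisual(a, 6, measure)}|${padEndVisual(b, 6, measure)}|`)
}
```

The built-in padEnd() pads by .length, so it would add two spaces too many after '東京'; padding by measured width keeps the column borders aligned in a fixed-width font.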

If you need to process a string's wide characters in some other way, you can import the regular expression used to match them:

const { charsRegex } = require('cjk-length')

console.log(charsRegex instanceof RegExp)  // true

Note: charsRegex is structured like new RegExp('[\u1100-\u11F9\u3000-\u303F .. etc. \uFFE0-\uFFE6]', 'g').
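As one example of processing wide characters in some other way, the sketch below truncates a string to a maximum visual width. It rests on assumptions: truncateToWidth is a hypothetical helper, and wideRegex is a stand-in built in the same shape as charsRegex but with only a few ranges, so the snippet runs without the package installed. Because the regex carries the 'g' flag, test() keeps state in lastIndex between calls; the sketch sidesteps that by building a non-global copy from regex.source.

```javascript
// Stand-in for charsRegex with only a few ranges (an assumption for this sketch).
const wideRegex = new RegExp(
  '[\u1100-\u11F9\u3000-\u303F\u3040-\u30FF\u4E00-\u9FFF\uFF01-\uFF60\uFFE0-\uFFE6]',
  'g'
)

// Hypothetical helper: cut a string so its visual width does not exceed maxWidth.
function truncateToWidth(str, maxWidth, regex) {
  // A non-global copy avoids the lastIndex state that test() keeps on 'g' regexes.
  const single = new RegExp(regex.source)
  let width = 0
  let out = ''
  for (const ch of str) {
    const w = single.test(ch) ? 2 : 1
    if (width + w > maxWidth) break
    width += w
    out += ch
  }
  return out
}

console.log(truncateToWidth('abあい', 4, wideRegex)) // abあ
```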

Sources

License

MIT license

Package last updated on 28 Aug 2019
