detect-character-encoding

Detect character encoding using ICU.



Getting started

Install using:

$ npm install detect-character-encoding

Use it like this:

const fs = require('fs');
const detectCharacterEncoding = require('detect-character-encoding');

const fileBuffer = fs.readFileSync('file.txt');
const charsetMatch = detectCharacterEncoding(fileBuffer);

console.log(charsetMatch);
// {
//   encoding: 'UTF-8',
//   confidence: 60
// }
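
The returned object can drive the actual decoding step. The following is a minimal sketch, not part of this package: `decodeWithDetectedEncoding`, its `minConfidence` threshold of 50 and the policy of returning `null` are assumptions made for illustration. The detector is passed in as a parameter, so the sketch runs with a stub even when the native module is not installed.

```javascript
const { TextDecoder } = require('util');

// Hypothetical helper, not part of detect-character-encoding.
// `detect` is any function with the shape shown above:
// Buffer -> { encoding: string, confidence: number } (or a falsy value).
function decodeWithDetectedEncoding(buffer, detect, minConfidence = 50) {
  const match = detect(buffer);

  // Assumed policy: treat a missing or low-confidence match as "unknown".
  if (!match || match.confidence < minConfidence) {
    return null;
  }

  try {
    // TextDecoder throws a RangeError for encoding labels it does not support.
    return new TextDecoder(match.encoding).decode(buffer);
  } catch (err) {
    if (err instanceof RangeError) {
      return null;
    }
    throw err;
  }
}

// Stub detector standing in for require('detect-character-encoding').
const stubDetect = () => ({ encoding: 'UTF-8', confidence: 60 });
console.log(decodeWithDetectedEncoding(Buffer.from('héllo', 'utf8'), stubDetect));
```

In real use, pass `require('detect-character-encoding')` as the `detect` argument. Which encoding labels `TextDecoder` accepts depends on how your Node.js binary was built, so some detected character sets may still come back as `null`.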

Supported environments

detect-character-encoding should work fine on:

  • Ubuntu 14.04 x64
  • Ubuntu 16.04 x64
  • Debian 8
  • macOS 10.12
  • Alpine Linux

You may currently encounter issues on 32-bit systems and Windows.
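
If your application also has to run on those systems, it may help to guard the `require` call. The sketch below is an assumption-laden illustration, not something this package provides: `isLikelySupported` and `loadDetector` are hypothetical names, and the list of 32-bit `process.arch` values is a guess at how the note above maps onto Node.js platform identifiers.

```javascript
// process.arch values treated as 32-bit (assumed mapping, not from the README).
const THIRTY_TWO_BIT_ARCHS = ['arm', 'ia32', 'mips', 'mipsel', 'ppc', 's390'];

// Hypothetical check: false on Windows and on 32-bit architectures.
function isLikelySupported(platform = process.platform, arch = process.arch) {
  return platform !== 'win32' && !THIRTY_TWO_BIT_ARCHS.includes(arch);
}

// Hypothetical loader: returns the detector function, or null when the
// platform looks unsupported or the native module cannot be loaded.
function loadDetector() {
  if (!isLikelySupported()) {
    return null;
  }
  try {
    return require('detect-character-encoding');
  } catch (err) {
    return null;
  }
}

console.log('Platform looks supported:', isLikelySupported());
```

Callers can then fall back to a default encoding whenever `loadDetector()` returns `null`.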

Supported character sets

As listed in ICU’s user guide:

  • UTF-8
  • UTF-16BE
  • UTF-16LE
  • UTF-32BE
  • UTF-32LE
  • Shift_JIS
  • ISO-2022-JP
  • ISO-2022-CN
  • ISO-2022-KR
  • GB18030
  • Big5
  • EUC-JP
  • EUC-KR
  • ISO-8859-1
  • ISO-8859-2
  • ISO-8859-5
  • ISO-8859-6
  • ISO-8859-7
  • ISO-8859-8
  • ISO-8859-9
  • windows-1250
  • windows-1251
  • windows-1252
  • windows-1253
  • windows-1254
  • windows-1255
  • windows-1256
  • KOI8-R
  • IBM420
  • IBM424
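
When an encoding name comes from configuration or user input, it can be useful to check it against this list. The following is a minimal sketch: `LISTED_CHARSETS` and `isListedCharset` are hypothetical names, and the array is simply copied from the list above. Names are compared exactly apart from case and surrounding whitespace, so any variant spelling would not match.

```javascript
// Character set names copied from the list above.
const LISTED_CHARSETS = [
  'UTF-8', 'UTF-16BE', 'UTF-16LE', 'UTF-32BE', 'UTF-32LE',
  'Shift_JIS', 'ISO-2022-JP', 'ISO-2022-CN', 'ISO-2022-KR',
  'GB18030', 'Big5', 'EUC-JP', 'EUC-KR',
  'ISO-8859-1', 'ISO-8859-2', 'ISO-8859-5', 'ISO-8859-6',
  'ISO-8859-7', 'ISO-8859-8', 'ISO-8859-9',
  'windows-1250', 'windows-1251', 'windows-1252', 'windows-1253',
  'windows-1254', 'windows-1255', 'windows-1256',
  'KOI8-R', 'IBM420', 'IBM424'
];

// Lower-cased lookup set for case-insensitive membership checks.
const listedLookup = new Set(LISTED_CHARSETS.map((name) => name.toLowerCase()));

// Hypothetical helper: true if `name` appears in the list above.
function isListedCharset(name) {
  return typeof name === 'string' && listedLookup.has(name.trim().toLowerCase());
}

console.log(isListedCharset('utf-8'));
console.log(isListedCharset('UTF-7'));
```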

Release history

  • v0.5.0 (2017-07-23):
    • Update to ICU 59.1
    • Add support for Alpine Linux
    • Drop support for Node.js v5 and v7
  • v0.4.0 (2017-07-02):
    • Update to ICU 58.1
    • Add support for Node.js v8
    • Add support for Ubuntu 16.04 and drop support for Ubuntu 12.04
    • Add support for Debian 8 and drop support for Debian 7
    • Drop support for macOS versions older than macOS Sierra 10.12
  • v0.3.1 (2017-03-10):
    • Fix execution continuing after an error occurred.
    • Fix memory leak by properly closing ICU’s charset detector.
  • v0.3.0 (2017-01-28): Add support for Node.js v6 and v7 and drop support for Node.js v0.10 and v0.12.
  • v0.2.1 (2015-12-28): Republish because v0.2.0 didn’t include config.gypi.
  • v0.2.0 (2015-09-15): Add support for Node.js v4.
  • v0.1.0 (2015-03-15): Initial release.

License

detect-character-encoding is licensed under the BSD 2-clause license, subject to additional terms. See LICENSE for the full license text.

Package last updated on 23 Jul 2017
