Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

n-gram-lang-detect

Package Overview
Dependencies
Maintainers
1
Versions
2
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

n-gram-lang-detect

n-gram-lang-detect is a Node.js package designed to detect the language of a given string using n-gram analysis.

  • 1.1.1
  • latest
  • Source
  • npm
  • Socket score

Version published
Maintainers
1
Created
Source

n-gram-lang-detect

n-gram-lang-detect is a Node.js package designed to detect the language of a given string using n-gram analysis. It supports generating language profiles from datasets. Currently, it supports Spanish (ES), English (EN), and Portuguese (PR) languages.

This package is trained using the Universal Declaration of Human Rights in the respective languages. It also generates text using localisms from Argentina, Uruguay, and Brazil.

Installation

To install n-gram-lang-detect you can use npm:

npm install n-gram-lang-detect

How to use

const { detectLanguage } = require('n-gram-lang-detect');
import { detectLanguage } from 'n-gram-lang-detect';

Detect language

const text = 'Hola, ¿cómo estás?';
const detectedLanguage = detectLanguage(text);
console.log(detectedLanguage);  // { detectedLanguage: string; similarityScore: number }
const text = 'Hola, ¿cómo estás?';
const similarityThreshold = 0.004; // this is the default value
const detectedLanguage = detectLanguage(text, similarityThreshold);
console.log(detectedLanguage);  // { detectedLanguage: string; similarityScore: number }
const text = 'Hola, ¿cómo estás?';
const similarityThreshold = 0.004;
const customTerminology = [
    { "term": "championes", "mappedTerm": "zapatillas", "locale": "es" },
    { "term": "romanitas", "mappedTerm": "chancletas", "locale": "es" },
    { "term": "capacete de ciclismo", "mappedTerm": "Capacete de proteção", "locale": "pt" },
]
const detectedLanguage = detectLanguage(text, similarityThreshold,customTerminology);
console.log(detectedLanguage);  // { detectedLanguage: string; similarityScore: number }

Parameters

  • text: The text to detect the language.
  • similarityThreshold: The minimum similarity score to consider a language detected. Default value is 0.004.
  • customTerminology: A list of custom terminology to use in the detection. The recommended format is an array of objects with the following properties: term, mappedTerm, locale and the mappedTerm should be in the same language as the locale and should be a synonym of the term.

FAQs

Package last updated on 16 Oct 2024

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc