🚀 Big News: Socket Acquires Coana to Bring Reachability Analysis to Every Appsec Team.Learn more
Socket
DemoInstallSign in
Socket

markovian-nlp

Package Overview
Dependencies
Maintainers
1
Versions
40
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

markovian-nlp

Markov chains & NLP

1.1.5
Source
npm
Version published
Weekly downloads
3
-97.3%
Maintainers
1
Weekly downloads
 
Created
Source

markovian-nlp

license npm current version

Setup

Installation

With npm installed, run terminal command:

npm i markovian-nlp
  • npm package

Usage

Module import

Declare method import at the top of each JavaScript file it will be used.

ES2015

import ngramsDistribution from 'markovian-nlp';

CommonJS

const ngramsDistribution = require('markovian-nlp');

Glossary

Learn more about computational linguistics and natural language processing (NLP) on Wikipedia.

The following terms are used in the API documentation:

termdescription
bigram2-gram sequence
endgramfinal gram in a sequence
n-gramcontiguous gram (word) sequence
startgramfirst gram in a sequence
unigram1-gram sequence

API

ngramsDistribution(document)

View the n-grams distribution of text.

Potential applications: Markov models

Example

ngramsDistribution('birds have featured in culture and art since prehistoric times');
Output
{
  and: { _end: 0, _start: 0, art: 1 },
  art: { _end: 0, _start: 0, since: 1 },
  birds: { _end: 0, _start: 1, have: 1 },
  culture: { _end: 0, _start: 0, and: 1 },
  featured: { _end: 0, _start: 0, in: 1 },
  have: { _end: 0, _start: 0, featured: 1 },
  in: { _end: 0, _start: 0, culture: 1 },
  prehistoric: { _end: 0, _start: 0, times: 1 },
  since: { _end: 0, _start: 0, prehistoric: 1 },
  times: { _end: 1, _start: 0 },
}

Each number represents the sum of occurrences.

startgramendgrambigrams
"birds""times"all remaining keys ("have featured", "featured in", etc.)

Input

user-defined parametertypeimplementsintermediate transformations
documentStringcompromise (document)normalization, rule-based text parsing

Return value

typedescription
Objectdistributions of unigrams to startgrams, endgrams, and following bigrams
Signature
// pseudocode (does not run)
ngramsDistribution(document) => ({
  ...unigrams: {
    ...{ ...bigram: bigramsDistribution },
    _end: endgramsDistribution,
    _start: startgramsDistribution,
  },
});

FAQs

Package last updated on 02 Oct 2018

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts