markov-strings

A Markov string generator

3.0.0-beta.1

Markov-strings

A simplistic Markov chain text generator. Give it an array of strings, and it will output a randomly generated string.

This module was created for the Twitter bot @BelgicaNews.

Prerequisites

This module makes use of ES6 features.

Installing

npm install --save markov-strings

Usage

const Markov = require('markov-strings').default
// or
import Markov from 'markov-strings'

const data = [/* insert a few hundred or a few thousand sentences here */]

// Build the Markov generator
const markov = new Markov({ stateSize: 2 })

// Add data for the generator
markov.addData(data)

const options = {
  maxTries: 20, // Give up if I don't have a sentence after 20 tries (default is 10)
  prng: Math.random, // An external Pseudo Random Number Generator if you want to get seeded results
  filter: (result) => {
    // The parentheses keep the multi-line expression attached to `return`
    return (
      result.string.split(' ').length >= 5 && // At least 5 words
      result.string.endsWith('.')             // End sentences with a dot.
    )
  }
}

// Generate a sentence
const result = markov.generate(options)
console.log(result)
/*
{
  string: 'lorem ipsum dolor sit amet etc.',
  score: 42,
  tries: 5,
  refs: [ an array of objects ]
}
*/

API

new Markov([options])

Create a generator instance.

options

{
  stateSize: number
}

The stateSize is the number of words in each "link" of the generated sentence. A value of 1 produces gibberish that makes little sense. 2 is a sensible default for most cases. 3 or more can produce good sentences if your corpus allows it.
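To get a feel for what a "link" of a given size looks like, here is a minimal sketch that cuts a sentence into overlapping groups of words. The helper toStates is hypothetical and is not part of markov-strings; the library's internal structure may differ.

```javascript
// Illustration only: split a sentence into overlapping groups of `stateSize` words.
// `toStates` is a hypothetical helper, not a markov-strings function.
function toStates(sentence, stateSize) {
  const words = sentence.split(' ').filter((word) => word.length > 0)
  const states = []
  for (let i = 0; i + stateSize <= words.length; i++) {
    states.push(words.slice(i, i + stateSize).join(' '))
  }
  return states
}

console.log(toStates('the quick brown fox', 1)) // [ 'the', 'quick', 'brown', 'fox' ]
console.log(toStates('the quick brown fox', 2)) // [ 'the quick', 'quick brown', 'brown fox' ]
console.log(toStates('the quick brown fox', 3)) // [ 'the quick brown', 'quick brown fox' ]
```

Larger groups carry more context, but each group appears less often in the corpus, which is why a bigger stateSize needs more data.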

.addData(data)

To function correctly, the Markov generator needs its internal data to be correctly structured. .addData(data) lets you add raw data, which is automatically formatted to fit the internal structure.

You can call .addData(data) as often as you need, with new data each time.

data

string[] | Array<{ string: string }>

data is an array of strings (sentences), or an array of objects. If you wish to use objects, each one must have a string attribute. The bigger the array, the better and more varied the results.

Examples:

[ 'lorem ipsum', 'dolor sit amet' ]

[
  { string: 'lorem ipsum', attr: 'value' },
  { string: 'dolor sit amet', attr: 'other value' }
]
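If your own code has to handle both shapes, one option is to reduce them to a single form first. The sketch below is only an illustration: normalizeData is a hypothetical helper and is not how markov-strings is known to process its input.

```javascript
// Hypothetical helper: accept either input shape and return objects
// that all carry a `string` property.
function normalizeData(data) {
  return data.map((item) => {
    if (typeof item === 'string') {
      return { string: item }
    }
    if (item && typeof item.string === 'string') {
      return item
    }
    throw new TypeError('Each item must be a string or an object with a string attribute')
  })
}

console.log(normalizeData(['lorem ipsum', { string: 'dolor sit amet', attr: 'value' }]))
// [ { string: 'lorem ipsum' }, { string: 'dolor sit amet', attr: 'value' } ]
```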

Since .addData(data) can take some time (it loops over each word of each string), a non-blocking variant, .addDataAsync(data), is available if you need it.

.generate([options])

Returns an object of type MarkovResult:

{
  string: string, // The resulting sentence
  score: number,  // A relative "score" based on the number of possible permutations. Higher is "better", but the actual value depends on your corpus
  refs: Array<{ string: string }>, // The array of references used to build the sentence
  tries: number   // The number of tries it took to output this result
}

The refs array contains all the objects that were used to build the sentence. This may be useful for fetching metadata or computing statistics.
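As a rough illustration of reading metadata back from refs, the sketch below uses a hand-written object shaped like the MarkovResult described above. It is not real library output, and the attr property is just the example attribute from the data section.

```javascript
// A hand-written object shaped like a MarkovResult (not real library output).
const sampleResult = {
  string: 'lorem ipsum dolor sit amet',
  score: 3,
  tries: 1,
  refs: [
    { string: 'lorem ipsum dolor', attr: 'value' },
    { string: 'dolor sit amet', attr: 'other value' }
  ]
}

// Collect the distinct `attr` values of the objects the sentence was built from.
const sources = Array.from(new Set(sampleResult.refs.map((ref) => ref.attr)))
console.log(sources) // [ 'value', 'other value' ]
```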

Since .generate() can potentially take several seconds or more, a non-blocking variant .generateAsync() is conveniently available if you need it.

options

{
  maxTries: number, // The maximum number of attempts before giving up (default is 10)
  prng: () => number, // An external Pseudo Random Number Generator (such as Math.random) if you want to get seeded results
  filter: (result: MarkovResult) => boolean // A callback to filter results (see example above)
}
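Math.random cannot be seeded, so reproducible output needs a separate generator. Here is a minimal sketch of one, using the well-known mulberry32 algorithm. It is not shipped with markov-strings, and the sketch assumes the prng option expects a function returning a float in [0, 1), like Math.random.

```javascript
// mulberry32: a small seeded PRNG. Returns a function that yields
// floats in [0, 1), with the same call shape as Math.random.
function mulberry32(seed) {
  let state = seed >>> 0
  return function () {
    state = (state + 0x6d2b79f5) >>> 0
    let t = state
    t = Math.imul(t ^ (t >>> 15), t | 1)
    t ^= t + Math.imul(t ^ (t >>> 7), t | 61)
    return ((t ^ (t >>> 14)) >>> 0) / 4294967296
  }
}

// Two generators built from the same seed produce the same sequence.
const prngA = mulberry32(42)
const prngB = mulberry32(42)
console.log(prngA() === prngB()) // true

// Assumed usage with the `prng` option described above:
// markov.generate({ prng: mulberry32(42) })
```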

Changelog

3.0.0

Refactoring to facilitate iterative construction of the corpus (multiple .addData() instead of a one-time buildCorpus())

2.1.0

Add an optional prng parameter at generation to use a specific Pseudo Random Number Generator

2.0.4

Dependencies update

2.0.0

Refactoring with breaking changes
The constructor and generator take two different options objects
Most generator options are gone, except filter and maxTries
Tests have been rewritten with jest, in TypeScript

1.5.0

Code rewritten in TypeScript. You can now import MarkovGenerator from 'markov-strings'

1.4.0

New filter() method, thanks @flpvsk

1.3.4 - 1.3.5

Dependencies update

1.3.3

Updated README. Version bump for npm

1.3.2

Fixed an infinite loop bug
Performance improvement

1.3.1

Updated README example
Removed a useless line

1.3.0

New feature: the generator now accepts arrays of objects, and tells the user which objects were used to build a sentence
Fixed all unit tests
Added a changelog

Running the tests

npm test

Package last updated on 28 Mar 2020
