wink-jaro-distance
An implementation of the Jaro Distance Algorithm by Matthew A. Jaro
De-duplicate short strings such as names by computing the similarity and distance between a pair of strings using wink-jaro-distance. It is part of wink, a growing family of high-quality packages for Statistical Analysis, Natural Language Processing and Machine Learning in Node.js.
It implements the Jaro Distance Algorithm, which determines the similarity and distance between two strings by taking into account insertions, deletions and transpositions.
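For readers who want to see the idea in code, here is a minimal sketch of the textbook Jaro similarity: count the characters that match within a sliding window, count the transpositions among them, and average three ratios. The function name `jaroSketch` and the handling of empty strings are assumptions made for illustration. This is not the package's source, and its results may differ from the package in edge cases.

```javascript
// Illustrative sketch of the textbook Jaro similarity.
// `jaroSketch` is a hypothetical name; this is NOT the wink-jaro-distance source.
function jaroSketch( s1, s2 ) {
  // Assumption: identical strings (including two empty ones) are a perfect match.
  if ( s1 === s2 ) return { distance: 0, similarity: 1 };
  if ( s1.length === 0 || s2.length === 0 ) return { distance: 1, similarity: 0 };

  // Characters match only if they are equal and at most `range` positions apart.
  var range = Math.max( 0, Math.floor( Math.max( s1.length, s2.length ) / 2 ) - 1 );
  var matched1 = new Array( s1.length ).fill( false );
  var matched2 = new Array( s2.length ).fill( false );
  var matches = 0;

  for ( var i = 0; i < s1.length; i += 1 ) {
    var lo = Math.max( 0, i - range );
    var hi = Math.min( s2.length - 1, i + range );
    for ( var j = lo; j <= hi; j += 1 ) {
      if ( !matched2[ j ] && s1[ i ] === s2[ j ] ) {
        matched1[ i ] = true;
        matched2[ j ] = true;
        matches += 1;
        break;
      }
    }
  }
  if ( matches === 0 ) return { distance: 1, similarity: 0 };

  // Walk the matched characters in order; each out-of-order pair counts twice,
  // so the number of transpositions is half the number of mismatches.
  var k = 0;
  var mismatches = 0;
  for ( var m = 0; m < s1.length; m += 1 ) {
    if ( !matched1[ m ] ) continue;
    while ( !matched2[ k ] ) k += 1;
    if ( s1[ m ] !== s2[ k ] ) mismatches += 1;
    k += 1;
  }
  var transpositions = mismatches / 2;

  var similarity = ( ( matches / s1.length ) +
                     ( matches / s2.length ) +
                     ( ( matches - transpositions ) / matches ) ) / 3;
  return { distance: 1 - similarity, similarity: similarity };
}

// The classic pair 'MARTHA' / 'MARHTA' has 6 matches and 1 transposition,
// giving a similarity of 17/18 (approximately 0.944).
console.log( jaroSketch( 'MARTHA', 'MARHTA' ).similarity );
```

The sketch returns an object with `distance` and `similarity` to mirror the shape described in the API section. Defining `distance` as `1 - similarity` is an assumption of the sketch.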
Installation
Use npm to install:
npm install wink-jaro-distance --save
Example
var jaro = require( 'wink-jaro-distance' );
console.log( jaro( 'father', 'farther') );
console.log( jaro( 'Angelina', 'Angelica') );
console.log( jaro( 'Flikr', 'Flicker' ) );
console.log( jaro( 'abcdef', 'fedcba' ) );
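As a rough illustration of the de-duplication use case, the sketch below keeps a name only if it is not too similar to one already kept. The helper `dedupe`, its `compare` parameter and the threshold are hypothetical and not part of the package. `compare` is assumed to behave like `jaro`, taking two strings and returning an object with a `similarity` value between 0 and 1.

```javascript
// Hypothetical helper, not part of wink-jaro-distance.
// `compare( a, b )` is assumed to return an object with a `similarity`
// property between 0 and 1, as the package's `jaro` is documented to do.
function dedupe( names, compare, threshold ) {
  var kept = [];
  names.forEach( function ( name ) {
    // Treat `name` as a duplicate if it is close enough to any kept name.
    var isDuplicate = kept.some( function ( other ) {
      return compare( other, name ).similarity >= threshold;
    } );
    if ( !isDuplicate ) kept.push( name );
  } );
  return kept;
}

// Intended usage, with the package installed:
//   var jaro = require( 'wink-jaro-distance' );
//   dedupe( [ 'Angelina', 'Angelica', 'Flikr', 'Flicker' ], jaro, 0.9 );
```

The first occurrence of each group of similar names is the one retained, and the right threshold depends on the data, so treat the value of 0.9 above as a placeholder.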
API
Table of Contents
jaro
Computes Jaro distance and similarity between strings s1 and s2.
Original Reference: UNIMATCH: A Record Linkage System: Users Manual pp 104.
Parameters
s1 string: the first string.
s2 string: the second string.
Examples
jaro( 'daniel', 'danielle' );
jaro( 'god', 'father' );
Returns object containing distance and similarity values between 0 and 1.
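One way to consume the returned object is to rank a list of candidates against a query string by their `similarity` value. The helper below is a hypothetical sketch, not part of the package. Its `compare` argument is assumed to be `jaro` or any function returning an object with a `similarity` property.

```javascript
// Hypothetical helper, not part of wink-jaro-distance.
// `compare( a, b )` is assumed to return an object with a `similarity`
// property between 0 and 1.
function rankBySimilarity( query, candidates, compare ) {
  return candidates
    .map( function ( candidate ) {
      return {
        candidate: candidate,
        similarity: compare( query, candidate ).similarity
      };
    } )
    // Highest similarity first.
    .sort( function ( a, b ) {
      return b.similarity - a.similarity;
    } );
}

// Intended usage, with the package installed:
//   var jaro = require( 'wink-jaro-distance' );
//   rankBySimilarity( 'daniel', [ 'danielle', 'god', 'father' ], jaro );
```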
Need Help?
If you spot a bug and the same has not yet been reported, raise a new issue or consider fixing it and sending a pull request.
Copyright & License
wink-jaro-distance is copyright 2017 GRAYPE Systems Private Limited.
It is licensed under the terms of the GNU Affero General Public License as published by the Free Software Foundation, version 3 of the License.