Security News
New Python Packaging Proposal Aims to Solve Phantom Dependency Problem with SBOMs
PEP 770 proposes adding SBOM support to Python packages to improve transparency and catch hidden non-Python dependencies that security tools often miss.
fuzzy-native
Advanced tools
Fuzzy string matching library package for Node. Implemented natively in C++ for speed with support for multithreading.
The scoring algorithm is heavily tuned for file paths, but should work for general strings.
(from main.js.flow)
export type MatcherOptions = {
// Default: false
caseSensitive?: boolean,
// Default: infinite
maxResults?: number,
// Maximum gap to allow between consecutive letters in a match.
// Provide a smaller maxGap to speed up query results.
// Default: unlimited
maxGap?: number;
// Default: 1
numThreads?: number,
// Default: false
recordMatchIndexes?: boolean,
}
export type MatchResult = {
value: string,
// A number in the range (0-1]. Higher scores are more relevant.
// 0 denotes "no match" and will never be returned.
score: number,
// Matching character index in `value` for each character in `query`.
// This can be costly, so this is only returned if `recordMatchIndexes` was set in `options`.
matchIndexes?: Array<number>,
}
export class Matcher {
constructor(candidates: Array<string>) {}
// Returns all matching candidates (subject to `options`).
// Will be ordered by score, descending.
match: (query: string, options?: MatcherOptions) => Array<MatchResult>;
addCandidates: (candidates: Array<string>) => void;
removeCandidates: (candidates: Array<string>) => void;
setCandidates: (candidates: Array<string>) => void;
}
See also the spec for basic usage.
The scoring algorithm is mostly borrowed from @wincent's excellent command-t vim plugin; most of the code is from his implementation in match.c.
Read the source code for a quick overview of how it works (the function recursive_match
).
NB: score_match.cpp and score_match.h have no dependencies besides the C/C++ stdlib and can easily be reused for other purposes.
There are a few notable additional optimizations:
MatcherBase
. We then compare this the "letter bitmask" of the query to quickly prune out non-matches.FAQs
Native C++ implementation of a fuzzy string matcher.
The npm package fuzzy-native receives a total of 2 weekly downloads. As such, fuzzy-native popularity was classified as not popular.
We found that fuzzy-native demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
PEP 770 proposes adding SBOM support to Python packages to improve transparency and catch hidden non-Python dependencies that security tools often miss.
Security News
Socket CEO Feross Aboukhadijeh discusses open source security challenges, including zero-day attacks and supply chain risks, on the Cyber Security Council podcast.
Security News
Research
Socket researchers uncover how threat actors weaponize Out-of-Band Application Security Testing (OAST) techniques across the npm, PyPI, and RubyGems ecosystems to exfiltrate sensitive data.