Security News
Introducing the Socket Python SDK
The initial version of the Socket Python SDK is now on PyPI, enabling developers to more easily interact with the Socket REST API in Python projects.
@simonjb/rake-js
Advanced tools
A pure JS implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm.
A pure JS implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm. Put in any text corpus, get back a bunch of keyphrases and keywords.
More languages are fairly easy to add, see the stoplist module for details.
Without any further options:
import rake from 'rake-js'
const myKeywords = rake(someTextContent) // ['keyword1, ...]
When the language is known in advance (faster execution):
import rake from 'rake-js'
const myKeywords = rake(someTextContent, { language: 'english' })
When the corpus is divided by something other than whitespace (eg: ;
):
import rake from 'rake-js'
const myKeywords = rake(someTextContent, { delimiters: [';+'] })
This algorithm is fast, compared with other approaches like TextRank. The results are surprisingly good for a cross-language algorithm, and the truly relevant keywords / phrases are included in the result in most cases. For more details about the RAKE algorithm, read the original paper.
There are still rough edges in the code, but I tried to translate the abstract algorithm into a solid software package, tested and typesafe. Actually I wrote this thing because I was very disappointed with all the existing solutions on NPM, and I hope this repository is easier to contribute to in the future.
LGPL-3.0.
You can use this package in all your free or commercial products without any issues, but I want bugfixes and improvements to this algorithm to flow back into the public code repository.
FAQs
A pure JS implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm.
The npm package @simonjb/rake-js receives a total of 97 weekly downloads. As such, @simonjb/rake-js popularity was classified as not popular.
We found that @simonjb/rake-js demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
The initial version of the Socket Python SDK is now on PyPI, enabling developers to more easily interact with the Socket REST API in Python projects.
Security News
Floating dependency ranges in npm can introduce instability and security risks into your project by allowing unverified or incompatible versions to be installed automatically, leading to unpredictable behavior and potential conflicts.
Security News
A new Rust RFC proposes "Trusted Publishing" for Crates.io, introducing short-lived access tokens via OIDC to improve security and reduce risks associated with long-lived API tokens.