Security News
Introducing the Socket Python SDK
The initial version of the Socket Python SDK is now on PyPI, enabling developers to more easily interact with the Socket REST API in Python projects.
KinGBERT(Keywords in Graph with BERT) is a minimal keyword extraction library with graph methods to extract keywords. Use Sentence-BERT embedding for founding the most significant keywords.
KinGBERT is available on PyPI.
pip install KinGBERT
To clone this repository, run
git clone https://github.com/sokolheavy/KinGBERT.git
We use the KinGBERTExtractor
class, which can be configured to generate keywords from text.
text = """What is data science?
Data science is a multidisciplinary approach to extracting actionable insights from the large and ever-increasing volumes of data collected and created by today’s organizations.
Data science encompasses preparing data for analysis and processing, performing advanced data analysis, and presenting the results to reveal patterns and enable stakeholders to draw informed conclusions.
Data preparation can involve cleansing, aggregating, and manipulating it to be ready for specific types of processing. Analysis requires the development and use of algorithms, analysis and AI models. It’s driven by software that combs through data to find patterns within to transform these patterns into predictions that support business decision-making. The accuracy of these predictions must be validated through scientifically designed tests and experiments. And the results should be shared through the skillful use of data visualization tools that make it possible for anyone to see the patterns and understand trends."""
Just extract 5 keywords from the text.
extractor = KinGBERTExtractor()
keywords = extractor.generate(doc)
>>> from KinGBERT import KinGBERTExtractor
>>> extractor = KinGBERTExtractor(top_k=5)
>>> keywords = extractor.generate(text)
>>> print(keywords)
['data science', 'insights', 'analysis', 'experiments', 'algorithms']
FAQs
Keywords extractor with Graph and BERT methods
We found that KinGBERT demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
The initial version of the Socket Python SDK is now on PyPI, enabling developers to more easily interact with the Socket REST API in Python projects.
Security News
Floating dependency ranges in npm can introduce instability and security risks into your project by allowing unverified or incompatible versions to be installed automatically, leading to unpredictable behavior and potential conflicts.
Security News
A new Rust RFC proposes "Trusted Publishing" for Crates.io, introducing short-lived access tokens via OIDC to improve security and reduce risks associated with long-lived API tokens.