KeyBERT performs keyword extraction with state-of-the-art transformer models.
Keyword extraction Python package
RAKE short for Rapid Automatic Keyword Extraction algorithm, is a domain independent keyword extraction algorithm which tries to determine key phrases in a body of text by analyzing the frequency of word appearance and its co-occurance with other words in the text.
A text summarization and keyword extraction package based on TextRank
Python3 implementation of the Schwartz-Hearst algorithm for extracting abbreviation-definition pairs
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
A python module implementing the Rapid Automatic Keyword Extraction algorithm.
A package for extracting keywords from large text very quickly (much faster than regex and the original flashtext package
KR-WordRank: Korean Unsupervised Word/Keyword Extractor
Extract higher level clusters from keywords
Easy extraction of keywords from search engine results pages (SERPs).
LAiSER (Leveraging Artificial Intelligence for Skill Extraction & Research) is a tool designed to help learners, educators, and employers extract and share trusted information about skills. It uses a fine-tuned language model to extract raw skill keywords from text, then aligns them with a predefined taxonomy. You can find more technical details in the project's paper.md and an overview in the README.md.
Bangla keyword extraction library which extracts keywords from text or list of texts
Easy extraction of keywords from search engine results pages (SERPs).
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Extract keywords and abstract Chinese article
Analyze resumes and job descriptions by extracting key sections and scoring them based on keyword relevance
A Japanese Keyword Extraction Library
KeyCARE is a Python library designed for the unsupervised keyword extraction from biomedical documents with the use of different algorithms, the classification of the keywords according to their semantic nature, and the extraction of is a relations among those keywords and with other terminologies.
Unsupervised Keyword Extracters
A keyphrase extractor for Persian
This is a simple library for extracting keywords from data with/without using a corpus.
Keyword extraction with spaCy
A Python library that enables smooth keyword extraction from any text using the RAKE(Rapid Automatic Keyword Extraction) algorithm.
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
BERT, LDA, and TFIDF based keyword extraction in Python
AI-powered knowledge management with YAKE keyword extraction for Claude Code development
String processing library for advanced NLP and keyword extraction
Efficient PDF analysis, text extraction, preprocessing, and pattern recognition with customizable configurations and utilities.
This package extracts important information from a pdf document such as heading, paragraphs and important keywords!!!
news-fetch is an open-source, easy-to-use news extractor with basic NLP features (cleaning text, keywords, summary) that just works.
A spaCy pipeline component for extracting keywords from text using cosine similarity.
Keyword Extractor is a simple tool for extracting keywords from a web page. It returns the keywords and the frequency of each keyword.
Python package for keyphrase labeling.
AdaptKeyBERT extended keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation.
('Rapid Automatic Keywords Extraction', 'Just a Practice')
The Python package for the BlankSort keyword extraction algorithm.
Extracts keywords with 'TF-IDF' algorithm
Extract sentiment, keywords, and entities from text.
Entropy Rank keyphrase extractor
Yet another Python implementation of TextRank: package for the creation, manipulation, and study of TextRank algorithm based keywords extraction and summarisation
Keyword extraction using transformer-based language models
keywords extract
Chinese keyword extraction using transformer-based language models
An unsupervised text summarization and information retrieval library under the hood using natural language processing models.