Perke
Perke is a Python keyphrase extraction package for Persian language. It
provides an end-to-end keyphrase extraction pipeline in which each component
can be easily modified or extended to develop new models.
Installation
Simple Example
Perke provides a standardized API for extracting keyphrases from a text. Start
by typing the 4 lines below to use TextRank
keyphrase extractor.
from perke.unsupervised.graph_based import TextRank
extractor = TextRank()
extractor.load_text(input='text or path/to/input_file')
extractor.weight_candidates(top_t_percent=0.33)
keyphrases = extractor.get_n_best(n=10)
For more in depth examples see the
examples
directory.
Documentation
Documentation and references are available at
Read The Docs.
Implemented Models
Perke currently, implements the following keyphrase extraction models:
- Unsupervised models
- Graph-based models
- TextRank: article
by Mihalcea and Tarau, 2004
- SingleRank: article
by Wan and Xiao, 2008
- TopicRank: article
by Bougouin, Boudin and Daille, 2013
- PositionRank: article
by Florescu and Caragea, 2017
- MultipartiteRank: article
by Boudin, 2018
Acknowledgements
Perke is inspired by pke.