QazNLTK: a package for Kazakh-language text processing.
What is it?
QazNLTK provides developers with a fast and convenient tool for processing text in the Kazakh language. Tailored to the linguistic characteristics of Kazakh, the library offers a set of natural language processing tools, including tokenization, sentence segmentation, similarity scoring, and Cyrillic-Latin transliteration of Kazakh text.
Main Features
Here are just a few of the things that qaznltk does well:
- Tokenizing Kazakh text by keyword frequency:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
text = input("Enter text: ")
tokens = qn.tokenize(text)
print(tokens)
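The output format of `tokenize` is not described here. As a rough illustration of what frequency-based tokenization means in general, here is a minimal standard-library sketch. It is not QazNLTK's implementation, and `frequency_tokens` is a hypothetical helper introduced only for this example.

```python
import re
from collections import Counter


def frequency_tokens(text):
    """Hypothetical helper: lowercase word tokens with counts, most frequent first."""
    # \w+ matches runs of Unicode letters and digits, so Kazakh Cyrillic words stay whole.
    words = re.findall(r"\w+", text.lower())
    # most_common() sorts by count; ties keep first-seen order.
    return Counter(words).most_common()


print(frequency_tokens("Бұл кітап. Бұл жақсы кітап!"))
```

The library may filter stop words or normalize word forms differently, so its results can differ from this sketch.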
- Segmenting Kazakh text into sentences:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
text = input("Enter text: ")
sent_tokens = qn.sent_tokenize(text)
print(sent_tokens)
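For comparison, a naive punctuation-based splitter can be sketched with the standard library alone. This is an assumption-laden illustration, not how `sent_tokenize` works internally, and `split_sentences` is a hypothetical name.

```python
import re


def split_sentences(text):
    """Hypothetical helper: split after '.', '!' or '?' when whitespace follows."""
    # The lookbehind keeps the punctuation mark attached to its sentence.
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    # Drop the empty string produced by empty input.
    return [part for part in parts if part]


print(split_sentences("Сәлем! Қалайсың? Мен жақсымын."))
```

A rule this simple also splits after abbreviations and initials, which is one reason to prefer a language-aware segmenter.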
- Evaluating a similarity score between two texts:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
textA = input("Enter text A: ")
textB = input("Enter text B: ")
similarity_score = qn.calc_similarity(textA, textB)
print(similarity_score)
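The metric and value range used by `calc_similarity` are not stated here. To show what a text similarity score can look like, the sketch below computes Jaccard similarity over word sets. This is a generic technique chosen for illustration, not necessarily the one QazNLTK uses, and `jaccard_similarity` is a hypothetical helper.

```python
import re


def jaccard_similarity(text_a, text_b):
    """Hypothetical helper: shared words divided by all distinct words, in [0, 1]."""
    words_a = set(re.findall(r"\w+", text_a.lower()))
    words_b = set(re.findall(r"\w+", text_b.lower()))
    if not words_a and not words_b:
        # Two empty texts are treated as identical by convention.
        return 1.0
    return len(words_a & words_b) / len(words_a | words_b)


# Two of the four distinct words are shared.
print(jaccard_similarity("бұл жақсы кітап", "бұл жаман кітап"))
```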
- Converting Kazakh text from Cyrillic to Latin using the ISO 9 standard:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
text = input("Enter text: ")
latin_text = qn.convert2latin(text)
print(latin_text)
- Converting Kazakh text from Latin to Cyrillic using the ISO 9 standard:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
text = input("Enter text: ")
cyrillic_text = qn.convert2cyrillic(text)
print(cyrillic_text)
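ISO 9 maps each Cyrillic letter to exactly one Latin character, which is what makes the conversion reversible. The sketch below illustrates that idea with a deliberately partial table covering only a few lowercase letters. The table, `LAT_TO_CYR`, and `transliterate` are assumptions made for this example and are not taken from QazNLTK.

```python
# Partial, illustrative table: a few lowercase letters that ISO 9 maps to one plain Latin letter.
CYR_TO_LAT = {
    "а": "a", "б": "b", "д": "d", "к": "k", "л": "l", "м": "m",
    "н": "n", "о": "o", "р": "r", "с": "s", "т": "t",
}
# A one-to-one table can simply be inverted for the reverse direction.
LAT_TO_CYR = {lat: cyr for cyr, lat in CYR_TO_LAT.items()}


def transliterate(text, table):
    """Hypothetical helper: map each character through the table."""
    # Characters missing from the table (including uppercase) pass through unchanged.
    return "".join(table.get(ch, ch) for ch in text)


latin = transliterate("салам", CYR_TO_LAT)
print(latin)
print(transliterate(latin, LAT_TO_CYR))
```

A complete converter also needs the Kazakh-specific letters, which ISO 9 writes with diacritics, and case handling.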
- Sentiment analysis of Kazakh text [negative: -1, neutral: 0, positive: 1]:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
text = input("Enter text: ")
sentimize_score = qn.sentimize(text)
print(sentimize_score)
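To make the -1 / 0 / 1 labels concrete, here is a toy lexicon-based scorer. It is only a sketch: the word lists are tiny and hand-picked for this example, `sentiment_label` is a hypothetical helper, and nothing here reflects how `sentimize` is actually implemented.

```python
import re

# Tiny hand-picked lexicons, assumed for illustration only.
POSITIVE = {"жақсы", "керемет"}  # "good", "wonderful"
NEGATIVE = {"жаман", "нашар"}    # "bad", "poor"


def sentiment_label(text):
    """Hypothetical helper: return 1, 0 or -1 from positive and negative word counts."""
    words = re.findall(r"\w+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    # Sign of the score: 1 if positive, -1 if negative, 0 otherwise.
    return (score > 0) - (score < 0)


print(sentiment_label("Бұл керемет кітап"))
```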
- Converting any number N into Kazakh number words [N <= 10^31]:
from qaznltk import qaznltk as qnltk
qn = qnltk.QazNLTK()
n = int(input())
print(qn.num2word(n))
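The composition rule behind number words can be sketched for a small range. The example below covers only 0 to 999 and assumes that 100 is written "жүз" rather than "бір жүз"; the library's wording may differ. `small_num2word` is a hypothetical helper, not part of QazNLTK.

```python
ONES = ["нөл", "бір", "екі", "үш", "төрт", "бес", "алты", "жеті", "сегіз", "тоғыз"]
TENS = ["", "он", "жиырма", "отыз", "қырық", "елу", "алпыс", "жетпіс", "сексен", "тоқсан"]


def small_num2word(n):
    """Hypothetical helper: Kazakh words for an integer in 0..999."""
    if not 0 <= n <= 999:
        raise ValueError("this sketch only covers 0..999")
    if n == 0:
        return ONES[0]
    hundreds, rest = divmod(n, 100)
    tens, ones = divmod(rest, 10)
    parts = []
    if hundreds:
        # Assumption: a single hundred is just "жүз"; multiples get a leading digit word.
        if hundreds > 1:
            parts.append(ONES[hundreds])
        parts.append("жүз")
    if tens:
        parts.append(TENS[tens])
    if ones:
        parts.append(ONES[ones])
    return " ".join(parts)


print(small_num2word(245))
```

Larger numbers follow the same pattern, with each three-digit group followed by its scale word.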
Where to get it
The source code is currently hosted on GitHub at: https://github.com/silvermete0r/QazNLTK.git
Binary installers for the latest released version are available at the Python Package Index (PyPI):
pip install qaznltk
For details of changes between releases, see the commit logs at https://github.com/silvermete0r/QazNLTK.
Dependencies
- The package was developed using built-in Python functions.
License
Getting Help
📧 supwithproject@gmail.com
Contributing to qaznltk
All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome.