wordtm 0.4.6
An NLP package for topic modeling on the Holy Scripture and other text
from low-code to pro-code
Installation
.. code:: shell
$ pip install wordtm
Usage
wordtm
can be used to perform some NLP pre-processing tasks, text
exploration, including Chinese one, text visualization (word cloud), and
topic modeling (BERTopic, LDA and NMF) as follows:
.. code:: python
from wordtm import meta, util, ta, tm, viz, pivot, quot
version Submodule
Provides some version information.
.. code:: python
import wordtm
print(wordtm.__version__)
meta Submodule
~~~~~~~~~~~~~~
Provides extracting source code of ``wordtm`` module and adding timing and code-showing features
to all functions of the module.
.. code:: python
print(meta.get_module_info())
print(meta.get_module_info(detailed=True))
meta.addin_all()
quot Submodule
~~~~~~~~~~~~~~
Provides functions to extract the quotation source Scripture in OT based on the presribed NT Scripture.
.. code:: python
cdf = util.load_word('cuv.csv')
crom8 = util.extract2(cdf, 'Rom 8')
quot.show_quot(crom8, lang='chi')
pivot Submodule
~~~~~~~~~~~~~~~
Provides a pivot table of the prescribed text.
.. code:: python
cdf = util.load_word('cuv.csv')
pivot.stat(cdf, chi=True)
ta Submodule
~~~~~~~~~~~~
Provides text analytics functions, including extracting the summarization of the prescribed text.
.. code:: python
cdf = util.load_word('cuv.csv')
crom8 = util.extract2(cdf, 'Rom 8')
ta.summary(rom8, code=True)
tm Submodule
~~~~~~~~~~~~~
Provides text modeling functions, including LDA, NMF and BERTopics modeling.
.. code:: python
lda = tm.lda_process("cuv.csv", chi=True, eval=True, timing=True)
nmf = tm.nmf_process("cuv.csv", chi=True, eval=True, code=1)
btm = tm.btm_process("cuv.csv", chi=True, cat='nt', eval=True)
util Submodule
~~~~~~~~~~~~~~
Provides loading text and text preprocessing functions.
.. code:: python
df = util.load_word()
cdf = util.load_word('cuv.csv')
df.head()
cdf.head()
rom8 = util.extract2(df, 'Rom 8')
crom8 = util.extract2(cdf, 'Rom 8')
viz Submodule
~~~~~~~~~~~~~
Wordcloud plotting from the prescribed text.
.. code:: python
cdf = util.load_word('cuv.csv')
viz.chi_wordcloud(cdf)
Contributing
------------
Interested in contributing? Check out the contributing guidelines.
Please note that this project is released with a Code of Conduct. By
contributing to this project, you agree to abide by its terms.
License
-------
``wordtm`` was created by Johnny Cheng. It is licensed under the terms
of the MIT license.
Credits
-------
``wordtm`` was created under the guidance of Jehovah, the Lord.