pdf_to_wordcloud
Generates a word cloud from a PDF file.
Installation
$ pip install pdf_to_wordcloud
Arguments
Positional:
PDF: Name of the PDF file from which to generate the wordcloud
Optional:
--remove (-r): Removes a word from the wordcloud. Takes one word per flag; repeat the flag to remove several words
--save (-s): Saves wordcloud as PDF to current directory (no additional argument needed)
--saveto (-st): Saves wordcloud to specified directory
--mask (-m): PNG file to use as shape of wordcloud
--exportdata (-x): Save CSV of word frequency data
--filename (-n): Optional base name for the wordcloud and data files (omit the .pdf/.csv extension)
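The flags listed above map naturally onto Python's argparse module. The following is a minimal sketch of how such an interface could be declared. It is not taken from the package's source: the function name build_parser, the destination names, and the defaults are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented flags (not the package's own code)."""
    parser = argparse.ArgumentParser(description="Generate a word cloud from a PDF")
    # Positional argument: the PDF to read.
    parser.add_argument("pdf", metavar="PDF", help="PDF file to generate the wordcloud from")
    # action="append" collects one word per occurrence of the flag.
    parser.add_argument("--remove", "-r", action="append", default=[],
                        help="word to remove (repeat the flag for several words)")
    # Boolean switch: no additional argument needed.
    parser.add_argument("--save", "-s", action="store_true",
                        help="save the wordcloud as a PDF in the current directory")
    parser.add_argument("--saveto", "-st", metavar="DIR",
                        help="save the wordcloud to the given directory")
    parser.add_argument("--mask", "-m", metavar="PNG",
                        help="PNG file to use as the shape of the wordcloud")
    # Assumed to be a switch, since the usage examples pass -x without a value.
    parser.add_argument("--exportdata", "-x", action="store_true",
                        help="save a CSV of word frequency data")
    parser.add_argument("--filename", "-n", metavar="NAME",
                        help="base name for output files, without extension")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(["file.pdf", "-r", "this", "-r", "that", "-s"])
print(args.remove, args.save)
```

Using action="append" is what makes the "one per flag" behaviour of --remove work: each -r contributes a single word to the list.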
Usage
Display wordcloud of file.pdf:
$ pdf file.pdf
Save image of wordcloud as PDF:
$ pdf file.pdf -s
Remove "this" and "that" from wordcloud and save:
$ pdf file.pdf -r this -r that -s
Remove "this" and "that" from the wordcloud, export the word frequency data, and save both files under the name "my_output":
$ pdf file.pdf -r this -r that -x -n my_output -s
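To make the removal and export options more concrete, here is a rough sketch of the underlying idea using only the standard library. It assumes the text has already been extracted from the PDF, and it does not reflect how pdf_to_wordcloud is actually implemented: the function names, the tokenization rule, and the CSV column names are all hypothetical.

```python
import csv
import io
import re
from collections import Counter


def word_frequencies(text, remove=()):
    """Count lowercase words in text, skipping any word listed in remove."""
    excluded = {word.lower() for word in remove}
    # Assumed tokenization: runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(word for word in words if word not in excluded)


def frequencies_to_csv(freqs, stream):
    """Write word/count rows to stream, most frequent first."""
    writer = csv.writer(stream)
    writer.writerow(["word", "count"])  # assumed column names
    writer.writerows(freqs.most_common())


# Stand-in for text extracted from a PDF.
sample = "This cat and that cat saw this dog"
freqs = word_frequencies(sample, remove=["this", "that"])

# Write to an in-memory buffer; a real run would open a .csv file instead.
buffer = io.StringIO()
frequencies_to_csv(freqs, buffer)
print(buffer.getvalue())
```

A frequency table like this is the kind of data a wordcloud is drawn from, so filtering words before counting affects both the image and the exported CSV.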