A high-resolution image-to-PCB converter. Gerbolyze plots SVG, PNG and JPG onto existing gerber files. It handles almost the full SVG spec and deals with text, path outlines, patterns, arbitrary paths with self-intersections and holes, etc. fully automatically. It can vectorize raster images both by contour tracing and by grayscale dithering. All processing is done at the vector level without intermediate conversions to raster images accurately preserving the input.
Python client for Cognica database
Wrappers for including pre-trained transformers in spaCy pipelines
Natural language processing utilities and examples for the book Natural Language Processing in Action (nlpia) 2nd Edition by Hobson Lane and Maria Dyshel.
Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python.
70+ beginner to intermediate level questions on CLI text processing tasks
Tools, wrappers, etc... for data science with a concentration on text processing
nlp library which support text processing methods, text match methods, etc.
Biome-text is a light-weight open source Natural Language Processing toolbox built with AllenNLP
Narrative analysis add-on for the Orange 3 data mining software package.
A python package for text preprocessing task in natural language processing
A forum scraper library
A package for CLIP-based image and text processing.
Can be used to pre process data before ai processing
Simplifies arranging text fragments with multiple speakers and processing it with coqui.ai TTS
A bunch of python codes to analyze text data in the construction industry. Mainly reconstitute the pre-exist python libraries for Natural Language Processing (NLP)
Utils for automatic document images processing
Flesch Kincaid readability scoring algorithm
Bloatectomy: a method for the identification and removal of duplicate text in the bloated notes of electronic health records and other documents.
A Speech-to-Text toolkit with VAD, punctuation, and emotion classification
Process text for NLP
Linguistic Pattern Lab using spaCy
CausalNLP: A Practical Toolkit for Causal Inference with Text
A multilingual voice recording and transcription tool with German and English support
Test package for distribution
Python library for SEO-friendly HTML text processing and keyword linking
A coreference resolution research toolkit.
User-friendly library to find similar objects
A Python library for processing Yiddish text
A ridiculously simple search engine factory
Text bolts for geniusrise
indxr: A Python utility for indexing long files.
A library for processing PDF documents, images, extracting text, parsing TSV to JSON, and merging JSON files
Expansion to the unstructured package, adding support for image extraction.
Pashto Natural Language Processing Toolkit
Python interface for eunjeon project & mecab based morphological analyzer.
Fast text processing acceleration.
A Python package to extract text from images and PDFs using Vision Language Model (VLM).
A library for processing Code Mixed Text. Still in development!
BELT (BERT For Longer Texts). BERT-based text classification model for processing texts longer than 512 tokens.
Natural language processing support for Pandas dataframes.
A powerful library for audio processing with advanced features for speech recognition, text translation, and speech synthesis.
Python port of open source text processing library for Turkish, zemberek-nlp
NK-HanDic package for installing via pip.
dummy csv, flat, json text file generator, typical usage scenario can be load / stress / performance testing of file-processing data tools
中文情感分析库(Chinese Sentiment))可对文本进行情绪分析、正负情感分析。