Normalize the text before natural language processing
Process FileMaker Pro Advanced's Database Design Report (DDR) to produce textual representations of the design objects for use with version control systems, text editors, etc.
Process text and propose tonality.
AlchemyLanguage is a collection of APIs that offer text analysis through natural language processing.
A Ruby interface to the Enrycher text-processing API.
git-scribe is a workflow tool for starting, writing, reviewing and publishing multiple forms of a book. It allows you to use AsciiDoc plain-text markup to write, review and translate a work, and provides a simple toolkit for generating common digital outputs for publishing: EPUB, MOBI, PDF and HTML. It is also integrated with GitHub, letting you automate the publishing and collaboration process.
agenndy is a minimal text-based activity log (or personal agenda). It takes a text file which follows some very basic (but strict) rules and turns it into a CSV file (which includes times, activities and hours spent for each activity) suited for further processing. For the schema of the text-based agenda, check out the examples/ directory.
This Ruby gem leverages machine learning (ML) techniques to make predictions (forecasts) and classifications in various applications. It provides capabilities such as predicting next month's billing, forecasting upcoming sales orders, identifying a patient's potential findings (such as diabetes), determining user approval status, classifying text, generating similarity scores, and making recommendations. It uses Python 3 under the hood, powered by popular machine learning techniques including NLP (natural language processing), decision trees, k-nearest neighbors, logistic regression, random forests and linear regression.
NoAccent is a Ruby gem designed to remove diacritic accents from text, providing cleaner and simpler text processing.
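One common way to remove diacritics is sketched below, using only Ruby's standard library. This is not NoAccent's API (the entry above does not describe it); `strip_accents` is a hypothetical helper name chosen for illustration.

```ruby
# Hypothetical helper for illustration; this is NOT the NoAccent gem's API.
def strip_accents(text)
  # NFD decomposition splits "é" into "e" followed by U+0301 (combining acute).
  decomposed = text.unicode_normalize(:nfd)
  # \p{Mn} matches nonspacing marks, which include the combining accents.
  decomposed.gsub(/\p{Mn}/, "")
end

puts strip_accents("Crème brûlée à São Paulo")
```

Letters without a canonical decomposition, such as "ø" or "ß", pass through this sketch unchanged; a dedicated gem may handle them with an explicit mapping table.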
A library to create text processing pipelines.
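As a rough illustration of what a text processing pipeline can look like, the sketch below chains callables that each take and return a String. The `TextPipeline` class and its `step` method are assumptions made for this example, not the API of the library described above.

```ruby
# Hypothetical pipeline class for illustration; not the library's real API.
class TextPipeline
  def initialize
    @steps = []
  end

  # Append a step: a block or any object responding to #call
  # that takes a String and returns a String.
  def step(callable = nil, &block)
    @steps << (callable || block)
    self # return self so calls can be chained
  end

  # Feed the text through every step in insertion order.
  def call(text)
    @steps.reduce(text) { |acc, s| s.call(acc) }
  end
end

pipeline = TextPipeline.new
  .step { |t| t.strip }
  .step { |t| t.downcase }
  .step { |t| t.gsub(/\s+/, " ") }

puts pipeline.call("  Hello   WORLD \n")
```

Keeping each step a plain callable means lambdas, method objects and small classes can be mixed freely in the same pipeline.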
This is a small class library of useful text processing routines I tend to use on my personal projects.
A simple multilingual tokenizer for NLP tasks. This tool provides a CLI and a library for linguistic tokenization, which is an unavoidable preprocessing step for many HLT (human language technology) tasks before further syntactic, semantic and other higher-level processing. Use it for tokenization of German, English and French texts.
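To make the idea of linguistic tokenization concrete, here is a minimal regex-based sketch. It is not the tokenizer gem's implementation; `TOKEN_PATTERN` and `tokenize` are hypothetical names, and a real tokenizer additionally handles abbreviations, clitics and language-specific rules.

```ruby
# Hypothetical regex tokenizer for illustration; not the gem's implementation.
# Alternatives, tried in order:
#   1. words, optionally joined by an apostrophe or hyphen ("Don't", "e-mail")
#   2. numbers with "." or "," separators ("3.50", "1,000")
#   3. any single character that is not whitespace, a letter or a digit
TOKEN_PATTERN = /\p{L}+(?:['-]\p{L}+)*|\d+(?:[.,]\d+)*|[^\s\p{L}\d]/

def tokenize(text)
  text.scan(TOKEN_PATTERN)
end

p tokenize("Das ist ein Test, oder?")
p tokenize("Don't panic: it costs 3.50 euros.")
```

Because the pattern uses the Unicode property `\p{L}`, letters with umlauts or accents stay inside their words rather than being split off as punctuation.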
An encouraging process wrapper that texts a ship-it squirrel on completion.
Base32 is one of several base-32 transfer encodings. It uses a 32-character set comprising the twenty-six upper-case letters A–Z and the digits 2–7. Base32 is primarily used to encode binary data, but it can also encode text such as ASCII. It is a notation for encoding arbitrary byte data using a restricted set of symbols that can be conveniently used by humans and processed by computers. Base32 consists of that 32-character symbol set and an algorithm for encoding arbitrary sequences of 8-bit bytes into it. Because more than one 5-bit Base32 symbol is needed to represent each 8-bit input byte, it also specifies requirements on the allowed lengths of Base32 strings: a padded string must be a multiple of 8 symbols, that is, 40 bits. The closely related Base64 system, in contrast, uses a set of 64 symbols.
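The encoding step described above can be sketched in a few lines of Ruby. This is a minimal illustration of the A–Z, 2–7 alphabet with `=` padding, not the gem's own code; `BASE32_ALPHABET` and `base32_encode` are names assumed for this example.

```ruby
# Minimal Base32 encoder sketch (alphabet A-Z, 2-7 with "=" padding).
# Hypothetical names; not taken from the gem described above.
BASE32_ALPHABET = ("A".."Z").to_a + ("2".."7").to_a

def base32_encode(bytes)
  # Turn every byte into 8 bits, producing one long bit string.
  bits = bytes.unpack1("B*")
  # Regroup into 5-bit chunks; the last chunk is right-padded with zero bits.
  symbols = bits.scan(/.{1,5}/).map do |chunk|
    BASE32_ALPHABET[chunk.ljust(5, "0").to_i(2)]
  end
  encoded = symbols.join
  # Pad with "=" so the length is a multiple of 8 symbols (40 bits).
  encoded + "=" * ((8 - encoded.length % 8) % 8)
end

puts base32_encode("foobar")
```

Five input bytes are exactly 40 bits, so they map to eight symbols with no padding; any other input length leaves a partial group that is filled with `=`.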
Allows simple processing of Russian strings: transliteration, numerals as text, and HTML beautification.
Process text and calculate RAKE.
=== What is GptHelpr? It is sometimes necessary to provide context and explanations for your code. Instead of manually copying and formatting code snippets, GptHelpr automates the process with an interactive CLI, generating a well-structured Markdown output, which can be copied to your clipboard or printed to file. === Example Usage # note lmk is an alias for gpt_helpr -i -ln $ lmk == 🏴☠️ GptHelpr 0.2.3 == Helping to dig your codebase and cook GPT-XX instructions [current directory /Users/etozzato/WorkSpace/_AINZ/pizzatarians.com] File Path (optional :start:end): TAB -> favicon.ico hey.md js random-acts-of-pizza.md _config.yml _site draft fonts images kneading-baking-academy.md _exe academy favicon.gif hands-in-dough.md index.md parties-and-events.md File Path (optional :start:end): hey.md 1:22 Instructions: can you improve this text? Do you see any issues with the template? File Path (optional :start:end): # this is the generated output (also copied to the clipboard) ==== file source `hey.md 1:22` 1: --- 2: title: Hey, hello! 3: layout: default 4: --- 5: 6: # {{ page.title }} 7: ---- 8: 9: <div class="row"> 10: <div class="col-md-12"> 11: <p class='justin'> 12: Nice to meet you, I am *Mek*! 13: </p> 14: <p class='listo'> 15: I am a self-proclaimed pizza guru and I am here to teach & learn. Originally from Venice, Italy you can find me in San Diego, CA. 16: </p> 17: <p class='listo'> 18: In my spare time, I write code @ PlayStation! 19: </p> 20: </div> 21: </div> 22: can you improve this text? Do you see any issues with the template? ==== end of `hey.md`
Goethe - Text processing library.
A text-processing command-line tool driven by Ruby's `#each_line`.
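The general shape of an `#each_line`-driven processor might look like the sketch below. The entry above does not describe the tool's interface, so `process_lines` and its arguments are hypothetical; the sketch only shows how a block can be applied to each line of any IO-like object.

```ruby
require "stringio"

# Hypothetical helper for illustration; not the tool's actual interface.
# Yields each line (without its trailing newline) to the block and writes
# every non-nil result to the output, so the block can transform or filter.
def process_lines(input, output)
  input.each_line do |line|
    result = yield line.chomp
    output.puts(result) unless result.nil?
  end
end

input = StringIO.new("alpha\nbeta\ngamma\n")
process_lines(input, $stdout) { |line| line.upcase if line.start_with?("b", "g") }
```

Because `#each_line` reads one line at a time, the same loop works on `$stdin` or an open `File` without loading the whole input into memory.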
A smart static site generator that automatically manages dependencies to achieve blazing build times with minimal cognitive load. Only new and changed files, and files upstream of a changed dependency, are processed. Renders Markdown or embedded-Ruby (ERB-like) content as HTML. Supports templates (embedded and layout), which may be included within content sources or other templates. Document metadata may be added using a plain-text preamble of key-value pairs. Generates a complete website that can be served by the built-in WEBrick server.
Hunyuan is a Ruby gem designed to simplify the integration of the Hunyuan API for chat completions into your Ruby applications. With Hunyuan, you can effortlessly add natural language processing capabilities, enabling your applications to provide intelligent responses to user queries. Whether you're building chatbots, virtual assistants, or any other application that requires text-based interactions, Hunyuan streamlines the process and empowers your Ruby code with advanced chat completion features.
Wikipedia articles are infamous for being heavily referenced. A single article can quickly turn into a rabbit hole where you start clicking on other links and soon get lost. This Ruby script converts a Wikipedia URL into simple text, giving you everything you need without the references. Removing references also comes in handy when you plug the output into a text-to-audio converter.
The ruby-amazon-bedrock gem offers Ruby developers an efficient and user-friendly interface to Amazon Bedrock, a managed AWS service for AI-driven text and image generation. This gem simplifies the process of connecting to Amazon Bedrock's APIs, enabling developers to easily harness the capabilities of advanced machine learning models for generating high-quality text and images.