We use this library at GitHub to detect blob languages, highlight code, ignore binary files, suppress generated files in diffs, and generate language breakdown graphs.
WhatLanguage rapidly detects the language of a sample of text
GitLab Language detection
Compact Language Detection for Ruby
Cross-language temporary (disposable/throwaway) email detection library. Covers 1979 fake email providers. http://twitter.com/FGRibreau
LanguageFilter is a Ruby gem to detect and optionally filter various categories of language.
Language Detection API Client
Compact Language Detection for Ruby
Perform unsupervised language detection, specifically for the purpose of finding English-language tweets.
Ruby language detection library using n-gram model
Ruby bindings for Chromium Compact Language Detector
Language detection
WhatLanguage rapidly detects the language of a sample of text
Compact Language Detection for Ruby
Language detection library for Ruby
Highly accurate language detection library, uses naive bayesian filter.
Detect language of given text
NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, paraphrasing, grammar and spelling correction, keywords and keyphrases extraction, chatbot, product description and ad generation, intent classification, text generation, image generation, code generation, question answering, automatic speech recognition, machine translation, language detection, semantic search, semantic similarity, speech synthesis, tokenization, POS tagging, embeddings, and dependency parsing. It is ready for production, served through a REST API. This is the Ruby client for the API. More details here: https://nlpcloud.io. Documentation: https://docs.nlpcloud.io.
The Subtitle gem helps you detect the language(s) of a caption file, translate closed captions into another language, and convert between formats, for example from dfxp to srt or vtt, or to all supported formats.
WhatLanguage rapidly detects the language of a sample of text
Ruby gem for the Dandelion API service. Available endpoints: Entity Extraction, Text Similarity, and Language Detection.
Small language detection gem using the detectlanguage.com API. Heavily inspired by the wtf_lang gem.
== Medusa: a ruby crawler framework

Medusa is a framework for the ruby language to crawl and collect useful information about the pages it visits. It is versatile, allowing you to write your own specialized tasks quickly and easily.

=== Features

* Choose the links to follow on each page with +focus_crawl+
* Multi-threaded design for high performance
* Tracks +301+ HTTP redirects
* Allows exclusion of URLs based on regular expressions
* Records response time for each page
* Obey _robots.txt_ directives (optional, but recommended)
* In-memory or persistent storage of pages during crawl, provided by Moneta[https://github.com/moneta-rb/moneta]
* Inherits OpenURI behavior (redirects, automatic charset and encoding detection, proxy configuration options).

<b>Do you have an idea or a suggestion? {Open an issue and talk about it}[https://github.com/brutuscat/medusa-crawler/issues/new]</b>

=== Examples

Medusa is versatile and meant to be used programmatically. You can start with one or multiple URIs:

  require 'medusa'

  Medusa.crawl('https://www.example.com', depth_limit: 2)

Or you can pass a block and it will yield the crawler back, to manage configuration or drive its crawling focus:

  require 'medusa'

  Medusa.crawl('https://www.example.com', depth_limit: 2) do |crawler|
    crawler.discard_page_bodies = some_flag

    # Persist the state of all pages across crawl runs.
    crawler.clear_on_startup = false
    crawler.storage = Medusa::Storage.Moneta(:Redis, 'redis://redis.host.name:6379/0')

    crawler.skip_links_like(/private/)

    crawler.on_pages_like(/public/) do |page|
      logger.debug "[public page] #{page.url} took #{page.response_time} found #{page.links.count}"
    end

    # Use arbitrary logic, page by page, to keep customizing the crawl.
    crawler.focus_crawl(/public/) do |page|
      page.links.first
    end
  end
This is a simple Ruby wrapper for Franc (https://github.com/wooorm/franc), Titus Wormer's awesome JavaScript language detection library. At this time, this gem depends on a global npm installation of franc and runs a bash command from Ruby to invoke the franc JavaScript. If you do not have franc installed, this gem will install it for you on the first run.
UHFerret is a copy-detection tool to analyse large sets of documents to find pairs of documents with substantial amounts of lexical copying. Documents containing either natural language (e.g. English) or computer programs (in C-family) may be processed.
Compact Language Detection for Ruby
Panache is a simple way to create style checkers for various languages. It does simple parsing of source files and then applies user-specified rules to detect style violations.
A rubygem to detect the language of a text (using Google Language Detector)
GitHub Language detection
This is a ruby implementation of Karel The Robot, a programming language designed for extreme beginners. It concerns controlling a robot, named Karel, in a grid-based world comprised of walls and beepers. Karel can pick up and put down beepers, move forward, and turn left. Karel can also detect things about his environment.
Ruby Library for the languagelayer API, powerful language detection, https://languagelayer.com/
Ruby bindings for Whatlang, a natural language detection library for Rust.
This is a Ruby API wrapper for consuming the https://linguin.ai/ API allowing you to detect the language of a text fast and with high accuracy.
Detect text language and translate text language in Rails Controller, Views, Model, Lib, etc
Small utility that allows you to detect scripts (languages) in Unicode text
Compact Language Detection from chrome (jtoy fork/gemfix)
GitHub Language detection (special OL dub)
RubyTranslate provides a simple wrapper around the Google Translate API for detecting and translating languages.
Natural language detection with a focus on simplicity and performance. Currently wraps the lingua-rs Rust crate.
A simple Ruby library for detecting spam, built around the bogofilter executable. The supported input formats are the same as in bogofilter: text, EML, and mbox.
The English language is extremely complicated. We have words that can have multiple parts of speech. Natural language processing is difficult because it is hard to tell if a word is a noun when it could be a verb or an adjective, etc. The purpose of this project is to develop an algorithm that, given a sentence string, uses a ranking system to detect the part of speech of each word. Why is this useful? Because understanding the correct parts of speech in a sentence is the first step to teaching a robot how to read.
Detect Locale for Rails app using request, session or accept-language headers.
Translation service designed to help you with a variety of language-based features including direct translations, alternate translations, definitions, examples, transliterations, spelling suggestions, language detection, and highly relevant keyword suggestions.
Ruby bindings for Whichlang, a natural language detection library for Rust.