LanguageTool is open-source proofreading software for English, French, German, Polish, Romanian, and more than 20 other languages. It finds many errors that a simple spell checker cannot detect, such as mixing up there/their, and it detects some grammar problems.
Language Detection Library for Java.
JPlag is a system that finds similarities among multiple sets of source code files. This way it can detect software plagiarism and collusion in software development.
A natural language detection library for Java and the JVM, suitable for long and short text alike
ASV Toolbox is a modular collection of tools for the exploration of written language data. They work either on word lists or text and solve several linguistic classification and clustering tasks. The topics covered include language detection, POS tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern-based and statistical approaches. The collection can be used to work on large real-world data sets as well as for studying the underlying algorithms. The ASV Toolbox can work on plain text files and connect to a MySQL database. While it is especially designed to work with corpora of the Leipzig Corpora Collection, it can easily be adapted to other sources.
Translate text from/to different languages, or detect text language
LanguageTool is open-source proofreading software for English, French, German, Polish, Romanian, and more than 20 other languages. It finds many errors that a simple spell checker cannot detect, such as mixing up there/their, and it detects some grammar problems.
This is the language detection module of the Apache Tika™ toolkit.
Fork of the com.cybozu.labs.langdetect language-detection library
A Java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-juniversalchardet module contains a charset provider for a character set that uses juniversalchardet to detect the charset automatically.
JPlag is a system that finds similarities among multiple sets of source code files. This way it can detect software plagiarism and collusion in software development.
Language Detection API Java client.
A Java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-jchardet module contains a charset provider for a character set that uses jchardet to detect the charset automatically.
Syntax highlighting and source code detection for various languages.
This project is a fork of the "language-detection" project hosted on Google Code by Nakatani Shuyo at Cybozu Labs, Inc. Original project web site: https://code.google.com/p/language-detection/ This port cleans it up a bit and mavenizes it.
Language detection for Elasticsearch
Provides modules that allow basic language support for Chinese using the Solr/Lucene smartcn analyzer. This includes (1) a Bundle providing the Solr Analyzer; (2) an NLP processing Engine that detects sentences and tokenizes Chinese text; and (3) a LabelTokenizer needed to match tokens of the analyzed text with the labels of Entities in the matched vocabularies.
Provides modules that bring language support for Japanese using the Solr/Lucene kuromoji analyzer. This includes (1) a Bundle providing the Solr Analyzer; (2) an NLP processing Engine that tokenizes, detects sentences, POS-tags, extracts Named Entities and lemmatizes Japanese text; and (3) a LabelTokenizer needed to match tokens of the analyzed text with the labels of Entities in the matched vocabularies.
Ingest processor doing language detection for fields
A worker for interpreting the languages used in a given input
A worker for interpreting the languages used in a given input
Language detection for 53 languages, based on http://code.google.com/p/language-detection
NLP library for Java.
cybozu-language-detection
cybozu-language-detection
This is a language detection library implemented in plain Java.
Language Detection Library for Java.
Language detection library implemented in pure Java
URL Language Detection Library for Java.
Provides support for TagMyCode (https://tagmycode.com). This plugin allows you to manage your own snippets.

Features:
* Add snippets: you can save your code snippets including description, language, and tags
* List snippets (CRUD): snippets are stored locally and you can filter, sort, create, edit or delete them directly from the IDE
* Quick search: you can search your snippets and insert them directly into the document

CHANGELOG:

2.3.0 (released 2020-07-26)
* published plugin to the Apache NetBeans Plugin Portal
* filter snippets by language

2.2.1 (released 2018-01-10)
* Quick Search dialog is now resizable
* fixed syntax highlighting for PHP and HTML
* if the refresh token is not valid, the user is automatically logged out

2.2.0 (released 2017-11-06)
* snippet management works in offline mode
* autodetect language on new snippet
* added settings dialog with editor theme and font size option
* added title and description to snippet view
* changed open browser class
* text can be dragged into the table to create a new snippet
* snippets can be dragged directly into the editor and the code is copied
* added "save as file" feature
* added "clone snippet" feature
* added "snippet properties" dialog
* detect binary file

2.1.0 (released 2017-04-24)
* moved error messages from dialog to NetBeans Notification Log
* added welcome panel
* about dialog shows plugin version and framework version
* moved storage from JSON to SQL

2.0 (released 2016-07-11)
* new user interface
* list of snippets stored locally
* syntax highlighting powered by RSyntaxTextArea (http://bobbylight.github.io/RSyntaxTextArea/)
* snippets are synchronized with server
* filter snippets
* quick search feature
* insert selected snippet at cursor in document

1.1.3 (released 2015-12-18)
* Fix for NetBeans 8.1

1.1.2 (released 2014-10-03)
* Switched authentication from OAuth 1.0a to OAuth 2
* Console also writes the snippet title when a new snippet is created (thanks to bejoy)

1.1 (released 2014-08-19)
* Added "Search snippets" feature
* Fixed some minor bugs

1.0 (released 2014-04-14)
* First release with feature "Create snippet"
NLP library for Java.
NLP library for Java.
Syntax highlighting and source code detection for various languages.
OpenCms module 'org.opencms.languagedetection'. This module supports text-based language detection. OpenCms is a Content Management System that is based on Open Source Software. Complex Intranet and Internet websites can be quickly and cost-effectively created, maintained and managed.
Syntax highlighting and source code detection for various languages.
PMD rules for detecting the use of racially charged or biased language
Package performing smell detection on the Robot Framework language.
This is a language detection library implemented in plain Java.
This is a language detection library implemented in plain Java.
github-pr-language-detection-lib
This library detects one or more languages out of any text (50+ languages supported).
LaukvikCSV is a powerful API for reading, writing and querying tabular data stored in the CSV format. In contrast to other APIs, it lets you specify data types for each column using metadata. It automatically detects delimiters, so you don't have to worry about whether the delimiter is a comma, tab, pipe, semicolon, etc. Run powerful queries to filter your data easily with a fluid query language that's type-safe. Export your tabular data to CSV, JSON, XML and HTML.
Language detection: detects the language of a string, using shuyo's language-detect and Google Translate.
Java library for language detection of a text snippet
Detect the language of a given text. This library is able to distinguish 68 languages.
Package performing smell detection on the Robot Framework language.
Negation detection based on POS tags as well as optional language-specific negation rules (e.g. so that "Unterstützung" is treated as negated in a phrase like "keine Unterstützung", even though it is grammatically not a negation)