Stanford CoreNLP provides a set of natural language analysis tools that take raw English text as input and give the base forms of words and their parts of speech, identify whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities. It provides the foundational building blocks for higher-level text understanding applications.
The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page. The library already provides specific strategies for common tasks (for example, news article extraction) and may also be easily extended for individual problem settings. Extracting content is very fast (milliseconds), needs only the input document (no global or site-level information required) and is usually quite accurate. Boilerpipe is a Java library written by Christian Kohlschütter. It is released under the Apache License 2.0. The algorithms used by the library build on and extend some concepts of the paper "Boilerplate Detection using Shallow Text Features" by Christian Kohlschütter et al., presented at WSDM 2010 -- The Third ACM International Conference on Web Search and Data Mining, New York City, NY, USA.
HarfBuzz is a text shaping library. Using the HarfBuzz library allows programs to convert a sequence of Unicode input into properly formatted and positioned glyph output—for any writing system and language.
A text and input validation library in Kotlin for Android.
The WicketStuff Multi-Text input is a JavaScript widget which mimics the To: input in Apple's Mail.app, or the <fb:multi-friend-input> input from Facebook. Essentially, the user may type a piece of text into the input, then hit the enter key to have the text added to the input in a rounded box. The user may delete the text entered by either clicking on an x button within the rounded box, or hitting the backspace key to highlight the text, then hitting backspace again to delete it. Two sets of hidden inputs are used to keep track of the text added to the input and the text deleted.
An Android library containing a custom implementation of the EditText component for masking and formatting input text.
Remark is a library for taking (X)HTML input and outputting clean Markdown, Markdown Extra, or MultiMarkdown compatible text.
See the child project "multi-text-input" for a more complete description. This is the parent for the actual project and the examples.
A Java library that parses HTML as input and returns clean, easy-to-read text.
Implements a kind of auto-complete text input field
An example GWT Angular module demonstrating the use of a text input field.
TextInput components with Vitamin design
WebJar for text-input
An overlay that gets your user's voice permission and input as text
FormatEditText can be used as a formatted text input box.
Convenience for dealing with Java resources as properties, files, text and input streams.
NextInputs: A Text Validation Library for Java
Mootools MeioMask Plugin Integration for Masking Text Input
WebJar for @workday/canvas-kit-css-text-input
Information extraction is the process of identifying specified classes of entities, relations, and events in natural language text – creating structured data from unstructured input. JET, the Java Extraction Toolkit, developed at New York University over the past fifteen years, provides a rich set of tools for research and education in information extraction from English text. These include standard language processing tools such as a tokenizer, sentence segmenter, part-of-speech tagger, name tagger, regular-expression pattern matcher, and dependency parser. Also provided are relation and event extractors based on the specifications of the U.S. Government's ACE [Automatic Content Extraction] program. The program is provided under an Apache 2.0 license.
This API fills a multi-children tree data structure from a coherent text input or from a relational database (unary relationship).
An input text field that autocompletes e-mail addresses for Android
parboiled is a mixed Java/Scala library providing lightweight and easy-to-use, yet powerful and elegant parsing of arbitrary input text based on parsing expression grammars (PEGs). This bundle wraps two different libraries, parboiled-core and parboiled-java, as they offer classes in the same package path.
**Codelet**: Automated insertion of *already unit-tested* example code (its source code, console output, and input text-files) into JavaDoc using inline taglets--Codelet makes it possible to have *always accurate documentation*.
WebJar for @workday/canvas-kit-react-text-input
The JWebSwing implementation for the Text Input Effects collection
This component processes the text resources contained in the input Resource to identify the language in which they are written. A dc:language property is added to every Text section, with the name of the ngp file used as the language profile as its value.
This service integrates the Apache Tika project. It extracts metadata and text content from many file formats. The input WebLab document is enriched with RDF properties for the metadata and Text unit(s) for the content. The service can be configured through the CXF Spring bean to handle various features (whether or not to identify the language, whether to provide a normalised XHTML output of the document...).
WebJar for d2l-text-input
Contains a debugging version of the SCIE Webservice, performing only ontology-based Named Entity Recognition. This webservice can thus be used to list all the ontological named entities found in the input text.
Add a simple text input field to the component configuration
This is an open source library for parsing text-based inputs. PARSER makes use of context-free grammars to validate and parse sentences. Having specified such a grammar, the library takes care of parsing text-based inputs and returns a tree of nodes that represents the parsed sentence. Furthermore, it provides ways for interpreting and refining the returned tree of nodes. In essence, the library aims to remove the pain of parsing and instead allow focusing on processing and interpreting parsed results.
A StAX parser wrapper that retains the original text from the input XML.
A library to validate int and double input for JavaFX and Swing text input
Suji is a converter library from Japanese number notation to numerical values, and from numerical notation to Japanese Kansuji notation. Converter.values(src) converts from Japanese number notation to numerical values; the notation can include Kansuji. The String "1つの価格が二兆30万五千十7円になります。" will be converted to two BigDecimal values, 1 and 2000000305017. Likewise, 打率は三割二部五厘です。 will yield 0.325. The return value is a list of Suji Numeral objects. If the input string has no number notation, Suji returns an empty list. The Numeral object has three methods: value(), begin(), and end(). value(): a BigDecimal instance holding the numerical value of the number notation. begin(): the begin index (int) of the found number notation in the input string. end(): the end index (int) of the found number notation. Converter.kansujis(src) converts from numeric notation to Japanese Kansuji notation. The String 20兆30万五千十7円になります。 will be converted to the Kansuji string 二十兆三十万五千十七. The boolean flag one controls whether the leading character 一 is displayed or not. The output of Converter.kansujis('1000万', true) will be 一千万, and the output of Converter.kansujis('1000万', false) will be 千万. Note that kansujis does not support numerical notation after the decimal point. If the input string is 32.01, the output will be 三十二, not 三十二割一厘. The return value is a list of Kansuji objects. If the input string has no number notation, Suji returns an empty list. The Kansuji object has three methods: value(), begin(), and end(). value(): a String instance holding the Kansuji notation. begin(): the begin index (int) of the found number notation in the input string. end(): the end index (int) of the found number notation. Suji is a one-pass parser. That is, Suji parses a source text from the head to the end only once.
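The conversions described for Suji can be illustrated with a small standalone sketch. This is not Suji's implementation and does not use its API: the class KansujiSketch and its methods parse and toKansuji are hypothetical names introduced here, the sketch handles only a single non-negative integer numeral (no decimals, no surrounding text, no begin/end indices), and the treatment of the one flag (prepend 一 only when the output would otherwise start with 十, 百, or 千) is an assumption inferred from the 一千万 / 千万 example.

```java
public class KansujiSketch {
    private static final String DIGITS = "〇一二三四五六七八九";
    private static final String SMALL_UNITS = "十百千"; // 10, 100, 1000
    private static final String LARGE_UNITS = "万億兆"; // 10^4, 10^8, 10^12

    private static long pow10(int exp) {
        long result = 1;
        for (int i = 0; i < exp; i++) result *= 10;
        return result;
    }

    /** Parses one mixed Arabic/Kansuji integer numeral in a single left-to-right pass. */
    static long parse(String s) {
        long total = 0;    // sum of groups already closed by 万, 億 or 兆
        long section = 0;  // value of the current group below 10^4
        long num = 0;      // digits read since the last unit character
        boolean hasNum = false;
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            int d = (c >= '0' && c <= '9') ? c - '0' : DIGITS.indexOf(c);
            int small = SMALL_UNITS.indexOf(c);
            int large = LARGE_UNITS.indexOf(c);
            if (d >= 0) {
                num = num * 10 + d;
                hasNum = true;
            } else if (small >= 0) {
                // A bare unit such as 十 counts as 1 x 10.
                section += (hasNum ? num : 1) * pow10(small + 1);
                num = 0;
                hasNum = false;
            } else if (large >= 0) {
                total += (section + num) * pow10(4 * (large + 1));
                section = 0;
                num = 0;
                hasNum = false;
            } else {
                throw new IllegalArgumentException("Not part of a numeral: " + c);
            }
        }
        return total + section + num;
    }

    /** Formats 0 <= n < 10^16 as Kansuji; 'one' prepends 一 to a leading 十, 百 or 千 (assumed meaning). */
    static String toKansuji(long n, boolean one) {
        if (n < 0 || n >= pow10(16)) throw new IllegalArgumentException("Out of range: " + n);
        if (n == 0) return "〇";
        StringBuilder out = new StringBuilder();
        for (int g = 3; g >= 0; g--) {              // groups: 兆, 億, 万, ones
            int group = (int) (n / pow10(4 * g) % 10000);
            if (group == 0) continue;
            for (int p = 3; p >= 0; p--) {          // positions: 千, 百, 十, ones
                int digit = group / (int) pow10(p) % 10;
                if (digit == 0) continue;
                if (digit > 1 || p == 0) out.append(DIGITS.charAt(digit));
                if (p > 0) out.append(SMALL_UNITS.charAt(p - 1));
            }
            if (g > 0) out.append(LARGE_UNITS.charAt(g - 1));
        }
        if (one && SMALL_UNITS.indexOf(out.charAt(0)) >= 0) out.insert(0, '一');
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(parse("二兆30万五千十7"));              // 2000000305017
        System.out.println(toKansuji(20000000305017L, false)); // 二十兆三十万五千十七
        System.out.println(toKansuji(10000000L, true));        // 一千万
        System.out.println(toKansuji(10000000L, false));       // 千万
    }
}
```

The parser keeps three running values (pending digits, the current sub-10^4 group, and the closed large-unit total), which is what lets it finish in one pass without backtracking.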
Custom semantics created for test artifact