Freeling::Analyzer
FreeLing::Analyzer is a Ruby wrapper around analyzer
, a binary tool
included in FreeLing's package that allows the user to process a stream of text
with FreeLing.
This has been tested with version 3.0+ only.
Usage
text = "Mi amigo Juan Mesa se mesa la barba al lado de la mesa."
analyzer = FreeLing::Analyzer.new(text, :language => :es)
analyzer.tokens.first
analyzer.tokens.map { |t| t.lemma }
Features
- Analyzer is lazy, it does not spawn the process until needed and it sends
the input text to
analyzer
on demand. - It just works with the default instalation of FreeLing. Just set the language
to use and you're good to go.
- It supports Freeling's client/server mode. Ex:
FreeLing::Analyzer.new(text, :server_host => 'localhost:50005')
(analyze
must be running in server mode)
Installation
Add this line to your application's Gemfile:
gem 'freeling-analyzer'
And then execute:
$ bundle
Or install it yourself as:
$ gem install freeling-analyzer
FreeLing
FreeLing is an open source suite of
language analyzers written in C++.
The main services offered are:
- Text tokenization
- Sentence splitting
- Morphological analysis
- Suffix treatment
- Retokenization of clitic pronouns
- Flexible multiword recognition
- Contraction splitting
- Probabilistic prediction of unkown word categories
- Named entity detection (NER)
- Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
- PoS tagging
- Chart-based shallow parsing
- Named entity classification (NEC)
- WordNet based sense annotation and disambiguation
- Rule-based dependency parsing
- Nominal correference resolution.
Currently supported languages are:
- Spanish
- Catalan
- Galician
- Italian
- English
- Welsh
- Portuguese
- Asturian
Contributing
- Fork it
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Added some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request