= scylla
Scylla is a language-categorization gem that guesses the language of a given text. It is a Ruby port of TextCat (http://www.let.rug.nl/~vannoord/TextCat) and is based on the text categorization algorithm presented in Cavnar, W. B. and J. M. Trenkle, "N-Gram-Based Text Categorization", in Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, NV, UNLV Publications/Reprographics, pp. 161-175, 11-13 April 1994.
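For readers unfamiliar with the cited algorithm, the following is a minimal sketch of the N-gram rank-order idea, not scylla's actual implementation. The method names (ngram_profile, out_of_place, guess), the profile size of 300, the n-gram lengths 1 to 5, and the tiny training samples are all assumptions chosen for illustration. Each text is reduced to a list of its most frequent n-grams, and a document is assigned to the language whose ranked list differs least from its own.

```ruby
# Illustrative sketch only; names and parameters are hypothetical,
# not taken from scylla or TextCat.

# Build a ranked n-gram profile: most frequent n-grams (n = 1..max_n) first.
def ngram_profile(text, max_n: 5, size: 300)
  counts = Hash.new(0)
  text.downcase.scan(/[[:alpha:]']+/).each do |word|
    padded = "_#{word}_" # mark word boundaries
    (1..max_n).each do |n|
      next if padded.length < n
      (0..padded.length - n).each { |i| counts[padded[i, n]] += 1 }
    end
  end
  # Sort by descending count; break ties alphabetically so results are stable.
  counts.sort_by { |gram, count| [-count, gram] }.first(size).map(&:first)
end

# "Out-of-place" distance: sum of rank differences between two profiles.
# N-grams missing from the language profile get a fixed maximum penalty.
def out_of_place(doc_profile, lang_profile)
  ranks = lang_profile.each_with_index.to_h
  max_penalty = lang_profile.length
  doc_profile.each_with_index.sum do |gram, i|
    ranks.key?(gram) ? (ranks[gram] - i).abs : max_penalty
  end
end

# Pick the language whose profile is closest to the document's profile.
def guess(text, profiles)
  doc = ngram_profile(text)
  profiles.min_by { |_lang, profile| out_of_place(doc, profile) }.first
end

# Toy training data (assumed); real profiles need far more text per language.
samples = {
  "english" => "the quick brown fox jumps over the lazy dog and the cat",
  "spanish" => "el rápido zorro marrón salta sobre el perro perezoso y el gato"
}
profiles = samples.transform_values { |text| ngram_profile(text) }
puts guess("the dog and the fox", profiles)
```

With profiles this small the guesses are fragile; the sketch is only meant to show how ranked n-gram lists are compared.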
== Installation
  gem install scylla
== Usage
  require 'scylla'
  "this is english text".language      # => "english"
  "Este es un texto español".language  # => "spanish"
When a text could plausibly be in more than one language, guess_language returns the list of candidates:
  "isso poderia ser confundido com espanhol, bem".language        # => "portuguese"
  "isso poderia ser confundido com espanhol, bem".guess_language  # => ["portuguese", "spanish"]
== Training
Training data is fetched from Wikipedia. To fetch the latest articles (for each language, the article on the country name in that language, e.g. "England" for English or "日本" for Japanese), run:
  rake scylla:train
== Contributing to scylla
== Copyright
Copyright (c) 2011 Ashwin Hegde. See LICENSE.txt for further details.