edu.usc.ir:age-predictor
Ensemble age classification from text, using the PAN16, blogs, Fisher Callhome, and Cancer Forum datasets with Apache OpenNLP and Apache Spark.
This is an author age categorizer that leverages the Apache OpenNLP Maximum Entropy Classifier. It takes a text sample and classifies it into one of the following age categories: xx-18|18-24|25-34|35-49|50-64|65-xx.
Note: The training data should be organized line by line, with each line starting with the age or age category, followed by a tab and the text associated with that age.
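To make the six categories concrete, here is a minimal sketch of how a numeric age could be mapped onto them. The class `AgeBuckets` and its method are hypothetical and not part of the tool. The boundary handling is also an assumption: the labels `xx-18` and `18-24` both mention 18, and this sketch places 18 in `18-24`.

```java
/** Hypothetical helper, not part of age-predictor: maps an age to a category label. */
final class AgeBuckets {

    private AgeBuckets() {}

    /**
     * Returns one of xx-18, 18-24, 25-34, 35-49, 50-64, 65-xx.
     * Assumption: ages below 18 fall in "xx-18" and age 18 falls in "18-24".
     */
    static String categorize(int age) {
        if (age < 0) {
            throw new IllegalArgumentException("age must be non-negative: " + age);
        }
        if (age < 18) return "xx-18";
        if (age <= 24) return "18-24";
        if (age <= 34) return "25-34";
        if (age <= 49) return "35-49";
        if (age <= 64) return "50-64";
        return "65-xx";
    }

    public static void main(String[] args) {
        // Print the category for a few example ages.
        for (int age : new int[] {12, 25, 35, 45, 60}) {
            System.out.println(age + " -> " + categorize(age));
        }
    }
}
```

A helper like this could be used to convert raw ages into category labels before writing a training file, if category labels are preferred over raw ages.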
Usage: bin/authorage AgeClassifyTrainer [-factory factoryName] [-featureGenerators featuregens] [-tokenizer tokenizer] -model modelFile [-params paramsFile] -lang language -data sampleData [-encoding charsetName]
Arguments description:
-factory factoryName
a subclass of DoccatFactory from which to get implementations and resources.
-featureGenerators featuregens
comma-separated feature generator classes. Bag of words is the default.
-tokenizer tokenizer
tokenizer implementation. WhitespaceTokenizer is used if not specified.
-model modelFile
output model file.
-params paramsFile
training parameters file.
-lang language
language which is being processed.
-data sampleData
data to be used, usually a file name.
-encoding charsetName
encoding for reading and writing text; if absent, the system default is used.
Example Usage:
bin/authorage AgeClassifyTrainer -model model/en-ageClassify.bin -lang en -data data/train.txt -encoding UTF-8
Training data format: age and text separated by a tab on each line, as <AGE><Tab><TEXT>.
Sample training data:
12 I am just 12 year old
25 I am little bigger
35 I am mature
45 I am getting old
60 I am old like wine
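The `<AGE><Tab><TEXT>` layout above can be checked before training with a small parser. The following is a sketch only: `TrainingLine` is a hypothetical class, not part of the tool, and the validation rules (non-empty age, non-empty text, split on the first tab) are assumptions about what a well-formed line looks like.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

/** Hypothetical helper, not part of age-predictor: reads and writes training lines. */
final class TrainingLine {

    private TrainingLine() {}

    /** Splits a line at its first tab into (age or category, text). */
    static Map.Entry<String, String> parse(String line) {
        int tab = line.indexOf('\t');
        if (tab <= 0) {
            throw new IllegalArgumentException("missing age or tab: " + line);
        }
        String age = line.substring(0, tab).trim();
        String text = line.substring(tab + 1).trim();
        if (age.isEmpty() || text.isEmpty()) {
            throw new IllegalArgumentException("empty age or text: " + line);
        }
        return new SimpleEntry<>(age, text);
    }

    /** Builds a training line from an age (or category label) and its text. */
    static String format(String age, String text) {
        return age + "\t" + text;
    }

    public static void main(String[] args) {
        Map.Entry<String, String> sample = parse(format("12", "I am just 12 year old"));
        System.out.println(sample.getKey() + " | " + sample.getValue());
    }
}
```

Splitting on the first tab only means that any later tabs stay inside the text field.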
Usage: bin/authorage AgeClassifyEvaluator -model model [-misclassified true|false] -data sampleData [-encoding charsetName]
Arguments description:
-model model
the model file to be evaluated.
-misclassified true|false
if true, prints false negatives and false positives.
-data sampleData
data to be used, usually a file name.
-encoding charsetName
encoding for reading and writing text; if absent, the system default is used.
Example Usage:
bin/authorage AgeClassifyEvaluator -model model/en-ageClassify.bin -data data/test.txt -encoding UTF-8
Note: Each document must be followed by an empty line to be detected as a separate case from the others.
Usage: bin/authorage AgeClassify model < documents
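The blank-line convention in the note above can be illustrated with a short sketch that groups input lines into documents. `DocumentSplitter` is a hypothetical class and does not reflect how the tool reads its input internally. One assumption to note: this sketch also accepts a final document with no trailing empty line, whereas the note says the tool expects one.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper, not part of age-predictor: splits text into blank-line-separated documents. */
final class DocumentSplitter {

    private DocumentSplitter() {}

    static List<String> split(String input) {
        List<String> documents = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        // "\\R" matches any line break sequence (\n, \r\n, ...).
        for (String line : input.split("\\R", -1)) {
            if (line.trim().isEmpty()) {
                // An empty line closes the document collected so far.
                if (current.length() > 0) {
                    documents.add(current.toString());
                    current.setLength(0);
                }
            } else {
                if (current.length() > 0) {
                    current.append('\n');
                }
                current.append(line);
            }
        }
        // Lenient: keep a trailing document that lacks its closing empty line.
        if (current.length() > 0) {
            documents.add(current.toString());
        }
        return documents;
    }

    public static void main(String[] args) {
        String input = "first document\n\nsecond document\nstill the second\n\n";
        System.out.println(split(input).size() + " documents");
    }
}
```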
Usage: bin/authorage AgePredict ./model/classify-unigram.bin ./model/regression-global.bin data/sample_test.txt
For AgePredict to work, you need to download en-pos-maxent.bin, en-sent.bin, and en-token.bin from http://opennlp.sourceforge.net/models-1.5/ to model/opennlp/.
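Before running AgePredict, it may help to confirm that the three model files are in place. The sketch below is a hypothetical pre-flight check, not part of the tool. It only tests that the files exist in a given directory and does not download or validate them.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical pre-flight check, not part of age-predictor. */
final class ModelCheck {

    // File names taken from the requirement stated above.
    static final List<String> REQUIRED =
            Arrays.asList("en-pos-maxent.bin", "en-sent.bin", "en-token.bin");

    private ModelCheck() {}

    /** Returns the required model file names that are absent from the directory. */
    static List<String> missing(File directory) {
        List<String> absent = new ArrayList<>();
        for (String name : REQUIRED) {
            if (!new File(directory, name).isFile()) {
                absent.add(name);
            }
        }
        return absent;
    }

    public static void main(String[] args) {
        // Default to the directory named above; a different path may be passed as an argument.
        File directory = new File(args.length > 0 ? args[0] : "model/opennlp");
        List<String> absent = missing(directory);
        if (absent.isEmpty()) {
            System.out.println("All OpenNLP models found in " + directory);
        } else {
            System.out.println("Missing from " + directory + ": " + absent);
        }
    }
}
```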
FAQs
We found that edu.usc.ir:age-predictor shows an unhealthy version release cadence and level of project activity: the last version was released a year ago, and it has 0 open source maintainers collaborating on the project.