Security News
The Unpaid Backbone of Open Source: Solo Maintainers Face Increasing Security Demands
Solo open source maintainers face burnout and security challenges, with 60% unpaid and 60% considering quitting.
de.unistuttgart.ims:de.unistuttgart.ims.drama
Advanced tools
Several component for processing dramatic texts (theatre plays) with Apache UIMA.
This repository contains a number of UIMA components to process dramatic texts, as well as an executable pipeline. We follow general design ideas implemented in DKPro Core. The full pipeline reads in files in several TEI/XML dialects (see below), and applies the most important NLP tools on them, while keeping the structural annotation of the plays intact (and, if necessary, processing different text layers separately).
git clone https://github.com/quadrama/DramaNLP.git
cd DramaNLP
git checkout develop/1.0
mvn compile install
This produces a lot of output, but at the end, you should see something like BUILD SUCCESS
cd de.unistuttgart.ims.drama.main
and run mvn package
. This creates a file called drama.Main.jar
in the directory target/assembly/
. This file contains the code and all its dependencies.As an example, we'll work on the data from the GerDraCor collection (which is based on TextGrid). Download the files from GitHub and store the XML files in a directory. We will call the directory $TEIDIR
in the following examples. The directory $OUTDIR
is used to store the output of the pipeline. You'll need the file drama.Main.jar
.
Enter the following command in the command line interface:
java -cp target/assembly/drama.Main.jar de.unistuttgart.ims.drama.main.TEI2XMI --input $TEIDIR --output $OUTDIR/xmi --csvOutput $OUTDIR/csv --conllOutput $OUTDIR/conll --skipSpeakerIdentifier --corpus GERDRACOR --collectionId "gdc" --doCleanup
After running, the directory $OUTDIR
contains three sub directories, xmi
, csv
and conll
, which are different file formats for the plays.
This package supports the following drama corpora
FAQs
Several component for processing dramatic texts (theatre plays) with Apache UIMA.
We found that de.unistuttgart.ims:de.unistuttgart.ims.drama demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 0 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Solo open source maintainers face burnout and security challenges, with 60% unpaid and 60% considering quitting.
Security News
License exceptions modify the terms of open source licenses, impacting how software can be used, modified, and distributed. Developers should be aware of the legal implications of these exceptions.
Security News
A developer is accusing Tencent of violating the GPL by modifying a Python utility and changing its license to BSD, highlighting the importance of copyleft compliance.