Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More →

awessome

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

awessome

0.0.14
PyPI

Maintainers: 1

AWESSOME

A Word Embedding Sentiment Scorer Of Many Emotions (AWESSOME) is a framework with the purpose of predicting the sentiment intensity of sentences.

AWESSOME relies on sentiment seed-words and word embedding, where the similarity between the vector representation of two sentences is considered as a reflection of their sentiment similarity.

AWESSOME capitalizes on pre-existing lexicons (VADER , LabMT), but custom lexicons can also be used, and created using AWESSOME.

AWESSOME also draws upon the recent advances in language model by using the Transformers from HuggingFace, to create word embeddings using BERT, RoBERTa, etc.

AWESSOME is scalable, and does not require any training data, while providing more fine grained (and accurate) sentiment intensity scores of words, phrases and text.

Citation Information

If you use the AWESSOME sentiment analysis tools in your research, please cite the following paper. For example:

Htait, A. & Azzopardi, L. (2020). ...... 2020.

Installation

To install AWESSOME:

#. The simplest is to use the command line to do an installation from [PyPI] using pip, e.g., pip install awessome #. If you already have AWESSOME and simply need to upgrade to the latest version, e.g., pip install --upgrade awessome #. You could also clone this [GitHub repository] : https://github.com/cumulative-revelations/awessome #. You could download and unzip the [full master branch zip file] : https://github.com/cumulative-revelations/awessome/archive/master.zip

In addition to the AWESSOME Python module, you will also be downloading two lexicon dictionaries (VADER , LabMT).

Python Demo and Code Examples

The AWESSOME framework can be flexibility adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons without the need for extensive supervised learning and retraining.

Through parameters, AWESSOME gives the possibility to:

#. Choose between different available pre-trained language models, such as: BERT (bert-base-nli-mean-tokens) and Distilbert (distilbert-base-nli-stsb-mean-tokens) Note: some pre-trained language models would need GPU. #. Employ different aggregation methods on the similarity scores of the sentence with each term in the seeds lists: Average (avg), Maximum (max) and Sum (sum). #. Select one of two possible similarity measures, provided by scipy: cosine and euclidean. Note: if no similarity measure is provided, cosine is applied as a default measure. #. Select a source of positive and negative seeds lists, where the user can provide a new lexicon file, or used the pre-built lexicons: vader or labmt (built based on VADER and LabMT sentiment lexicons). Note: if no lexicon file is provided, vader is applied as a default seeds lists source. #. Choose the size of seeds lists, created based on the lexicon files. Note: if no size is provided, the value of 500 is used as default seeds lists size. #. In addition, AWESSOME gives the possiblity to apply a "Weighted Similarity" to seeds, by multiplying the similarity score by the sentiment score of the seeds. users have the option to use that feature of note by simply choosing "weighted" as True or False. Note: if the weighted value is not provided, it is considered by default as False.

An example Demo is added under the name of : awessome_demo.py

Change log

0.0.1 (20/10/2020)

- First Release

Keywords

FAQs

What is awessome?

Is awessome well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

awessome

AWESSOME

Citation Information

Installation

Python Demo and Code Examples

Change log

0.0.1 (20/10/2020)

Keywords

Related posts

Input Validation Vulnerabilities Dominate MITRE's 2024 CWE Top 25 List

Risky Business Podcast: Why Open Source Software Needs Better Malware Tracking