Research
Security News
Malicious npm Packages Inject SSH Backdoors via Typosquatted Libraries
Socket’s threat research team has detected six malicious npm packages typosquatting popular libraries to insert SSH backdoors.
This package is used to analyse sentiment, and emojis are also included in this sentiment analysis. The package uses a pre-trained model to assign an emotion to the emojis
This package aims to account for emojis in sentiment analysis, making sentiment analysis better when emojis are included in the text. First we take the emojis from the 'https://getemoji.com/' website, then we create a csv with information about each emoji taken, such as the unicode and description, then we calculate the score of the emoji description and assign a sentiment, which can be positive, neutral or negative. After this information we use the EmoRoBERTa model, which uses GoEmotions to recognise emotions, so we then assign an emotion to each of the emojis on the list. Next we place an input containing emojis and then the text is returned with the emoji replaced by the emotion, the sentiment of the text and information about the emoji/s used in the input.
You can install the package using pip:
pip install SocialDictionary
Function: Constructor of the class, initialises the instance. Description: Gets the full path to the CSV file in the 'Dados' folder. Reads the CSV file using pandas and stores it in the DataFrame self.df. Starts the SentimentIntensityAnalyzer sentiment analyser. Initialises the Google Translator.
Function: Replaces emoticons in the text with the corresponding emotions. Description: Creates a dictionary that maps emoticons to their corresponding emotions using the Design and Emotion columns of the DataFrame. Replaces each emoji in the text with its corresponding emotion. Returns the changed text.
Function: Analyses the sentiment of the given text. Description: Uses SentimentIntensityAnalyzer to calculate the sentiment score of the text. Returns the composite sentiment score.
Function: Extracts all the emoticons present in the text. Description: Goes through each character in the text and checks if it is an emoji using emoji.EMOJI_DATA. Returns a list of the emoji found in the text.
Function: Translate text to the target language. Description: Uses Translator to translate the text into the specified language. Returns the translated text.
Function: Translates the fields of a dictionary into the target language. Description: Loops through each key-value in the dictionary fields. Translates the value into the given language. Returns a new dictionary containing the translated values.
Function: Analyses the supplied text, including emoticons and sentiment. Description: Detects the language of the input text. Extracts emoticons from the original text. Replaces emoticons with the corresponding emotion. Performs sentiment analysis on the modified text. Translates the modified text back to the original language. Gathers detailed information about the emoticons used and translates this information into the input language. Returns a dictionary with the modified text, the sentiment of the text and information about the emoji used.
This file was created by collecting the emoji that are available in a list on the website 'https://getemoji.com/', in this file we have 6 columns, the first one is called 'Design' where we have the graphical representation of the emoji, in the second column we have the 'Unicode' through a function, in the third column we have the 'Description' which is discovered through the emoji library, once we have the description of each emoji in the 'Description' column we calculate the score for each one and then assign a sentiment: If it's below 0 to -1 it's negative, if it's 0 it's neutral and if it's above 0 to 1 it's positive, then using the EmoRoBERTa model we assign an emotion to each emoji using its description, creating the 'emotion' column.
The SentimentAnalyzer class allows you to analyse text to replace emojis with corresponding emotions, perform sentiment analysis and translate text. Here is a detailed guide to using each function:
Initialisation (init):
Initialises the instance of the class, loading a CSV file with emojis and emotions, and initialises the sentiment analyser (SentimentIntensityAnalyzer) and the translator (Translator).
Replace Emojis with Emotions ('replace_emoji_with_emotion'):
2.1.Replaces emojis in text with their corresponding emotions based on CSV data. Parameters: text (str) - Text containing emojis. Return: modified_text (str) - Text with emojis replaced by emotions.
Sentiment Analysis ('sentiment_analyze'):
Performs sentiment analysis on the supplied text and returns the composite score. Parameters: text (str) - Text for sentiment analysis. Return: sentiment (float) - Sentiment analysis composite score.
Extract Emojis ('extract_emojis'):
Extracts all the emojis present in the text. Parameters: text (str) - Text containing emojis. Return: emojis (list) - List of emojis found in the text.
Translate Text ('translate_text'):
Translate the text into the target language. Parameters: text (str) - Text to be translated; target_language (str) - Target language code (e.g. ‘en’ for English). Return: translated (str) - Translated text.
Translate fields ('translate_info'):
Translates the fields of a dictionary into the target language. Parameters: fields (dict) - Dictionary with fields to translate; target_language (str) - Target language code. Return: translated_fields (dict) - Dictionary with the translated fields.
Parse text ('text_analyze'):
Parses the given text, replaces emoticons with their corresponding emotions, performs sentiment analysis and translates the modified text. Parameters: text (str) - text to analyse. Return: translated_results (dict) - Dictionary containing the modified text, the sentiment of the text and information about the emoticons used.
FAQs
This package is used to analyse sentiment, and emojis are also included in this sentiment analysis. The package uses a pre-trained model to assign an emotion to the emojis
We found that SocialDictionary demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
Security News
Socket’s threat research team has detected six malicious npm packages typosquatting popular libraries to insert SSH backdoors.
Security News
MITRE's 2024 CWE Top 25 highlights critical software vulnerabilities like XSS, SQL Injection, and CSRF, reflecting shifts due to a refined ranking methodology.
Security News
In this segment of the Risky Business podcast, Feross Aboukhadijeh and Patrick Gray discuss the challenges of tracking malware discovered in open source softare.