You're Invited:Meet the Socket Team at BlackHat and DEF CON in Las Vegas, Aug 4-6.RSVP
Socket
Book a DemoInstallSign in
Socket

revised-stopwords

Package Overview
Dependencies
Maintainers
1
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

revised-stopwords

An optimized stopwords list for sentiment analysis that retains key negations like 'not', 'won't', and 'wouldn't', along with intensity modifiers like 'very' and 'most' to preserve sentiment accuracy.

0.1.4
pipPyPI
Maintainers
1

Revised Stopwords for NLP

Overview

This package provides an optimized stopwords list for sentiment analysis by preserving sentiment-related words that NLTK’s default list would typically remove. By retaining key negations and intensity modifiers, this package ensures that sentiment expressions remain intact, leading to more accurate sentiment classification.

Features

  • Retains crucial negations (not, won't, wouldn't, shouldn't, etc.)
  • Keeps intensity modifiers (very, most, more, etc.) for better sentiment retention
  • Removes only words that do not impact sentiment analysis

Installation

Install via pip

pip install revised-stopwords

Import and Use in Your NLP Pipeline

from revised_stopwords import get_revised_stopwords

# Get the optimized stopwords list
stopwords_list = get_revised_stopwords()

print(stopwords_list)  # Output: A set of refined stopwords

Example: Removing Stopwords from Text

from revised_stopwords import get_revised_stopwords

text = "I don't think this is a very good idea"
tokens = text.lower().split()  # Basic word split

filtered_tokens = [word for word in tokens if word not in get_revised_stopwords()]

print(filtered_tokens)
# Output: ["don't", 'think', 'very', 'good', 'idea']

Troubleshooting

LookupError: Resource stopwords not found?

Please use the NLTK Downloader to obtain the resource. Run this command once to manually download stopwords

import nltk
nltk.download('stopwords')

ModuleNotFoundError: No module named 'nltk'?

This error means that the nltk library is not installed in your environment.
Ensure NLTK is installed:

pip install nltk

Contributing

Want to improve this package? Feel free to fork the repo, submit PRs, or suggest enhancements!

How to Contribute:

  • Fork this repository to your GitHub account.

  • Clone your forked repo:

    git clone https://github.com/priyaa279/revised_stopwords.git
    
  • Create a new branch for your changes:

    git checkout -b feature-branch-name
    
  • Make your changes and commit them:

    git commit -m "Describe your change"
    
  • Push your branch to GitHub:

    git push origin feature-branch-name
    

Submit a Pull Request (PR) for review. For major changes, please open an issue first to discuss what you'd like to modify.

License

This project is open-source and licensed under the MIT License.

FAQs

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts