
Security News
Follow-up and Clarification on Recent Malicious Ruby Gems Campaign
A clarification on our recent research investigating 60 malicious Ruby gems.
Grab the text from common document formats with one command. DocRipper is a lightweight Ruby wrapper that parses text content from common file formats (currently .doc, .docx, .pdf, .txt, and .sketch) without requiring heavy dependencies such as an OCR library or OpenOffice/LibreOffice.
For simple parsing, you'll likely see a large performance improvement with DocRipper over solutions that rely on OpenOffice/LibreOffice for .doc/.docx conversion.
Need OCR support or in-image text parsing? Take a look at Docsplit.
.doc
.docx
.pdf
.txt
.sketch
File format | Supported? | Dependencies |
---|---|---|
.doc | x | Antiword |
.docx | x | |
.pdf | x | Poppler-utils |
.txt | x | |
.sketch | x | Sqlite3 |
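Because several formats in the table depend on external tools, it can help to confirm those tools are installed before parsing. The following is a minimal sketch using only the Ruby standard library. The executable names (`antiword`, `pdftotext`, `sqlite3`) and the helper names `executable_on_path?` and `missing_tools` are assumptions for illustration; they are not part of DocRipper's API.

```ruby
# Assumed mapping from file format to the command-line tool it needs.
# These executable names are guesses based on the dependency names in the
# table, not something the README states.
ASSUMED_EXECUTABLES = {
  ".doc"    => "antiword",  # Antiword
  ".pdf"    => "pdftotext", # typically shipped with Poppler-utils
  ".sketch" => "sqlite3"    # Sqlite3 command-line shell
}.freeze

# Returns true if an executable file called `name` exists in any directory
# listed in `path_env` (defaults to the current PATH).
def executable_on_path?(name, path_env = ENV.fetch("PATH", ""))
  path_env.split(File::PATH_SEPARATOR).reject(&:empty?).any? do |dir|
    candidate = File.join(dir, name)
    File.file?(candidate) && File.executable?(candidate)
  end
end

# Returns the subset of `executables` whose tool was not found on the PATH.
def missing_tools(executables = ASSUMED_EXECUTABLES, path_env = ENV.fetch("PATH", ""))
  executables.reject { |_ext, exe| executable_on_path?(exe, path_env) }
end

# Print one warning per format whose assumed tool is missing.
missing_tools.each do |ext, exe|
  warn "#{ext}: `#{exe}` not found on PATH"
end
```

Formats without a listed dependency (.docx and .txt) are left out of the mapping, so they never produce a warning.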
gem install doc_ripper
require 'doc_ripper'
DocRipper::rip('/path/to/file')
DocRipper::rip('/path/to/missing/file')
=> nil
# rip! raises an exception if rip returns nil or the file type isn't supported
# invalid file type
DocRipper::rip!('/path/to/invalid/file.type')
# raises DocRipper::UnsupportedFileType
# missing file
DocRipper::rip!('/path/to/missing/file.doc')
# raises DocRipper::FileNotFound
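Since `rip` returns nil for a missing file, callers that always want a string may prefer a small wrapper. The sketch below is one possible approach, not part of the gem: `rip_or_default` and its `ripper:` parameter are hypothetical names, and the only behavior taken from the README is that `DocRipper::rip` returns nil when the file is missing.

```ruby
# Hypothetical helper: returns the extracted text, or `default` when the
# underlying rip call returns nil (for example, for a missing file).
# `ripper` is an injectable callable so the helper can be exercised without
# the gem installed; by default it delegates to DocRipper::rip.
def rip_or_default(path, default: "", ripper: nil)
  ripper ||= lambda do |p|
    require "doc_ripper" # loaded lazily, only when the default is used
    DocRipper::rip(p)
  end

  text = ripper.call(path)
  text.nil? ? default : text
end
```

With the gem installed, `rip_or_default('/path/to/missing/file')` would return an empty string rather than nil. Code that needs to tell a missing file apart from an unsupported type is better served by `rip!` and its exceptions.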
We found that doc_ripper has an unhealthy version release cadence and level of project activity: the last version was released a year ago. It has 1 open source maintainer collaborating on the project.