
Security News
Nx npm Packages Compromised in Supply Chain Attack Weaponizing AI CLI Tools
Malicious Nx npm versions stole secrets and wallet info using AI CLI tools; Socket’s AI scanner detected the supply chain attack and flagged the malware.
Grab the text from common document formats with 1 command. DocRipper is an extremely lightweight Ruby wrapper that can be used to parse text contents from common file formats (currently .doc, .docx and .pdf, .sketch) without the need for a large number of dependencies like an OCR library or OpenOffice/LibreOffice.
For simple parsing, you'll likely see a large performance improvement with DocRipper over solutions that rely on OpenOffice/LibreOffice for .doc/.docx conversion.
Need OCR support or in-image text parsing? Take a look at Docsplit.
.doc
.docx
.pdf
.txt
.sketch
File format | Supported? | Dependencies |
---|---|---|
.doc | x | Antiword |
.docx | x | |
x | Poppler-utils | |
.txt | x | |
.sketch | x | Sqlite3 |
gem install doc_ripper
require 'doc_ripper'
DocRipper::rip('/path/to/file')
DocRipper::rip('/path/to/missing/file')
=> nil
#rip! will raise an exception if rip returns nil or the file type isn't supported
# invalid file type
DocRipper::rip!('/path/to/invalide/file.type')
=> DocRipper::UnsupportedFileType
# missing file
DocRipper::rip!('/path/to/missing/file.doc')
=> DocRipper::FileNotFound
FAQs
Unknown package
We found that doc_ripper demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Malicious Nx npm versions stole secrets and wallet info using AI CLI tools; Socket’s AI scanner detected the supply chain attack and flagged the malware.
Security News
CISA’s 2025 draft SBOM guidance adds new fields like hashes, licenses, and tool metadata to make software inventories more actionable.
Security News
A clarification on our recent research investigating 60 malicious Ruby gems.