
Security News
Another Round of TEA Protocol Spam Floods npm, But It’s Not a Worm
Recent coverage mislabels the latest TEA protocol spam as a worm. Here’s what’s actually happening.
n8n-nodes-n8ntools-document-processor
Advanced tools
N8N Tools - Document Processor: Process and analyze documents with OCR, text extraction, and format conversion
Process and analyze documents with OCR, text extraction, and format conversion capabilities. This N8N community node provides comprehensive document processing through the N8N Tools platform.
Install this node in your N8N instance:
n8n-nodes-n8ntools-document-processornpm install n8n-nodes-n8ntools-document-processor
https://api.n8ntools.io| Operation | Description | Input | Output |
|---|---|---|---|
| Extract Text | Extract text content | PDF, DOCX, DOC, RTF | Plain text |
| Extract Metadata | Get document properties | Any document | JSON metadata |
| Convert Format | Change document format | Various formats | PDF, DOCX, TXT, HTML, MD, RTF |
| Split Pages | Split into individual pages | PDF, DOCX | ZIP with pages |
| Merge Documents | Combine multiple documents | Multiple files | Single document |
| OCR Processing | Extract text from images | PDF, images | Text with OCR |
[File Trigger] → [N8N Tools Document Processor] → [Extract Data] → [Database/Email]
Invoice Text Extraction:
{
"operation": "extractText",
"inputSource": "binaryData",
"binaryPropertyName": "data",
"advancedOptions": {
"extractImages": true,
"extractTables": true,
"preserveFormatting": true
}
}
{
"text": "This is the extracted text content...",
"wordCount": 1250,
"pageCount": 3,
"hasImages": true,
"hasTables": true,
"images": [
{
"page": 1,
"base64": "iVBORw0KGgoAAAANSUhEUgAA...",
"format": "png"
}
],
"tables": [
{
"page": 2,
"rows": 5,
"columns": 3,
"data": [["Header1", "Header2", "Header3"], ...]
}
],
"success": true,
"operation": "extractText",
"creditsUsed": 2,
"originalFilename": "invoice.pdf"
}
Returns the converted document as binary data with metadata:
{
"success": true,
"operation": "convertFormat",
"originalFilename": "document.pdf",
"convertedFilename": "document.docx",
"targetFormat": "docx",
"creditsUsed": 1
}
{
"filename": "report.pdf",
"fileSize": 2048000,
"mimeType": "application/pdf",
"pageCount": 15,
"author": "John Doe",
"title": "Annual Report 2024",
"subject": "Company Performance",
"keywords": ["business", "report", "annual"],
"creationDate": "2024-01-15T10:30:00Z",
"modificationDate": "2024-01-16T14:20:00Z",
"hasPassword": false,
"isEncrypted": false,
"success": true
}
por): Optimized for Brazilian Portugueseeng): US and UK Englishspa): Latin American and Iberian Spanishfra): French language supportdeu): German language supportauto): Automatic language detection{
"operation": "ocrProcessing",
"inputSource": "fileUrl",
"fileUrl": "https://example.com/scanned-invoice.pdf",
"ocrLanguage": "por",
"advancedOptions": {
"extractTables": true,
"preserveFormatting": true
}
}
[Email Trigger] → [Download Attachment] → [Extract Text] → [Parse Data] → [Update CRM]
[File Upload] → [Extract Metadata] → [Classify Type] → [Route to Process]
[File Monitor] → [Document Processor] → [Convert to PDF] → [Archive]
[Document Input] → [Extract Text] → [Find Key Terms] → [Generate Summary]
// Extract specific information from legal documents
{
"operation": "extractText",
"advancedOptions": {
"extractTables": true,
"preserveFormatting": true
}
}
// Then use regex or NLP to find specific clauses
// Convert old DOC files to modern formats
{
"operation": "convertFormat",
"targetFormat": "docx"
}
// OCR processing for form data extraction
{
"operation": "ocrProcessing",
"ocrLanguage": "eng",
"advancedOptions": {
"extractTables": true // For form fields
}
}
Common errors and solutions:
// Password-protected document
{
"error": "Document is password protected",
"success": false,
"suggestion": "Provide password in advancedOptions"
}
// Unsupported format
{
"error": "Unsupported file format: .xyz",
"success": false,
"suggestion": "Check supported input formats"
}
// OCR language not detected
{
"error": "Could not detect document language",
"success": false,
"suggestion": "Specify OCR language manually"
}
{
"advancedOptions": {
"password": "your-document-password"
}
}
[Data] → [Generate PDF] → [Extract Text] → [Validate Content]
[Scrape URLs] → [Download PDFs] → [Process Documents] → [Store Data]
[Email Attachment] → [Process Document] → [Extract Key Info] → [Reply with Summary]
MIT License - see LICENSE file for details.
Part of the N8N Tools ecosystem • Website • All Packages
FAQs
N8N Tools - Document Processor: Process and analyze documents with OCR, text extraction, and format conversion
The npm package n8n-nodes-n8ntools-document-processor receives a total of 47 weekly downloads. As such, n8n-nodes-n8ntools-document-processor popularity was classified as not popular.
We found that n8n-nodes-n8ntools-document-processor demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Recent coverage mislabels the latest TEA protocol spam as a worm. Here’s what’s actually happening.

Security News
PyPI adds Trusted Publishing support for GitLab Self-Managed as adoption reaches 25% of uploads

Research
/Security News
A malicious Chrome extension posing as an Ethereum wallet steals seed phrases by encoding them into Sui transactions, enabling full wallet takeover.