Socket
Socket
Sign inDemoInstall

docx-extractor

Package Overview
Dependencies
42
Maintainers
1
Versions
90
Alerts
File Explorer

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

docx-extractor

Extracts comments and other data from docx files


Version published
Maintainers
1
Weekly downloads
12
increased by33.33%

Weekly downloads

Readme

Source

Node.js: docx-extractor

This module allows you to extract comments in 'docx' files and other data stored such as author name, hyperlinks, modification times, etc.

Why?

It was a requirement of a job I was working on and I could not find any modules to do this. It was a pain to have to try and create it at the time of the job so I figured I would just make a module incase anyone else ever needed it.

Please feel free to make any comments or suggestions. It can always be improved so I accept any and all feedback.

Installation

npm install --save docx-extractor

Usage

docx-extractor is a module that will let you extract the comments embedded into the xml of 'docx' files as well as other data.

Every method uses the same format as below. Just replace "someMethod" below with the name of the method you want to use.

Example:

var dxe = require('docx-extractor');

dxe.someMethod('myfile.docx', function(data){
    console.log(data)
});

Methods

  • extractComments(file, callback)

    -> Extracts the comments embedded into the xml of 'docx' files. It returns all of the comments as an array.

  • getHyperlinks(file, callback)

    -> Returns all hyperlinks in the docx file as an array

  • getRevisionNumber(file, callback)

    -> Returns the number of times the document was modified and saved.

  • numberPages(file, callback)

    -> Returns number of pages in the docx file

  • getAuthor(file, callback)

    -> Returns the original author of the document. This will be the name of the user account on the machine it was created from.

  • lastModifiedBy(file, callback)

    -> Returns the last person that modified the document. Again, this returns the name of the computer's user account that was in use at the time.

  • timeCreated(file, callback)

    -> Returns the time the docx file was originally created. Example: 2017-03-08T17:40:00Z

  • lastModified(file, callback)

    -> Returns the time that the document was last modified. Example: 2017-03-09T14:23:00Z

  • templateUsed(file, callback)

    -> Returns the template that was used when creating the document. Example: "Normal.dotm" is the template name of when someone starts a docx file with the blank template.

Keywords

FAQs

Last updated on 01 Apr 2019

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc