resume-parser-extended

A NodeJS library to parse pdf, txt, doc and docx files to JSON and CSV



Resume Parser

A simple Node.js library that parses resume files in PDF, DOC, DOCX and TXT format to JSON. If a CV or resume contains social media profile links, the library should also parse the public profile pages (e.g. LinkedIn public profile, GitHub) and organize that data in JSON format.

Installation

npm install resume-parser-extended --save

Usage

const ResumeParser = require('resume-parser-extended');

// From file
const resume = new ResumeParser("./files/resume.doc");

// Or from URL (use one or the other: `resume` can only be declared once)
// const resume = new ResumeParser("https://writing.colostate.edu/guides/documents/resume/functionalSample.pdf");

// Convert to a JSON object
resume.parseToJSON()
  .then(data => {
    console.log('Yay! ', data);
  })
  .catch(error => {
    console.error(error);
  });

// Save to a JSON file
resume.parseToFile('converted') // output subdirectory
  .then(file => {
    console.log('Yay! ', file);
  })
  .catch(error => {
    console.error(error);
  });
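The promise returned by parseToJSON() can also be consumed with async/await. The following is a minimal sketch, not part of the library: `parseResumeSafely` is a hypothetical helper name, and it only assumes that the object passed in exposes a `parseToJSON()` method returning a Promise, as in the usage above.

```javascript
// Hypothetical helper (not part of the library): wraps parseToJSON() so the
// caller gets a result object instead of having to catch a rejection.
// Assumption: `parser` exposes parseToJSON() returning a Promise.
async function parseResumeSafely(parser) {
  try {
    const data = await parser.parseToJSON();
    return { ok: true, data };
  } catch (error) {
    return { ok: false, error };
  }
}

// Example call site, assuming `resume` was created as shown above:
// parseResumeSafely(resume).then(result => {
//   if (result.ok) console.log(result.data);
//   else console.error(result.error);
// });
```

Returning a result object keeps the error path explicit at the call site, which can be convenient when parsing many files in a loop.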

At this point the application will work, but by default it supports only the .TXT and .HTML text formats. To parse other formats, install at least PDF (and DOC) support. The textract README gives the following instructions:

  • PDF extraction requires poppler-utils to be installed.
  • DOC extraction requires catdoc to be installed, unless on OSX, in which case textutil (installed by default) is used.
  • DOCX extraction requires antiword to be available (e.g. sudo apt-get install -y antiword on Ubuntu).
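Before parsing, it can help to verify that these external tools are actually on the PATH. The sketch below is an assumption-laden illustration, not something the library provides: `isOnPath` and `missingTools` are hypothetical helper names, and `pdftotext` is used as the executable name because that is the text extractor shipped in poppler-utils.

```javascript
const { spawnSync } = require('child_process');

// Hypothetical helper: returns false when `command` cannot be found,
// true when the process could at least be launched.
function isOnPath(command) {
  const result = spawnSync(command, ['-v'], { stdio: 'ignore', timeout: 5000 });
  // spawnSync reports a missing executable via result.error with code ENOENT.
  return !(result.error && result.error.code === 'ENOENT');
}

// Hypothetical helper: returns the subset of `tools` that `check` rejects.
function missingTools(tools, check = isOnPath) {
  return tools.filter(tool => !check(tool));
}

// Executable names are assumptions based on the packages listed above.
const missing = missingTools(['pdftotext', 'catdoc', 'antiword']);
if (missing.length > 0) {
  console.warn('Missing extraction tools:', missing.join(', '));
}
```

The check only confirms that a tool can be started; it does not validate its version or that it handles a particular file.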

Extending

All the 'action' happens in the src/dictionary.js file. For now it contains only basic rules, but it is very flexible (although a bit complicated) and extensible. Add your own rules by following the existing ones and their main principles, and enjoy!
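To illustrate the general idea of a rule dictionary, here is a self-contained sketch. It is not copied from src/dictionary.js, and the real file's structure may differ: the `rules` table, the `applyRules` function, and the regular expressions are all hypothetical and chosen only for illustration.

```javascript
// Hypothetical rule table: field name -> list of regular expressions.
// The actual layout of src/dictionary.js may be different.
const rules = {
  email: [/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/],
  phone: [/\+?\d[\d\s().-]{7,}\d/],
};

// Applies each field's patterns in order and keeps the first match.
function applyRules(text, ruleTable) {
  const result = {};
  for (const [field, patterns] of Object.entries(ruleTable)) {
    for (const pattern of patterns) {
      const match = text.match(pattern);
      if (match) {
        result[field] = match[0];
        break; // first matching pattern wins for this field
      }
    }
  }
  return result;
}

// Extending the table: add a new field with its own pattern(s).
rules.github = [/github\.com\/[A-Za-z0-9-]+/i];

const sample = 'Jane Doe\njane@example.com\ngithub.com/janedoe';
console.log(applyRules(sample, rules));
```

Keeping rules as data rather than code means a new field can be supported by adding one entry, without touching the matching logic.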

Contributions


Last updated on 16 Jan 2023
