🚀 Big News: Socket Acquires Coana to Bring Reachability Analysis to Every Appsec Team.Learn more
Socket
DemoInstallSign in
Socket

pdf-info

Package Overview
Dependencies
Maintainers
1
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

pdf-info

This package extracts important information from a pdf document such as heading, paragraphs and important keywords!!!

2.1.0
PyPI
Maintainers
1

Data extractor for PDF documents - pdf-info

A command line tool and Python library to support your analysis of pdf documents.

Extracts important fetures from a document like headers, paragraphs, important keywords and subscripts.

Returns a vector of relevant details!!

Installation

Install pdf-info using pip

pip install pdf-info

Use as Python Library

You can easily add pdf-info to your own Python scripts as library.

from pdf_info import pdf_info_class

ob = pdf_info_class()

result = ob.pdf_info('path/to/my/file.pdf',page_number,tag)

List of tags supported are - "headers", "paragraphs", "keywords", "subscripts".

Maintainers

  • Satyam Prakash

FAQs

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts