Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

aspose-pdf

Package Overview
Dependencies
Maintainers
2
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

aspose-pdf

Aspose.PDF for Python via .NET is a PDF Processing library to perform document management can easily be used to generate, modify, convert, render, secure and print documents without using Adobe Acrobat.

  • 24.10.0
  • PyPI
  • Socket score

Maintainers
2

Product Page | Documentation | Demos | Blog | API Reference | Search | Free Support | Temporary License

Try our Free Online Apps demonstrating some of the most popular Aspose.PDF functionality.

Aspose.PDF for Python via .NET is a native library that enables the developers to add PDF processing capabilities to their applications. It can be used to generate or read, convert and manipulate PDF files without the use of Adobe Acrobat. Aspose.PDF for Python via .NET allows to perform a range of document processing tasks such as form processing, get and set metadata information, text and page manipulation, management of annotations, add or remove bookmarks and watermarks, attachments, custom font handling and much more. Check out the Landing Pages of Aspose.PDF for Python via .NET for a more detailed description of the features and possibilities of the library.

General PDF Features

  • Supports most established PDF standards and PDF specifications.
  • Ability to read & export PDFs in multiple image formats including BMP, GIF, JPEG & PNG.
  • Set basic information (e.g. author, creator) of the PDF document.
  • Configure PDF Page properties (e.g. width, height, cropbox, bleedbox etc.).
  • Set page numbering, bookmark level, page sizes etc.
  • Ability to work with text, paragraphs, headings, hyperlinks, graphs, attachments etc.

Supported PDF versions

Aspose.PDF for Python via .NET supports PDF versions 1.2, 1.3, 1.4, 1.5, 1.6, 1.7 and 2.0.

Conversion Features

Aspose.PDF for Python via .NET library allows you to successfully, quickly and easily convert your PDF documents to the most popular formats and vice versa.

  • Convert PDF to Word, Excel, and PowerPoint.
  • Convert PDF to Images formats.
  • Convert PDF file to HTML format and vice versa.
  • Convert PDF to EPUB, Text, XPS, etc.
  • Convert EPUB, Markdown, Text, XPS, PostScript, XML, LaTex to PDF.

Package Features

  • Add, search, extract and replace text in PDF files.
  • Add/delete, extract and replace images.
  • Insert, delete, split PDF pages.
  • Set and get XMP metadata.
  • Validate (PDF/A-1a, PDF/A-1b).
  • Work with bookmarks, annotations, PDF forms, stamps, watermarks and more.

Supported File Formats

The following table indicates the file formats that Aspose.PDF for Python via .NET can load and Save.

FormatDescriptionLoadSaveRemarks
PDFPortable Document FormatYesYes 
CGMComputer Graphics Metafile for 2D vector graphicsYesNo 
EPUBEbook file formatYesYes 
HTMLHTML FormatYesYes 
TeXLaTex typesetting file formatYesYes 
MHTMHTML DocumentYesNo 
PCLPrinter Control Language FilesYesNo 
PSPostscript FilesYesNo 
SVGScalable Vector Graphics (An XML-based vector image format)YesYes 
XMLXML FormatYesYes 
XPSXPS DocumentsYesYes 
XSLFOXSL-FO is part of XSL file which is used for the transformation and formatting of XML dataYesNo 
MDMarkdown FormatYesNo 
XLSSaves the document in the Microsoft Excel SpreadSheetNoYes 
XLSXSaves the document in the Microsoft Excel 2007 formatNoYes 
PPTXSaves the document in the Microsoft PowerPoint Presentations formatNoYes 
DOCSaves the document in the Microsoft Word formatNoYes 
DOCXSaves the document in the Microsoft Word formatNoYes 
MobiXMLSaves the document in eBook MobiXML Standard formatNoYes 
JPEGSaves the document in JPEG FormatYesYes 
EMFEnhanced metafile format (EMF)YesYes 
PNGSaves the document in PNG FormatYesYes 
BMPSaves the document in BMP FormatYesYes 
GIFGraphic Interchange FormatNoYes 
TIFFSaves the document as Single or Multi-Page TIFF ImageYesYes 
TextSave the document int Text FormatYesYes 

Platform Independence

Aspose.PDF for Python via .NET can be used to develop 32-bit and 64-bit Python applications for different operating systems (such as Windows and Linux) where Python 3.7 or later is installed.

Get Started

Run pip install aspose-pdf to fetch the package. If you already have Aspose.PDF for Python via .NET and want to get the latest version, please run pip install --upgrade aspose-pdf.

To learn more about Aspose.PDF for Python via .NET and explore the basic requirements and features of the library, check out the following Aspose.PDF for Python via .NET Documentation pages for other use cases.

Create a PDF file from scratch in Python

In the next code snippet, we are creating a PDF document fron scratch containing the text “Hello World!”. After installing Aspose.PDF for Python via .NET in your environment, you can execute below code sample to see how Aspose.PDF API works.

Below code snippet follows these steps:

  1. Instantiate a Document object.
  2. Add a Page to the document object.
  3. Create a TextFragment object.
  4. Add TextFragment to Paragraph collection of the page.
  5. Save the resultant PDF document.

The following code snippet is a “Hello, World!” program to exhibit working of Aspose.PDF for Python via .NET API:

import aspose.pdf as ap

# Initialize document object
document = ap.Document()
# Add page
page = document.pages.add()
# Initialize textfragment object
text_fragment = ap.text.TextFragment("Hello,world!")
# Add text fragment to new page
page.paragraphs.add(text_fragment)
# Save updated PDF
document.save("output.pdf")

Example of converting HTML to PDF

Aspose.PDF for Python via .NET is a PDF manipulation API that lets you convert any existing HTML documents to PDF format. The process of converting HTML to PDF can be flexibly customized.

Below code snippet follows these steps:

  1. Create an instance of the HtmlLoadOptions object.
  2. Initialize Document object.
  3. Save output PDF document by calling Document.Save() method.
import aspose.pdf as ap
# Instantiate an object of HtmlLoadOptions
options = ap.HtmlLoadOptions()
# Convert HTML to PDF
document = ap.Document("input.html", options)
# Save PDF
document.save("output.pdf")

Example of converting PDF to SVG

Aspose.PDF for Python via .NET supports the feature to convert SVG image to PDF format. To accomplish this requirement, the SvgSaveOptions class has been introduced into the Aspose.PDF namespace. Instantiate an object of SvgSaveOptions and pass it as a second argument to the Document.Save(..) method.

Below code snippet follows these steps:

  1. Create an object of the Document class.
  2. Create SvgSaveOptions object with needed settings.
  3. Call the Document.Save() method and pass it SvgSaveOptions object convert the PDF document to SVG.
import aspose.pdf as ap

# Open PDF document
document = ap.Document("input.pdf")

# Instantiate an object of SvgSaveOptions
saveOptions = ap.SvgSaveOptions()

# Do not compress SVG image to Zip archive
saveOptions.compress_output_to_zip_archive = False
saveOptions.treat_target_file_name_as_directory = True

# Save the output in SVG files
document.save("output.svg", saveOptions)

Merge PDF Files

Merge multiple PDF into single file in Python with Aspose.PDF programmatically. PDF files are merged such that the first one is joined at the end of the other document.

Below code snippet follows these steps:

  1. Open first document.
  2. Open second document.
  3. Add pages of second document to the first.
  4. Save concatenated output file.
import aspose.pdf as ap

# Open first document
document1 = ap.Document("input_1.pdf")
# Open second document
document2 = ap.Document("input_2.pdf")

# Add pages of second document to the first
document1.pages.add(document2.pages)

# Save concatenated output file
document1.save("output.pdf")

Print PDF to XPS printer

You can print a PDF file to an XPS printer, or some other soft printer for that matter, using the PdfViewer class.

Below code snippet follows these steps:

  1. Create an object of the PdfViewer class.
  2. Open the PDF file using the bind_pdf method.
  3. Set different print settings using the PrinterSettings and PageSettings classes.
  4. Set the printer_name property to the XPS or other printer.
  5. Print document using the print_document_with_settings method.
import aspose.pdf as ap
import aspose.pydrawing as drawing

# Create PdfViewer object
viewer = ap.facades.PdfViewer()

# Open input PDF file
viewer.bind_pdf("input.pdf")

# Set attributes for printing
# Print the file with adjusted size
viewer.auto_resize = True
# Print the file with adjusted rotation
viewer.auto_rotate = True
# Do not produce the page number dialog when printing
viewer.print_page_dialog = False

# Create objects for printer and page settings
ps = ap.printing.PrinterSettings()
pgs = ap.printing.PageSettings()

# Set XPS/PDF printer name
ps.printer_name = "Microsoft XPS Document Writer"
# Or set the PDF printer
# ps.printer_name = "Adobe PDF"

# Set PageSize(if required)
pgs.paper_size = ap.printing.PaperSize("A4", 827, 1169)

# Set PageMargins(if required)
pgs.margins = ap.devices.Margins(0, 0, 0, 0)

# Print document using printer and page settings
viewer.print_document_with_settings(pgs, ps)

# Close the PDF file after printing
viewer.close()

Product Page | Documentation | Demos | Blog | API Reference | Search | Free Support | Temporary License

Keywords

FAQs


Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc