html-to-document-core

Core engine that parses HTML into an intermediate DocumentElement tree and exposes a plugin registry so external adapters can convert that tree into DOCX, PDF, XLSX, Markdown and more.

Version 0.2.4 (latest), published on npm. Weekly downloads: 355. Maintainers: 1.

Core engine for converting HTML to document formats.

This package provides the core parsing and conversion infrastructure. Adapters for specific output formats (e.g., DOCX, PDF) can be plugged in at runtime.

Installation

# Install the core engine together with the DOCX adapter
npm install html-to-document-core html-to-document-adapter-docx

# Or install the all-in-one wrapper (includes core + default adapters)
npm install html-to-document

For full documentation and usage examples, visit:
https://www.npmjs.com/package/html-to-document

Usage

import { init } from 'html-to-document-core';
import { DocxAdapter } from 'html-to-document-adapter-docx';

// Initialize with optional tags, middleware, and adapters
const converter = init({
  adapters: {
    register: [
      { format: 'docx', adapter: DocxAdapter }
    ],
  },
  tags: {
    defaultStyles: [
      { key: 'p', styles: { marginBottom: '1px', marginTop: '1px' } },
    ],
  },
});

// Parse HTML into an intermediate format
const elements = await converter.parse('<p>Hello, world!</p>');

// Convert parsed elements using a registered adapter (e.g., 'docx')
const outputBuffer = await converter.convert(elements, 'docx');

Or with the wrapper package:

import { init, DocxAdapter } from 'html-to-document';

// The wrapper re-exports the core's init and the DOCX adapter
const converter = init({
  adapters: {
    register: [
      { format: 'docx', adapter: DocxAdapter }
    ],
  },
  tags: {
    defaultStyles: [
      { key: 'p', styles: { marginBottom: '1px', marginTop: '1px' } },
    ],
  },
});

// convert() also accepts an HTML string directly, so the separate parse() step can be skipped
const buffer = await converter.convert('<p>Example</p>', 'docx');

Adapters

Installing an adapter separately

You can install any adapter without the wrapper. For example, to add the DOCX adapter:

npm install html-to-document-adapter-docx

Registering an adapter

After installing, register it when initializing the core:

import { init } from 'html-to-document-core';
import { DocxAdapter } from 'html-to-document-adapter-docx';

const converter = init({
  adapters: {
    register: [
      { format: 'docx', adapter: DocxAdapter }
    ],
  },
});

// Now you can convert:
const elements = await converter.parse('<p>Hello</p>');
const docxBuffer = await converter.convert(elements, 'docx');
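
As a rough mental model of what registering an adapter under a format key involves, the following sketch shows a registry that maps format strings to adapters and delegates conversion to the matching one. It is a conceptual illustration only, not the package's source code: `AdapterRegistry`, its methods, and `jsonAdapter` are hypothetical names, and the sketch registers an adapter instance, whereas the `init` call above passes the `DocxAdapter` class.

```javascript
// Conceptual sketch only: NOT the html-to-document-core implementation.
// AdapterRegistry, register, convert, and jsonAdapter are hypothetical names.
class AdapterRegistry {
  constructor() {
    // Maps a format key such as 'docx' to an adapter object.
    this.adapters = new Map();
  }

  // Store an adapter under a format key; a later registration replaces an earlier one.
  register(format, adapter) {
    this.adapters.set(format, adapter);
  }

  // Look up the adapter for the requested format and delegate the conversion to it.
  async convert(elements, format) {
    const adapter = this.adapters.get(format);
    if (!adapter) {
      throw new Error(`No adapter registered for format "${format}"`);
    }
    return adapter.convert(elements);
  }
}

// Stand-in adapter that serializes the element tree to JSON text.
const jsonAdapter = {
  async convert(elements) {
    return JSON.stringify(elements);
  },
};

const registry = new AdapterRegistry();
registry.register('json', jsonAdapter);
```

In this sketch, asking for a format with no registered adapter rejects with an error. How the real converter reports an unknown format is not stated in this README.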

API

init(options?: InitOptions): Converter

  • options: configuration for tags, middleware, adapters, and DOM parser.
  • Returns a Converter instance.

Converter

  • parse(html: string): Promise<DocumentElement[]>
    Parses an HTML string into document elements.
  • convert(elements: DocumentElement[] | string, format: string): Promise<Buffer | Blob>
    Converts parsed elements (or an HTML string) into the specified format using a registered adapter.
  • useMiddleware(mw: Middleware): void
    Adds custom middleware for HTML preprocessing.
  • registerConverter(format: string, adapter: IDocumentConverter): void
    Registers a custom adapter for the given format.
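
To make the last two methods more concrete, here is a minimal sketch of what a middleware and a custom adapter might look like. The shapes are assumptions declared locally for illustration. The `DocumentElement` fields (`type`, `text`, `content`), the middleware contract (HTML string in, HTML string out), and the adapter's single `convert(elements)` method are not confirmed by this README, so check the package's exported types before relying on them. `stripScripts` and `PlainTextAdapter` are hypothetical names.

```javascript
/**
 * Assumed element shape (not taken from the package's type definitions).
 * @typedef {Object} DocumentElement
 * @property {string} type                  node kind, e.g. 'paragraph'
 * @property {string} [text]                text carried by the node
 * @property {DocumentElement[]} [content]  child nodes
 */

// Hypothetical middleware: takes the raw HTML and resolves to cleaned HTML.
// Here it drops <script> blocks before parsing.
async function stripScripts(html) {
  return html.replace(/<script\b[^>]*>[\s\S]*?<\/script>/gi, '');
}

// Hypothetical adapter: flattens the element tree into plain text.
class PlainTextAdapter {
  // Collect the text of one node followed by the text of its descendants, depth-first.
  collectText(element) {
    const own = element.text ?? '';
    const children = (element.content ?? []).map((child) => this.collectText(child));
    return own + children.join('');
  }

  // Emit one line per top-level element, returned as a UTF-8 Buffer (Node.js).
  async convert(elements) {
    const text = elements.map((el) => this.collectText(el)).join('\n');
    return Buffer.from(text, 'utf8');
  }
}
```

Going by the signatures listed above, these would presumably be wired in with `converter.useMiddleware(stripScripts)` and `converter.registerConverter('txt', new PlainTextAdapter())`. The `'txt'` format key is made up for this example, and whether `registerConverter` expects an instance or a class is an assumption.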

Development

# At repo root
npm install
npm run build

# To test core only
cd packages/core
npm test

# Lint and format
npm run lint
npm run format

License

ISC

Keywords

html-to-document

Package last updated on 20 May 2025
