
html-to-document-core
Core engine that parses HTML into an intermediate DocumentElement tree and exposes a plugin registry so external adapters can convert that tree into DOCX, PDF, XLSX, Markdown and more.
Core engine for converting HTML to document formats.
This package provides the core parsing and conversion infrastructure. Adapters for specific output formats (e.g., DOCX, PDF) can be plugged in at runtime.
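To make the idea of an intermediate tree more concrete, here is a minimal, self-contained sketch. It does not import the package: the `SketchElement` shape (with `type`, `text`, `content`, and `styles` fields) and the `toPlainText` helper are assumptions made for illustration, and the real `DocumentElement` type may differ.

```typescript
// Assumed, simplified shape of a node in the intermediate tree.
// The real DocumentElement type in html-to-document-core may differ.
interface SketchElement {
  type: string; // e.g. 'paragraph' or 'text' (hypothetical type names)
  text?: string; // leaf text, if any
  content?: SketchElement[]; // child nodes
  styles?: Record<string, string>; // styles attached to this node
}

// Roughly what '<p>Hello, <strong>world!</strong></p>' could parse into
// under the assumed shape above.
const tree: SketchElement[] = [
  {
    type: 'paragraph',
    styles: { marginTop: '1px', marginBottom: '1px' },
    content: [
      { type: 'text', text: 'Hello, ' },
      { type: 'text', text: 'world!', styles: { fontWeight: 'bold' } },
    ],
  },
];

// Depth-first walk that concatenates the text of every node.
function toPlainText(elements: SketchElement[]): string {
  return elements
    .map((el) => (el.text ?? '') + toPlainText(el.content ?? []))
    .join('');
}
```

An adapter does essentially the same kind of walk, except that it emits the target format (DOCX, PDF, and so on) instead of a plain string.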
# Install the core engine and the DOCX adapter
npm install html-to-document-core html-to-document-adapter-docx
# Or install the all-in-one wrapper (includes core + default adapters)
npm install html-to-document
For full documentation and usage examples, visit:
https://www.npmjs.com/package/html-to-document
import { init, Converter } from 'html-to-document-core';
import { DocxAdapter } from 'html-to-document-adapter-docx';
// Initialize with optional tags, middleware, and adapters
const converter = init({
adapters: {
register: [{ format: 'docx', adapter: DocxAdapter }],
},
tags: {
defaultStyles: [
{ key: 'p', styles: { marginBottom: '1px', marginTop: '1px' } },
],
},
});
// Parse HTML into an intermediate format
const elements = await converter.parse('<p>Hello, world!</p>');
// Convert parsed elements using a registered adapter (e.g., 'docx')
const outputBuffer = await converter.convert(elements, 'docx');
Or with the wrapper package:
import { init, DocxAdapter } from 'html-to-document';
// wrapper automatically includes core + DOCX adapter
const converter = init({
adapters: {
register: [{ format: 'docx', adapter: DocxAdapter }],
},
tags: {
defaultStyles: [
{ key: 'p', styles: { marginBottom: '1px', marginTop: '1px' } },
],
},
});
const buffer = await converter.convert('<p>Example</p>', 'docx');
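The options passed to `init` may also include middleware. As a rough illustration of what a middleware chain could look like, the sketch below assumes that a middleware is an async function from an HTML string to an HTML string, applied in order before parsing. The `SketchMiddleware` type and the `runMiddleware` helper are hypothetical and are not taken from the package.

```typescript
// Assumption: a middleware takes the HTML string and returns a transformed one.
type SketchMiddleware = (html: string) => Promise<string>;

// Remove HTML comments before parsing.
const stripComments: SketchMiddleware = async (html) =>
  html.replace(/<!--[\s\S]*?-->/g, '');

// Collapse runs of whitespace and trim both ends.
const collapseWhitespace: SketchMiddleware = async (html) =>
  html.replace(/\s+/g, ' ').trim();

// Apply each middleware in order, feeding the output of one into the next.
async function runMiddleware(
  html: string,
  chain: SketchMiddleware[],
): Promise<string> {
  let current = html;
  for (const mw of chain) {
    current = await mw(current);
  }
  return current;
}
```

Under this assumption, a cleanup step such as `stripComments` could be passed to the converter's `useMiddleware` method so that it runs before the HTML is parsed.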
You can install any adapter without the wrapper. For example, to add the DOCX adapter:
npm install html-to-document-adapter-docx
After installing, register it when initializing the core:
import { init } from 'html-to-document-core';
import { DocxAdapter } from 'html-to-document-adapter-docx';
const converter = init({
adapters: {
register: [{ format: 'docx', adapter: DocxAdapter }],
},
});
// Now you can convert:
const elements = await converter.parse('<p>Hello</p>');
const docxBuffer = await converter.convert(elements, 'docx');
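The registration step above maps a format name to an adapter. The following self-contained sketch shows one way such a registry could dispatch by format. It is not the package's implementation: `ElementLike`, `SketchAdapter`, `SketchRegistry`, and `PlainTextAdapter` are hypothetical names, and the sketch returns a string for simplicity where a real adapter would return a `Buffer` or `Blob`.

```typescript
// Assumed, simplified element shape (the real DocumentElement may differ).
interface ElementLike {
  type: string;
  text?: string;
  content?: ElementLike[];
}

// Hypothetical adapter contract: turn a tree into some output.
interface SketchAdapter {
  convert(elements: ElementLike[]): Promise<string>;
}

// Minimal registry: format name -> adapter instance.
class SketchRegistry {
  private adapters = new Map<string, SketchAdapter>();

  register(format: string, adapter: SketchAdapter): void {
    this.adapters.set(format, adapter);
  }

  async convert(elements: ElementLike[], format: string): Promise<string> {
    const adapter = this.adapters.get(format);
    if (!adapter) {
      throw new Error(`No adapter registered for format "${format}"`);
    }
    return adapter.convert(elements);
  }
}

// Collect the text of a node and all of its descendants.
const textOf = (el: ElementLike): string =>
  (el.text ?? '') + (el.content ?? []).map(textOf).join('');

// Example adapter: one line of plain text per top-level element.
class PlainTextAdapter implements SketchAdapter {
  async convert(elements: ElementLike[]): Promise<string> {
    return elements.map(textOf).join('\n');
  }
}
```

Keeping the registry keyed by a plain format string means that supporting a new output format only requires registering another adapter; the parsing side stays unchanged.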
init(options?: InitOptions): Converter
options: configuration for tags, middleware, adapters, and DOM parser.
Returns: a Converter instance.
Converter
parse(html: string): Promise<DocumentElement[]>
convert(elements: DocumentElement[] | string, format: string): Promise<Buffer | Blob>
useMiddleware(mw: Middleware): void
registerConverter(format: string, adapter: IDocumentConverter): void
serialize(elements: DocumentElement[]): string
# At repo root
pnpm install
pnpm run build
# To test core only
cd packages/core
pnpm run test
# Lint and format
pnpm run lint
pnpm run format
ISC