
Security News
NIST Under Federal Audit for NVD Processing Backlog and Delays
As vulnerability data bottlenecks grow, the federal government is formally investigating NIST’s handling of the National Vulnerability Database.
html-text-extractor
Advanced tools
A Node.js library that extracts and structures text from HTML files for full-text search indexing.
An HTML parsing library for Node.js, designed to extract text sections associated with anchor tags and headings from HTML files in a directory and its subdirectories. The extracted text is structured for indexing in a full-text search engine. The library produces an array of sections, each with properties for the URL (based on the file path), the anchor (if present), the title (based on the following heading tag), and the text.
624 byte
nano sized (ESM, gizpped)yarn add html-text-extractor
npm install html-text-extractor
import { extract } from 'html-text-extractor'
const result = await extract('./dist')
const { extract } = require('html-text-extractor')
// same API like ESM variant
FAQs
A Node.js library that extracts and structures text from HTML files for full-text search indexing.
The npm package html-text-extractor receives a total of 1 weekly downloads. As such, html-text-extractor popularity was classified as not popular.
We found that html-text-extractor demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
As vulnerability data bottlenecks grow, the federal government is formally investigating NIST’s handling of the National Vulnerability Database.
Research
Security News
Socket’s Threat Research Team has uncovered 60 npm packages using post-install scripts to silently exfiltrate hostnames, IP addresses, DNS servers, and user directories to a Discord-controlled endpoint.
Security News
TypeScript Native Previews offers a 10x faster Go-based compiler, now available on npm for public testing with early editor and language support.