Research
Security News
Quasar RAT Disguised as an npm Package for Detecting Vulnerabilities in Ethereum Smart Contracts
Socket researchers uncover a malicious npm package posing as a tool for detecting vulnerabilities in Etherium smart contracts.
hast-util-raw
Advanced tools
The hast-util-raw package is a utility for working with HAST (Hypertext Abstract Syntax Tree) trees. It can parse and transform raw HTML into a HAST tree, allowing for manipulation and analysis of the structure of HTML documents. This package is particularly useful for developers working with virtual DOMs or needing to preprocess or clean HTML content programmatically.
Parsing HTML to HAST
This feature allows for the parsing of HTML strings embedded within HAST trees. The `raw` function takes a HAST tree that may contain raw HTML as part of its nodes and returns a new HAST tree with the raw HTML parsed into HAST nodes. This is useful for integrating unescaped HTML strings into a HAST-based workflow.
const raw = require('hast-util-raw');
const h = require('hastscript');
const tree = h('div', [h('span', 'Hello'), '<strong>world!</strong>']);
const result = raw(tree);
Transforming HAST with embedded raw HTML
This demonstrates how `hast-util-raw` can transform a HAST tree that includes a 'raw' node containing HTML into a fully parsed HAST structure. This is particularly useful for scenarios where raw HTML is mixed with HAST content and a uniform HAST structure is needed for further processing.
const raw = require('hast-util-raw');
const u = require('unist-builder');
const tree = u('root', [u('element', {tagName: 'div'}, [u('text', 'Some text'), u('raw', '<span>More text</span>')])]);
const result = raw(tree);
Similar to hast-util-raw, `rehype-parse` is used for parsing HTML into HAST. However, `rehype-parse` is more focused on being a full HTML parser as part of the rehype ecosystem, offering more comprehensive parsing options and better integration with rehype plugins.
hast utility to parse the tree and semistandard raw
nodes (strings of
HTML) again, keeping positional info okay.
This package is a utility to parse a document again.
It passes each node and embedded raw HTML through an HTML parser
(parse5
), to recreate a tree exactly as how a browser would parse
it, while keeping the original data and positional info intact.
This utility is particularly useful when coming from markdown and wanting to
support HTML embedded inside that markdown (which requires passing
allowDangerousHtml: true
to mdast-util-to-hast
).
Markdown dictates how, say, a list item or emphasis can be parsed.
We can use that to turn the markdown syntax tree into an HTML syntax tree.
But markdown also dictates that things that look like HTML, are passed through
untouched, even when it just looks like XML but doesn’t really make sense, so we
can’t normally use these strings of “HTML” to create an HTML syntax tree.
This utility can.
It can be used to take those strings of HTML and include them into the syntax
tree as actual nodes.
If your final result is HTML and you trust content, then “strings” are fine
(you can pass allowDangerousHtml: true
to hast-util-to-html
, which passes
HTML through untouched).
But there are two main cases where a proper syntax tree is preferred:
The plugin rehype-raw
wraps this utility at a higher-level
(easier) abstraction.
This package is ESM only. In Node.js (version 16+), install with npm:
npm install hast-util-raw
In Deno with esm.sh
:
import {raw} from 'https://esm.sh/hast-util-raw@9'
In browsers with esm.sh
:
<script type="module">
import {raw} from 'https://esm.sh/hast-util-raw@9?bundle'
</script>
import {h} from 'hastscript'
import {raw} from 'hast-util-raw'
const tree = h('div', [h('h1', ['Foo ', h('h2', 'Bar'), ' Baz'])])
const reformatted = raw(tree)
console.log(reformatted)
Yields:
{ type: 'element',
tagName: 'div',
properties: {},
children:
[ { type: 'element',
tagName: 'h1',
properties: {},
children: [Object] },
{ type: 'element',
tagName: 'h2',
properties: {},
children: [Object] },
{ type: 'text', value: ' Baz' } ] }
Options
Configuration.
file?
(VFile | null | undefined
)
— corresponding virtual file representing the input document (optional)
passThrough?
(Array<string> | null | undefined
)
List of custom hast node types to pass through (as in, keep) (optional).
If the passed through nodes have children, those children are expected to be hast again and will be handled.
tagfilter?
(boolean | null | undefined
)
Whether to disallow irregular tags in raw
nodes according to GFM
tagfilter
(default: false
).
This affects the following tags, grouped by their kind:
RAWTEXT
: iframe
, noembed
, noframes
, style
, xmp
RCDATA
: textarea
, title
SCRIPT_DATA
: script
PLAINTEXT
: plaintext
When you know that you do not want authors to write these tags,
you can enable this option to prevent their use from running amok.raw(tree, options)
Pass a hast tree through an HTML parser, which will fix nesting, and turn raw nodes into actual nodes.
tree
(Root | RootContent
)
— original hast tree to transformoptions?
(Options | null | undefined
)
— configuration (optional)Parsed again tree (Root | RootContent
).
This package is fully typed with TypeScript.
It exports the additional type Options
.
The Raw
node type is registered by and exposed from
mdast-util-to-hast
.
Projects maintained by the unified collective are compatible with maintained versions of Node.js.
When we cut a new major release, we drop support for unmaintained versions of
Node.
This means we try to keep the current release line, hast-util-raw@^9
,
compatible with Node.js 16.
Use of hast-util-raw
can open you up to a cross-site scripting (XSS)
attack as raw
nodes are unsafe.
The following example shows how a raw node is used to inject a script that runs
when loaded in a browser.
raw(u('root', [u('raw', '<script>alert(1)</script>')]))
Yields:
<script>alert(1)</script>
Either do not use this utility in combination with user input, or use
hast-util-santize
.
mdast-util-to-hast
— transform mdast to hastrehype-raw
— rehype pluginSee contributing.md
in syntax-tree/.github
for
ways to get started.
See support.md
for ways to get help.
This project has a code of conduct. By interacting with this repository, organization, or community you agree to abide by its terms.
FAQs
hast utility to reparse a tree
The npm package hast-util-raw receives a total of 2,055,770 weekly downloads. As such, hast-util-raw popularity was classified as popular.
We found that hast-util-raw demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
Security News
Socket researchers uncover a malicious npm package posing as a tool for detecting vulnerabilities in Etherium smart contracts.
Security News
Research
A supply chain attack on Rspack's npm packages injected cryptomining malware, potentially impacting thousands of developers.
Research
Security News
Socket researchers discovered a malware campaign on npm delivering the Skuld infostealer via typosquatted packages, exposing sensitive data.