
Research
Security News
The Landscape of Malicious Open Source Packages: 2025 Mid‑Year Threat Report
A look at the top trends in how threat actors are weaponizing open source packages to deliver malware and persist across the software supply chain.
github.com/mdeiml/tree-sitter-markdown
A Markdown parser for tree-sitter.
The parser is designed to read markdown according to the CommonMark Spec, but some extensions to the spec from different sources such as Github flavored markdown are also included. These can be toggled on or off at compile time. For specifics see Extensions
Even though this parser has existed for some while and obvious issues are mostly solved, there are still lots of inaccuracies in the output. These stem from restricting a complex format such as markdown to the quite restricting tree-sitter parsing rules.
As such it is not recommended to use this parser where correctness is important. The main goal for this parser is to provide syntactical information for syntax highlighting in parsers such as neovim and helix.
All contributions are welcome. For details refer to CONTRIBUTING.md.
Extensions can be enabled at compile time through environment variables. Some
of them are on by default, these can be disabled with the environment variable
NO_DEFAULT_EXTENSIONS
.
Name | Environment variable | Specification | Default | Also enables |
---|---|---|---|---|
Github flavored markdown | EXTENSION_GFM | link | ✓ | Task lists, strikethrough, pipe tables |
Task lists | EXTENSION_TASK_LIST | link | ✓ | |
Strikethrough | EXTENSION_STRIKETHROUGH | link | ✓ | |
Pipe tables | EXTENSION_PIPE_TABLE | link | ✓ | |
YAML metadata | EXTENSION_MINUS_METADATA | link | ✓ | |
TOML metadata | EXTENSION_PLUS_METADATA | link | ✓ | |
Tags | EXTENSION_TAGS | link | ||
Wiki Link | EXTENSION_WIKI_LINK | link |
For guides on how to use this parser in a specific editor, refer to that editor's specific documentation, e.g.
To use the two grammars, first parse the document with the block
grammar. Then perform a second parse with the inline grammar using
ts_parser_set_included_ranges
to specify which parts are inline content.
These parts are marked as inline
nodes. Children of those inline nodes should
be excluded from these ranges. For an example implementation see lib.rs
in
the bindings
folder.
Unfortunately using this parser with WASM/web-tree-sitter does not work out of the box at the moment. This is because the parser uses some C functions that are not exported by tree-sitter by default. To fix this you can statically link the parser to tree-sitter. See also https://github.com/tree-sitter/tree-sitter/issues/949, https://github.com/MDeiml/tree-sitter-markdown/issues/126, and https://github.com/MDeiml/tree-sitter-markdown/issues/93
FAQs
Unknown package
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
Security News
A look at the top trends in how threat actors are weaponizing open source packages to deliver malware and persist across the software supply chain.
Security News
ESLint now supports HTML linting with 48 new rules, expanding its language plugin system to cover more of the modern web development stack.
Security News
CISA is discontinuing official RSS support for KEV and cybersecurity alerts, shifting updates to email and social media, disrupting automation workflows.