What is @npmcli/arborist?
@npmcli/arborist is a library for managing and manipulating the node_modules tree in a Node.js project. It provides a programmatic interface to interact with the dependency tree, allowing you to perform tasks such as installing, updating, and removing packages, as well as analyzing the dependency graph.
What are @npmcli/arborist's main functionalities?
Load the dependency tree
This feature allows you to load the current state of the node_modules tree. The `loadActual` method reads the actual state of the installed packages.
const Arborist = require('@npmcli/arborist');
const arborist = new Arborist();
async function loadTree() {
const tree = await arborist.loadActual();
console.log(tree);
}
loadTree();
Install a package
This feature allows you to install a new package into your project. The `reify` method is used to apply changes to the node_modules tree, and the `add` option specifies the package to install.
const Arborist = require('@npmcli/arborist');
const arborist = new Arborist({ path: '.' });
async function installPackage() {
await arborist.reify({ add: ['lodash'] });
console.log('Package installed');
}
installPackage();
Remove a package
This feature allows you to remove a package from your project. The `reify` method is used to apply changes to the node_modules tree, and the `rm` option specifies the package to remove.
const Arborist = require('@npmcli/arborist');
const arborist = new Arborist({ path: '.' });
async function removePackage() {
await arborist.reify({ rm: ['lodash'] });
console.log('Package removed');
}
removePackage();
Analyze the dependency graph
This feature allows you to analyze the dependency graph of your project. By loading the actual state of the node_modules tree, you can iterate over the dependencies and inspect their details.
const Arborist = require('@npmcli/arborist');
const arborist = new Arborist();
async function analyzeGraph() {
const tree = await arborist.loadActual();
const dependencies = tree.children;
dependencies.forEach(dep => {
console.log(dep.name, dep.version);
});
}
analyzeGraph();
Other packages similar to @npmcli/arborist
npm
The npm package itself provides a command-line interface for managing Node.js packages. It includes functionalities for installing, updating, and removing packages, as well as managing the dependency tree. However, it is more focused on CLI usage rather than providing a programmatic interface like @npmcli/arborist.
yarn
Yarn is an alternative package manager for Node.js that offers similar functionalities to npm, including dependency management and package installation. Yarn also provides a CLI for managing packages but does not offer the same level of programmatic control over the dependency tree as @npmcli/arborist.
pnpm
pnpm is another package manager for Node.js that focuses on performance and disk space efficiency. It provides functionalities for managing dependencies and installing packages. Like npm and yarn, pnpm is primarily a CLI tool and does not offer a programmatic interface for manipulating the dependency tree like @npmcli/arborist.
@npmcli/arborist
Inspect and manage node_modules
trees.
There's more documentation in the notes
folder.
USAGE
const Arborist = require('@npmcli/arborist')
const arb = new Arborist({
path: '/path/to/package/root',
registry: 'https://registry.npmjs.org',
'@foo:registry': 'https://registry.foo.com/',
token: 'deadbeefcafebad',
_authToken: 'deadbeefcafebad',
auth: 'aXNhYWNzOm5vdCBteSByZWFsIHBhc3N3b3Jk',
username: 'isaacs',
password: 'bm90IG15IHJlYWwgcGFzc3dvcmQ=',
'//registry.foo.com:token': 'blahblahblah',
'//basic.auth.only.foo.com:_auth': 'aXNhYWNzOm5vdCBteSByZWFsIHBhc3N3b3Jk',
'//registry.foo.com:always-auth': true,
})
arb.loadActual().then(tree => {
})
arb.loadVirtual().then(tree => {
})
arb.buildIdealTree(options).then(() => {
})
arb.reify({
save: true,
}).then(() => {
})
DATA STRUCTURES
A node_modules
tree is a logical graph of dependencies overlaid on a
physical tree of folders.
A Node
represents a package folder on disk, either at the root of the
package, or within a node_modules
folder. The physical structure of the
folder tree is represented by the node.parent
reference to the containing
folder, and node.children
map of nodes within its node_modules
folder, where the key in the map is the name of the folder in
node_modules
, and the value is the child node.
A node without a parent is a top of tree.
A Link
represents a symbolic link to a package on disk. This can be a
symbolic link to a package folder within the current tree, or elsewhere on
disk. The link.target
is a reference to the actual node. Links differ
from Nodes in that dependencies are resolved from the target location,
rather than from the link location.
An Edge
represents a dependency relationship. Each node has an edgesIn
set, and an edgesOut
map. Each edge has a type
which specifies what
kind of dependency it represents: 'prod'
for regular dependencies,
'peer'
for peerDependencies, 'dev'
for devDependencies, and
'optional'
for optionalDependencies. edge.from
is a reference to the
node that has the dependency, and edge.to
is a reference to the node that
requires the dependency.
As nodes are moved around in the tree, the graph edges are automatically
updated to point at the new module resolution targets. In other words,
edge.from
, edge.name
, and edge.spec
are immutable; edge.to
is
updated automatically when a node's parent changes.
class Node
All arborist trees are Node
objects. A Node
refers
to a package folder, which may have children in node_modules
.
-
node.name
The name of this node's folder in node_modules
.
-
node.parent
Physical parent node in the tree. The package in whose
node_modules
folder this package lives. Null if node is top of tree.
Setting node.parent
will automatically update node.location
and all
graph edges affected by the move.
-
node.meta
A Shrinkwrap
object which looks up resolved
and
integrity
values for all modules in this tree. Only relevant on root
nodes.
-
node.children
Map of packages located in the node's node_modules
folder.
-
node.package
The contents of this node's package.json
file.
-
node.path
File path to this package. If the node is a link, then this
is the path to the link, not to the link target. If the node is not a
link, then this matches node.realpath
.
-
node.realpath
The full real filepath on disk where this node lives.
-
node.location
A slash-normalized relative path from the root node to
this node's path.
-
node.isLink
Whether this represents a symlink. Always false
for Node
objects, always true
for Link objects.
-
node.isRoot
True if this node is a root node. (Ie, if node.root === node
.)
-
node.root
The root node where we are working. If not assigned to some
other value, resolves to the node itself. (Ie, the root node's root
property refers to itself.)
-
node.isTop
True if this node is the top of its tree (ie, has no
parent
, false otherwise).
-
node.top
The top node in this node's tree. This will be equal to
node.root
for simple trees, but link targets will frequently be outside
of (or nested somewhere within) a node_modules
hierarchy, and so will
have a different top
.
-
node.dev
, node.optional
, node.devOptional
, node.peer
, Indicators
as to whether this node is a dev, optional, and/or peer dependency.
These flags are relevant when pruning dependencies out of the tree or
deciding what to reify. See Package Dependency Flags below for
explanations.
-
node.edgesOut
Edges in the dependency graph indicating nodes that this
node depends on, which resolve its dependencies.
-
node.edgesIn
Edges in the dependency graph indicating nodes that depend
on this node.
-
extraneous
True if this package is not required by any other for any
reason. False for top of tree.
-
node.resolve(name)
Identify the node that will be returned when code
in this package runs require(name)
-
node.errors
Array of errors encountered while parsing package.json or
version specifiers.
class Link
Link objects represent a symbolic link within the node_modules
folder.
They have most of the same properties and methods as Node
objects, with a
few differences.
link.target
A Node object representing the package that the link
references. If this is a Node already present within the tree, then it
will be the same object. If it's outside of the tree, then it will be
treated as the top of its own tree.link.isLink
Always true.link.children
This is always an empty map, since links don't have their
own children directly.
class Edge
Edge objects represent a dependency relationship a package node to the
point in the tree where the dependency will be loaded. As nodes are moved
within the tree, Edges automatically update to point to the appropriate
location.
new Edge({ from, type, name, spec })
Creates a new edge with the
specified fields. After instantiation, none of the fields can be
changed directly.edge.from
The node that has the dependency.edge.type
The type of dependency. One of 'prod'
, 'dev'
, 'peer'
,
or 'optional'
.edge.name
The name of the dependency. Ie, the key in the
relevant package.json
dependencies object.edge.spec
The specifier that is required. This can be a version,
range, tag name, git url, or tarball URL. Any specifier allowed by npm
is supported.edge.to
Automatically set to the node in the tree that matches the
name
field.edge.valid
True if edge.to
satisfies the specifier.edge.error
A string indicating the type of error if there is a problem,
or null
if it's valid. Values, in order of precedence:
DETACHED
Indicates that the edge has been detached from its
edge.from
node, typically because a new edge was created when a
dependency specifier was modified.MISSING
Indicates that the dependency is unmet. Note that this is
not set for unmet dependencies of the optional
type.PEER LOCAL
Indicates that a peerDependency
is found in the
node's local node_modules
folder, and the node is not the top of
the tree. This violates the peerDependency
contract, because it
means that the dependency is not a peer.INVALID
Indicates that the dependency does not satisfy edge.spec
.
edge.reload()
Re-resolve to find the appropriate value for edge.to
.
Called automatically from the Node
class when the tree is mutated.
Package Dependency Flags
The dependency type of a node can be determined efficiently by looking at
the dev
, optional
, and devOptional
flags on the node object. These
are updated by arborist when necessary whenever the tree is modified in
such a way that the dependency graph can change, and are relevant when
pruning nodes from the tree.
| extraneous | peer | dev | optional | devOptional | meaning | prune? |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | | | | | production dep | never |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| X | N/A | N/A | N/A | N/A | nothing depends on | always |
| | | | | | this, it is trash | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | | X | | X | devDependency, or | if pruning dev |
| | | | | not in lock | only depended upon | |
| | | | | | by devDependencies | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | | | X | X | optionalDependency, | if pruning |
| | | | | not in lock | or only depended on | optional |
| | | | | | by optionalDeps | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | | X | X | X | Optional dependency | if pruning EITHER |
| | | | | not in lock | of dep(s) in the | dev OR optional |
| | | | | | dev hierarchy | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | | | | X | BOTH a non-optional | if pruning BOTH |
| | | | | in lock | dep within the dev | dev AND optional |
| | | | | | hierarchy, AND a | |
| | | | | | dep within the | |
| | | | | | optional hierarchy | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | X | | | | peer dependency, or | if pruning peers |
| | | | | | only depended on by | |
| | | | | | peer dependencies | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | X | X | | X | peer dependency of | if pruning peer |
| | | | | not in lock | dev node hierarchy | OR dev deps |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | X | | X | X | peer dependency of | if pruning peer |
| | | | | not in lock | optional nodes, or | OR optional deps |
| | | | | | peerOptional dep | |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | X | X | X | X | peer optional deps | if pruning peer |
| | | | | not in lock | of the dev dep | OR optional OR |
| | | | | | hierarchy | dev |
|------------+------+-----+----------+-------------+---------------------+-------------------|
| | X | | | X | BOTH a non-optional | if pruning peers |
| | | | | in lock | peer dep within the | OR: |
| | | | | | dev hierarchy, AND | BOTH optional |
| | | | | | a peer optional dep | AND dev deps |
+------------+------+-----+----------+-------------+---------------------+-------------------+
- If none of these flags are set, then the node is required by the
dependency and/or peerDependency hierarchy. It should not be pruned.
- If both
node.dev
and node.optional
are set, then the node is an
optional dependency of one of the packages in the devDependency
hierarchy. It should be pruned if either dev or optional deps are
being removed. - If
node.dev
is set, but node.optional
is not, then the node is
required in the devDependency hierarchy. It should be pruned if dev
dependencies are being removed. - If
node.optional
is set, but node.dev
is not, then the node is
required in the optionalDependency hierarchy. It should be pruned if
optional dependencies are being removed. - If
node.devOptional
is set, then the node is a (non-optional)
dependency within the devDependency hierarchy, and a dependency
within the optionalDependency
hierarchy. It should be pruned if
both dev and optional dependencies are being removed. - If
node.peer
is set, then all the same semantics apply as above, except
that the dep is brought in by a peer dep at some point, rather than a
normal non-peer dependency.
Note: devOptional
is only set in the shrinkwrap/package-lock file if
neither dev
nor optional
are set, as it would be redundant.