What is css-select?
The css-select package is a CSS selector engine that enables querying and manipulating HTML and XML documents using CSS selectors. It can be used to select elements from a DOM tree, similar to how you would select elements in the browser using CSS.
What are css-select's main functionalities?
Selecting elements
This feature allows you to select elements from a DOM tree using CSS selectors. The code sample demonstrates selecting all <p> elements from a given HTML string.
const cssSelect = require('css-select');
const parseHTML = require('htmlparser2').parseDocument;
const dom = parseHTML('<div><p>Hello World</p></div>');
const elems = cssSelect('p', dom);
console.log(elems[0].children[0].data); // 'Hello World'
Matching elements
This feature checks if a given element matches a CSS selector. The code sample demonstrates checking if the first child of the root element has a class 'foo'.
const cssSelect = require('css-select');
const parseHTML = require('htmlparser2').parseDocument;
const dom = parseHTML('<div class='foo'><p>Hello World</p></div>');
const isMatch = cssSelect.is(dom.children[0], '.foo');
console.log(isMatch); // true
Pseudo-selectors
This feature allows the use of pseudo-selectors to select elements. The code sample demonstrates selecting the first child of a list.
const cssSelect = require('css-select');
const parseHTML = require('htmlparser2').parseDocument;
const dom = parseHTML('<ul><li>Item 1</li><li>Item 2</li></ul>');
const firstItem = cssSelect(':first-child', dom);
console.log(firstItem[0].children[0].data); // 'Item 1'
Other packages similar to css-select
cheerio
Cheerio is a fast, flexible, and lean implementation of core jQuery designed specifically for the server. It uses css-select under the hood for its CSS selector engine, providing a familiar jQuery-like API for manipulating the DOM.
jsdom
jsdom is a pure-JavaScript implementation of many web standards, notably the WHATWG DOM and HTML Standards, for use with Node.js. It allows you to create a DOM from an HTML string and then interact with it as if you were in the browser, including using CSS selectors to find elements.
soupselect
Soupselect is a module that ports the functionality of Python's BeautifulSoup library to Node.js. It allows for similar CSS selector-based element selection but is less actively maintained and has fewer features compared to css-select.
css-select

a CSS selector compiler/engine
What?
css-select turns CSS selectors into functions that tests if elements match them. When searching for elements, testing is executed "from the top", similar to how browsers execute CSS selectors.
In its default configuration, css-select queries the DOM structure of the domhandler
module (also known as htmlparser2 DOM).
Features:
- Full implementation of CSS3 selectors
- Partial implementation of jQuery/Sizzle extensions
- Very high test coverage
- Pretty good performance
Why?
The traditional approach of executing CSS selectors, named left-to-right execution, is to execute every component of the selector in order, from left to right (duh). The execution of the selector a b
for example will first query for a
elements, then search these for b
elements. (That's the approach of eg. Sizzle
, nwmatcher
and qwery
.)
While this works, it has some downsides: Children of a
s will be checked multiple times; first, to check if they are also a
s, then, for every superior a
once, if they are b
s. Using Big O notation, that would be O(n^(k+1))
, where k
is the number of descendant selectors (that's the space in the example above).
The far more efficient approach is to first look for b
elements, then check if they have superior a
elements: Using big O notation again, that would be O(n)
. That's called right-to-left execution.
And that's what css-select does – and why it's quite performant.
How does it work?
By building a stack of functions.
Wait, what?
Okay, so let's suppose we want to compile the selector a b
again, for right-to-left execution. We start by parsing the selector, which means we turn the selector into an array of the building-blocks of the selector, so we can distinguish them easily. That's what the css-what
module is for, if you want to have a look.
Anyway, after parsing, we end up with an array like this one:
[
{ type: 'tag', name: 'a' },
{ type: 'descendant' },
{ type: 'tag', name: 'b' }
]
Actually, this array is wrapped in another array, but that's another story (involving commas in selectors).
Now that we know the meaning of every part of the selector, we can compile it. That's where it becomes interesting.
The basic idea is to turn every part of the selector into a function, which takes an element as its only argument. The function checks whether a passed element matches its part of the selector: If it does, the element is passed to the next turned-into-a-function part of the selector, which does the same. If an element is accepted by all parts of the selector, it matches the selector and double rainbow ALL THE WAY.
As said before, we want to do right-to-left execution with all the big O improvements nonsense, so elements are passed from the rightmost part of the selector (b
in our example) to the leftmost (which would be c
of course a
).
//TODO: More in-depth description. Implementation details. Build a spaceship.
API
var CSSselect = require("css-select");
CSSselect(query, elems, options)
Queries elems
, returns an array containing all matches.
query
can be either a CSS selector or a function.
elems
can be either an array of elements, or a single element. If it is an element, its children will be queried.
options
is described below.
Aliases: CSSselect.selectAll(query, elems)
, CSSselect.iterate(query, elems)
.
CSSselect.compile(query)
Compiles the query, returns a function.
CSSselect.is(elem, query, options)
Tests whether or not an element is matched by query
. query
can be either a CSS selector or a function.
CSSselect.selectOne(query, elems, options)
Arguments are the same as for CSSselect(query, elems)
. Only returns the first match, or null
if there was no match.
Options
xmlMode
: When enabled, tag names will be case-sensitive. Default: false
.
strict
: Limits the module to only use CSS3 selectors. Default: false
.
rootFunc
: The last function in the stack, will be called with the last element that's looked at. Should return true
.
Supported selectors
As defined by CSS 4 and / or jQuery.
- Universal (
*
)
- Tag (
<tagname>
)
- Descendant (
)
- Child (
>
)
- Parent (
<
) *
- Sibling (
+
)
- Adjacent (
~
)
- Attribute (
[attr=foo]
), with supported comparisons:
[attr]
(existential)
=
~=
|=
*=
^=
$=
!=
*
- Also,
i
can be added after the comparison to make the comparison case-insensitive (eg. [attr=foo i]
) *
- Pseudos:
:not
:contains
*
:icontains
* (case-insensitive version of :contains
)
:has
*
:root
:empty
:parent
*
:[first|last]-child[-of-type]
:only-of-type
, :only-child
:nth-[last-]child[-of-type]
:link
, :visited
(the latter doesn't match any elements)
:selected
*, :checked
:enabled
, :disabled
:required
, :optional
:header
, :button
, :input
, :text
, :checkbox
, :file
, :password
, :reset
, :radio
etc. *
:matches
*
*: Not part of CSS3
License: BSD-like