cq: Code Query
A tool to extract code snippets using selectors (instead of line numbers)
Try the demo
Supports JavaScript ES5, ES6, JSX, and TypeScript
cq
supports sophisticated, production-ready selectors and is used for all of the Fullstack.io Books
If you're a developer and you're interested in writing a programming book, but you're not sure where to start, then read here
Online Demo
Try the demo
cq Suite
Install
$ npm install --global @fullstackio/cq
Usage
$ cq <query> <file>
# or
$ cat file | cq <query>
Examples
Say we have a file examples/basics.js
with the following code:
{lang=javascript,crop-query=.bye-EOF}
<<
Get the bye()
function:
Query:
$ cq '.bye' examples/basics.js
Result:
{lang=javascript,crop-query=.bye}
<<
Get the calcArea()
function on the Barn
class:
Query:
$ cq '.Barn .calcArea' examples/basics.js
Result:
{lang=javascript,crop-query=.Barn .calcArea}
<<
Get the bye()
function plus the line after:
This example uses an operator context
.
The API is: context(identifier, linesBefore, linesAfter)
Query:
$ cq 'context(.bye,0,1)' examples/basics.js
Result:
{lang=javascript,crop-query=context(.bye,0,1)}
<<
Get the range of constructor
through calcArea
, inclusive, of the Barn
class
$ cq '.Barn .constructor-.calcArea' examples/basics.js
{lang=javascript,crop-query=.Barn .constructor-.calcArea}
<<
json
flag
If you pass --json
you'll get the results in JSON, which can be useful for further processing:
$ cq --json 'context(.bye,0,1)' examples/basics.js
{
"code": "const bye = function() {\n return 'bye';\n}\nbye(); // -> 'bye'",
"start": 598,
"end": 659,
"start_line": 25,
"end_line": 28
}
TypeScript Support
cq
works with TypeScript as well. Say we had the following TypeScript File AuthService.ts
:
{lang=typescript,crop-query=.Injectable-EOF}
<<
Get the AUTH_PROVIDERS
export:
Query:
$ cq '.AUTH_PROVIDERS' examples/AuthService.ts
Result:
{lang=typescript,crop-query=.AUTH_PROVIDERS}
<<
Get the isLoggedIn()
function through AUTH_PROVIDERS
Query:
$ cq '(.AuthService .isLoggedIn)-.AUTH_PROVIDERS' examples/AuthService.ts
Result:
{lang=typescript,crop-query=(.AuthService .isLoggedIn)-.AUTH_PROVIDERS}
<<
Searching for strings
cq
can search for strings as well as identifiers. Say we have the following test:
{lang=javascript,crop-query='chai'-EOF}
<<
We can get the first test:
Query:
$ cq "'My First Test'" examples/mocha.test.js
Result:
{lang=javascript,crop-query='My First Test'}
<<
Or get the it
block in the second test:
Query:
$ cq "'My Second Test' 'basic assert'" examples/mocha.test.js
Result:
{lang=javascript,crop-query='My Second Test' 'basic assert'}
<<
Sometimes we want to pull the comments before a selection. cq
supports this using the comments()
operator:
File comments.js
:
{lang=javascript,crop-query=1-EOF}
<<
Get the bye()
function with comments:
Query:
$ cq 'comments(.bye)' comments.js
Result:
{lang=javascript,crop-query=comments(.bye)}
<<
This file was itself generated using cq
.
See many more examples in the /examples
directory
Features
- Extract chunks of code from text using robust selectors (vs. brittle line numbers)
- Locate ranges of code using identifiers
- Parses ES6 & JSX (with babylon)
- Parses TypeScript
Operators
cq
supports a number of operators that modify the selection:
Motivation
When writing blog posts, tutorials, and books about programming there's a tension between code that gets copied and pasted into the text and runnable code on disk.
If you copy and paste your code into the copy, then you're prone to typos, missing steps. When things change, you have to update all of the copypasta and eyeball it to make sure you didn't miss anything. Mistakes are really easy to make because you can't really test code that's in your manuscript without it's context.
A better solution is to keep your code (or steps of your code) as runnable examples on disk. You can then load the code into your manuscript with some pre-processing.
The problem with the code-on-disk approach is how to designate the ranges of code you wish to import. Line numbers are the most obvious approach, but if you add or remove a line of code, then you have to adjust all line numbers accordingly.
cq
is a tool that lets you specify selectors to extract portions of code. Rather than using brittle line numbers, instead cq
lets you query your code. It uses babylon
to understand the semantics of your code and will extract the appropriate lines.
Query Grammar
.Identifier
Examples:
A dot .
preceding JavaScript identifier characters represents an identifier.
In this code:
const Simple = React.createClass({
render() {
return <div>{this.renderName()}</div>;
}
});
The query .Simple
would find the whole const Simple = ...
variable declaration.
Searches for identifiers traverse the whole tree, relative to the parent, and return the first match. This means that you do not have to start at the root. In this case you could query for .render
and would receive the render()
function. That said, creating more specific queries can help in the case where you want to disambiguate.
[space]
Examples:
.Simple .render
.foo .bar .baz
The space in a query selection expression designates a parent for the next identifier. For instance, the query .Simple .render
will first look for the identifier Simple
and then find the render
function that is a child of Simple
.
The space indicates to search for the next identifier anywhere within the parent. That is, it does not require that the child identifier be a direct child the parent.
In this way the space is analogous to the space in a CSS selector. E.g. search for any child that matches.
cq
does not yet support the >
notation (which would require the identifier to be a direct child), but we may in the future.
You can write child selection in parenthesis ()
if there is ambiguity. E.g.: (.foo .bar)
.
Range
Examples:
.constructor-.calcArea
.Barn .constructor-.calcArea
1-(.AuthService .login)
.foo-EOF
Given:
class Barn {
constructor(height, width) {
this.height = height;
this.width = width;
}
calcArea() {
return this.height * this.width;
}
}
A pair of selections (e.g. identifiers) joined by a dash -
form a range. A range will emit the code from the beginning of the match of the first identifier to the end of the match of the last.
You can use a parent identifier to limit the scope of the search of the range as in the query: .Barn .constructor-.calcArea
If you'd like to specify a line number, you can use a number (instead of an identifier) in a range. For example the query: 30-35
will give lines 30 through 35, inclusive.
If you want to specify a child selector at the end of a range, use parenthesis as in this query: 1-(.AuthService .login)
. The previous query will return the lines from line 1 to the end of the login()
function on AuthService
.
You can use the special line number EOF
to select until the end-of-file.
'String'
Examples:
'My Test'
'My Test' 'my should'
2-'My Test'
You can use a single-quoted string as a selection and cq
will search for that string. When a string is found, cq
will emit the statement / block associated with that string.
For instance, given:
describe("My First Test", () => {
it("basic assert", () => {
assert.equal(1, 1);
});
});
You could search for the strings 'My First Test'
or 'basic assert'
and receive the appropriate selection.
Operators
Examples:
context(.bye,1,1)
upto(.bye)
comments(.bye)
Given:
const bye = function() {
return "bye";
};
bye();
Operators allow you to change the result of the inner selection.
context()
context(selection, numLinesBeforeStart, numLinesAfterEnd)
The context()
operation takes line numbers before and after the selection. For example, context(.foo, 2, 2)
will give two lines before and two lines after the .foo
node.
Keep in mind that the selection
denotes a node which can span multiple lines. With that in mind, positive numbers "expand" the selection and negative numbers "contract". That is, if numLinesBeforeStart
is negative, then it can be interpreted as moving the start forward (increasing line numbers). Similarly, if numLinesAfterEnd
is negative, the end is moved backwards (decreasing line numbers, towards the top of the document).
context()
modifies the range that would be returned from selection
. If you'd like to specify a specific number of lines range relative to a selection
, then see the window()
operator.
window()
window(selection, startNumLinesAfter, endNumLinesAfter, reverse=false)
window()
returns a specific number of lines relative to selection
. For example, window(.foo, 0, 4)
would give 5 lines, the foo
identifier and the four lines following.
It differs from context()
in that both arguments to window()
are relative to the start of the selection
.
window()
is useful for extracting a specific range of lines near a particular selection
. The selection
is considered to start at index 0
, which means negative numbers denote the lines before the start of the selection.
If reverse
is true, start the window at the end of the selection.
firstLineOf()
Sugar - same as window(selection, 0, 0)
lastLineOf()
Sugar - same as window(selection, 0, 0, true)
upto()
The upto()
operation will return the code up-to, but not including, the selection. A convenient (but potentially confusing) default is that the upto()
operation trims whitespace. This is normally what you want, but you have to be careful when using upto()
and context()
together (because upto()
may trim lines).
choose()
choose(selection, matchIdx)
It's possible for a selection
to match more than one node. While you can often disambiguate with child selections, the choose()
operator lets you specify a particular match by index.
matchIdx
starts at 0
. Without the choose
operator, the default behavior of any selection
is: choose(selection, 0)
. Say you had two instances of the identifier .foo
then you could grab the second by using choose(.foo, 1)
.
choose
can be a bit brittle in that it specifies a specific matchIdx
. A potentially better choice is the after()
operator which finds the first selection
that occurs after a companion selector.
after()
after(selection, afterSelection)
after
finds the first selection
that occurs after afterSelection
.
The comments()
operation will return the selection plus the leading comments before the selection.
decorators()
The decorators()
operation will return the selection plus the decorators.
Say we have the following code:
import { Component } from "@angular/core";
@Component({
selector: "home",
template: `
<h1>Welcome!</h1>
`
})
export class HomeComponent {}
When we grab the selection .HomeComponent
we'll get just the class
$ cq '.HomeComponent' examples/HomeComponent.ts
export class HomeComponent {
}
We use decorators()
to get the whole thing:
$ cq 'decorators(.HomeComponent)' examples/HomeComponent.ts
@Component({
selector: 'home',
template: `<h1>Welcome!</h1>`
})
export class HomeComponent {
}
One thing to keep in mind is that decorations are actually considered children of the node they are attached to. The @Component
decoration is also an identifier. This means we get the @Component
decoration by itself like this:
$ cq '.HomeComponent .Component' examples/HomeComponent.ts
@Component({
selector: 'home',
template: `<h1>Welcome!</h1>`
})
Other Features
Multiple Queries with Gap Filling
You can have multiple queries and any if they are not contiguous they can be filled with a gap filler:
$ cq '(firstLineOf(.AuthService),.logout,.isLoggedIn,lastLineOf(.AuthService))' examples/AuthService.ts
{lang=typescript,crop-query=(firstLineOf(.AuthService),.logout,.isLoggedIn,lastLineOf(.AuthService))}
<<[](examples/AuthService.ts)
This gap filler can be customized with the --gapFiller
option on the commandline.
CLI Usage
To pre-process your markdown on the CLI use the cqmd utility.
Library Usage
var cq = require("@fullstackio/cq").default;
var results = cq(codeString, query);
console.log(results.code);
Future
- Add queries for header information such as comments,
import
s, and require
s - Add the ability to extract several sections in a single query
- Create a remark plugin to pull code into Markdown using queries
- Support extracting lines of HTML (using regular CSS selectors)
Limitations
- It's possible to specify invalid queries and the error messages are not helpful
- Only one selector is possible per query
- Some sections of code are not directly selectable (because the query language is not yet expressive enough)
- You can only select whole lines (e.g. comments on the same line after an expression are captured) - this is by design, but it should be configurable
Query API Stability
The query API may change (see Future). Any breaking API changes (query or otherwise) will result in a major version bump.
Contributing
Please feel free to submit pull requests!
Authors
Originally written by Nate Murray.
Related
cqmd
- CLI tool to pre-process markdown with cq
. (Used to generate the current README)remark-cq
- a remark plugin to slurp code snippets with cq- GraspJS - another tool to search JavaScript code based on structure
- Pygments - a handy tool to colorize code snippets on the command line
- ASTExplorer - an online tool to explore the AST of your code
Dependencies
Fullstack React Book
This repo was written and is maintained by the Fullstack React team. If you're looking to learn React, there's no faster way than by spending a few hours with the Fullstack React book.
License
MIT