Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More →

sql-parser-cst

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

sql-parser-cst

Parses SQL into Concrete Syntax Tree (CST)

0.1.0
Source
npm

Version published: 2 years ago

Weekly downloads: 5.7K; increased by39.08%

Maintainers: 1

Weekly downloads

Created: 2 years ago

Source

SQL Parser CST

SQL Parser which produces a Concrete Syntax Tree (CST).

Unlike a more usual parser which produces an Abstract Syntax Tree (AST), this parser preserves all the syntax elements present in the parsed source code, with the goal of being able to re-create the exact original source code.

Note: This is pre-alpha quality software in early development stages.

Usage

import { parse, show, sqlite } from "sql-parser-cst";

const cst = parse("SELECT * FROM my_table", {
  dialect: sqlite,
  // These are optional:
  preserveSpaces: true, // Adds spaces/tabs
  preserveNewlines: true, // Adds newlines
  preserveComments: true, // Adds comments
  includeRange: true, // Adds source code location data
});

// Change table name
cst.statements[0].clauses[1].tables[0].table.text = "your_table";

// Serialize back to SQL
show(cst); // --> SELECT * FROM your_table

AST versus CST-parsers

For example, given the following SQL:

/* My query */
SELECT ("first_name" || ' jr.') as fname
-- use important table
FROM persons

An AST-parser might parse this to the following abstract syntax tree:

{
  "type": "select_statement",
  "columns": [
    {
      "type": "alias",
      "expr": {
        "type": "binary_expr",
        "left": { "type": "column_ref", "column": "first_name" },
        "operator": "||",
        "right": { "type": "string", "value": " jr." }
      },
      "alias": "fname"
    }
  ],
  "from": [{ "type": "table_ref", "table": "persons" }]
}

Note that the above AST is missing the following information:

comments
whitespace (e.g. where the newlines are)
case of keywords (e.g. whether AS or as was written)
whether an identifier was quoted or not (and with what kind of quotes)
whether an expression is wrapped in additional (unnecessary) parenthesis.

In contrast, this CST parser produces the following concrete syntax tree, which preserves all of this information:

{
  "type": "select_statement",
  "clauses": [
    {
      "type": "select_clause",
      "selectKw": { "type": "keyword", "text": "SELECT" },
      "columns": [
        {
          "type": "alias",
          "expr": {
            "type": "paren_expr",
            "expr": {
              "type": "binary_expr",
              "left": {
                "type": "column_ref",
                "column": { "type": "identifier", "text": "\"first_name\"" }
              },
              "operator": "||",
              "right": { "type": "string", "text": "' jr.'" }
            }
          },
          "asKw": { "type": "keyword", "text": "as" },
          "alias": { "type": "identifier", "text": "fname" }
        }
      ]
    },
    {
      "type": "from_clause",
      "fromKw": { "type": "keyword", "text": "FROM" },
      "tables": [
        {
          "type": "table_ref",
          "table": { "type": "keyword", "text": "persons" }
        }
      ],
      "leading": [
        { "type": "newline", "text": "\n" },
        { "type": "line_comment", "text": "-- use important table" },
        { "type": "newline", "text": "\n" }
      ]
    }
  ],
  "leading": [
    { "type": "block_comment", "text": "/* My query */" },
    { "type": "newline", "text": "\n" }
  ]
}

Note the following conventions:

All keywords are preserved in type: keyword nodes, which are usually stored in fields named like someNameKw.
Parenthesis is represented by separate type: paren_expr node.
The original source code representation of strings, identifiers, keywords, etc is preserved in text fields.
Each node can have leading and trailing fields, which store comments and newlines immediately before or after that node. These fields will also contain information about regular spaces/tabs (e.g. {"type": "space", "text": " \t"}). This has been left out from this example for the sake of simplicity.

Acknowledgements

This started as a fork of node-sql-parser, which is based on @flora/sql-parser, which in turn was extracted from Alibaba's nquery module.

There's very little left of the original code though.

FAQs

What is sql-parser-cst?

Is sql-parser-cst popular?

Is sql-parser-cst well maintained?

Package last updated on 27 Oct 2022

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

sql-parser-cst

SQL Parser CST

Usage

AST versus CST-parsers

Acknowledgements

Related posts

New Python Packaging Proposal Aims to Solve Phantom Dependency Problem with SBOMs

The Cyber Security Council Podcast: Securing Modern Applications in a Decentralized Open Source World