htmldiff.js
Diff and mark up HTML with <ins> and <del> tags.
Origin
Quote from the original source of this fork:
htmldiff.js is a JavaScript port of https://github.com/myobie/htmldiff by Keanu Lee at Inkling.
htmldiff.js is based on this fork and adds a few things:
- Diffing of video, math, widget, iframe, img and svg tags.
- Ability to set atomic tags via the API.
- A command line interface.
- TypeScript support.
- Better documentation.
See also Credits below.
Description
htmldiff takes two HTML snippets or files and marks the differences between them with <ins> and <del> tags. The diffing understands HTML, so instead of doing a pure text diff it inserts the appropriate tags for changed, added, or deleted text nodes, single tags, or tag hierarchies.
The module can be used as a module in Node.js, with RequireJS, or even just as a script tag.
API
The module exports a single default function:
JavaScript:
diff(before, after, className, dataPrefix, atomicTags);
TypeScript:
function diff(before: string, after: string, className?: string | null, dataPrefix?: string | null, atomicTags?: string | null): string;
Parameters
before (string) is the original HTML text.
after (string) is the HTML text after the changes have been applied.
The return value is a string with the diff result, marked by <ins> and <del> tags. The function has three optional parameters. If an empty string or null is passed for any of these three parameters, it will be ignored:
className (string) is added as a class attribute on every inserted <ins> and <del> tag.
dataPrefix (string) The data prefix to use for data attributes. The so-called operation index data attribute will be named data-${dataPrefix-}operation-index. If not used, the default attribute name data-operation-index will be added on every inserted <ins> and <del> tag. The value of this attribute is an auto-incremented counter.
atomicTags (string) Comma-separated list of tag names. The list has to be in the form tag1,tag2,... e.g. head,script,style. An atomic tag is one whose child nodes should not be compared; the entire tag should be treated as one token. This is useful for tags where it does not make sense to insert <ins> and <del> tags. If not used, the default list will be used: iframe,object,math,svg,script,video,head,style.
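The naming rule for the data attribute and the expected shape of the atomicTags string can be sketched with two small helpers. Both functions are hypothetical and written only for illustration; they are not part of the module's API. The sketch assumes that a non-empty prefix is followed by a hyphen, as the pattern data-${dataPrefix-}operation-index suggests.

```javascript
// Hypothetical helpers for illustration only; neither is exported by the module.

// Returns the attribute name that would carry the operation index.
// Assumption: a non-empty prefix is inserted after "data-" and followed by "-".
function operationIndexAttribute(dataPrefix) {
  return dataPrefix
    ? `data-${dataPrefix}-operation-index`
    : "data-operation-index";
}

// Builds the comma-separated atomicTags string from an array of tag names,
// trimming whitespace and dropping empty entries.
function toAtomicTags(tagNames) {
  return tagNames
    .map((name) => name.trim())
    .filter((name) => name.length > 0)
    .join(",");
}

console.log(operationIndexAttribute("diff"));
console.log(operationIndexAttribute(null));
console.log(toAtomicTags(["head", "script", "style"]));
```

Passing an empty string or null for the prefix falls back to the default attribute name, which mirrors how the optional parameters are described above.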
Example
JavaScript:
const diff = require('node-htmldiff');
console.log(diff('<p>This is some text</p>', '<p>That is some more text</p>', 'myClass'));
TypeScript:
import diff = require("node-htmldiff");
console.log(diff("<p>This is some text</p>", "<p>That is some more text</p>", "myClass"));
Please note that diff is only an arbitrary name; since the module exports only one default function, you can use whatever name you like, e.g., diffHTML.
Result:
<p><del data-operation-index="1" class="myClass">This</del><ins data-operation-index="1" class="myClass">That</ins> is some<ins data-operation-index="3" class="myClass"> more</ins> text</p>
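Because every inserted tag carries an operation index, the result string can be post-processed to list the individual changes. The following is a minimal sketch under stated assumptions: collectOperations is a hypothetical helper, not part of the module; it assumes the default attribute name data-operation-index and that the marked-up fragments contain no nested tags. A regular expression is used only to keep the example short; a real HTML parser would be more robust.

```javascript
// Hypothetical helper for illustration; it is not part of the module.
// Assumes the default attribute name and no nested tags inside <ins>/<del>.
function collectOperations(diffedHtml) {
  const pattern =
    /<(ins|del)\b[^>]*\bdata-operation-index="(\d+)"[^>]*>(.*?)<\/\1>/g;
  const operations = [];
  for (const match of diffedHtml.matchAll(pattern)) {
    operations.push({
      tag: match[1],           // "ins" or "del"
      index: Number(match[2]), // value of the operation index attribute
      text: match[3],          // inserted or deleted text
    });
  }
  return operations;
}

// Sample input modeled on the result shown above.
const sampleResult =
  '<p><del data-operation-index="1" class="myClass">This</del>' +
  '<ins data-operation-index="1" class="myClass">That</ins> is some' +
  '<ins data-operation-index="3" class="myClass"> more</ins> text</p>';

console.log(collectOperations(sampleResult));
```

In this sample, the deletion and the insertion that share index 1 belong to the same replacement, which is what makes grouping by index useful.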
Command line interface
htmldiff beforeFile afterFile diffedFile [-c className] [-p dataPrefix] [-t atomicTags]
Parameters:
- beforeFile: An HTML input file in its original form.
- afterFile: An HTML input file, based on beforeFile but with changes.
- diffedFile: Name of the diffed HTML output file. All differences between beforeFile and afterFile will be surrounded with <ins> and <del> tags. If diffedFile is - (minus), the result will be written with console.log() to stdout.
Options:
-c className, -p dataPrefix and -t atomicTags are all optional. For a description, please see the API documentation above.
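How the command line arguments map onto the API parameters can be sketched as follows. This is not the actual implementation of the htmldiff command; parseHtmldiffArgs is a hypothetical function, and its error messages and the writeToStdout field are assumptions made for illustration.

```javascript
// Hypothetical argument parser for illustration; not the real CLI code.
function parseHtmldiffArgs(argv) {
  // Maps each documented flag to the API parameter it corresponds to.
  const flagToOption = { "-c": "className", "-p": "dataPrefix", "-t": "atomicTags" };
  const options = { className: null, dataPrefix: null, atomicTags: null };
  const positional = [];

  for (let i = 0; i < argv.length; i++) {
    const arg = argv[i];
    if (Object.prototype.hasOwnProperty.call(flagToOption, arg)) {
      if (i + 1 >= argv.length) {
        throw new Error(`Missing value for ${arg}`);
      }
      options[flagToOption[arg]] = argv[++i]; // consume the flag's value
    } else {
      positional.push(arg); // a lone "-" lands here and means stdout
    }
  }

  if (positional.length !== 3) {
    throw new Error("Expected: beforeFile afterFile diffedFile");
  }
  const [beforeFile, afterFile, diffedFile] = positional;
  return {
    beforeFile,
    afterFile,
    diffedFile,
    writeToStdout: diffedFile === "-",
    ...options,
  };
}

console.log(parseHtmldiffArgs(["old.html", "new.html", "-", "-c", "myClass"]));
```

Options that are not given stay null, which matches the API's rule that null values are ignored.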
Development
After cloning the repository, run npm i or npm install to install the necessary dependencies. A run of npm run make creates the JavaScript output file. npm run lint checks the TypeScript sources with TSLint. npm test runs all the tests from the test directory. npm run testsample diffs the HTML sample files from the sample directory and logs the result to the console.
The command line interface of htmldiff is developed in TypeScript, so you have to run npm run make once to create the JavaScript output file.
Credits
This module wouldn't have been possible without code from the following projects/persons:
License
MIT © idesis GmbH, Max-Keith-Straße 66 (E 11), D-45136 Essen
See the LICENSE file for details.