What is data-urls?
The data-urls npm package is used to parse and serialize data URLs as per the WHATWG specification. It allows you to work with data URLs in a structured way, extracting components like MIME type, base64 encoding, and the actual data.
What are data-urls's main functionalities?
Parsing Data URLs
This feature allows you to parse a data URL into its components. The `parse` function takes a data URL string and returns an object containing the MIME type, whether it's base64 encoded, and the actual data.
const { parse } = require('data-urls');
const dataUrl = 'data:text/plain;base64,SGVsbG8sIFdvcmxkIQ==';
const parsed = parse(dataUrl);
console.log(parsed);
Serializing Data URLs
This feature allows you to serialize data into a data URL. The `serialize` function takes an object with MIME type and data, and returns a data URL string.
const { serialize } = require('data-urls');
const data = Buffer.from('Hello, World!');
const mimeType = 'text/plain';
const serialized = serialize({ mimeType, body: data });
console.log(serialized);
Other packages similar to data-urls
datauri
The datauri package is used to convert file paths or buffers to data URLs. It provides a simple API for encoding files or buffers into data URLs, but it does not offer as detailed parsing capabilities as data-urls.
dataurl
The dataurl package provides utilities for parsing and generating data URLs. It offers similar functionality to data-urls but with a different API design. It is less focused on strict adherence to the WHATWG specification.
data-uri-to-buffer
The data-uri-to-buffer package is focused on converting data URLs to buffers. It is useful for extracting the binary data from a data URL but does not provide serialization capabilities.
Parse data:
URLs
This package helps you parse data:
URLs according to the WHATWG Fetch Standard:
const parseDataURL = require("data-url");
const textExample = parseDataURL("data:,Hello%2C%20World!");
console.log(textExample.mimeType.toString());
console.log(textExample.body.toString());
const htmlExample = dataURL("data:text/html,%3Ch1%3EHello%2C%20World!%3C%2Fh1%3E");
console.log(htmlExample.mimeType.toString());
console.log(htmlExample.body.toString());
const pngExample = parseDataURL("" +
"ANSUhEUgAAAAUAAAAFCAYAAACNbyblAAAAHElEQVQI12P4" +
"//8/w38GIAXDIBKE0DHxgljNBAAO9TXL0Y4OHwAAAABJRU" +
"5ErkJggg==");
console.log(pngExample.mimeType.toString());
console.log(pngExample.body);
API
This package's main module's default export is a function that accepts a string and returns a { mimeType, body }
object, or null
if the result cannot be parsed as a data:
URL.
- The
mimeType
property is an instance of whatwg-mimetype's MIMEType
class. - The
body
property is a Node.js Buffer
instance.
As shown in the examples above, both of these have useful toString()
methods for manipulating them as string values. However…
A word of caution on string decoding
Because Node.js's Buffer.prototype.toString()
assumes a UTF-8 encoding, simply doing dataURL.body.toString()
may not work correctly if the data:
URLs contents were not originally written in UTF-8. This includes if the encoding is "US-ASCII", aka windows-1252, which is notable for being the default in many cases.
A more complete decoding example would use the whatwg-encoding package as follows:
const parseDataURL = require("data-url");
const { labelToName, decode } = require("whatwg-encoding");
const dataURL = parseDataURL(arbitraryString);
const encodingName = labelToName(dataURL.mimeType.parameters.get("charset"));
const bodyDecoded = decode(dataURL.body, encodingName);
For example, given an arbitraryString
of data:,Hello!
, this will produce a bodyDecoded
of "Hello!"
, as expected. But given an arbitraryString
of "data:,Héllo!"
, this will correctly produce a bodyDecoded
of "Héllo!"
, whereas just doing dataURL.body.toString()
will give back "Héllo!"
.
In summary, only use dataURL.body.toString()
when you are very certain your data is inside the ASCII range (i.e. code points within the range U+0000 to U+007F).
Advanced functionality: parsing from a URL record
If you are using the whatwg-url package, you may already have a "URL record" object on hand, as produced by that package's parseURL
export. In that case, you can use this package's fromURLRecord
export to save a bit of work:
const { parseURL } = require("whatwg-url");
const dataURLFromURLRecord = require("data-url").fromURLRecord;
const urlRecord = parseURL("data:,Hello%2C%20World!");
const dataURL = dataURLFromURLRecord(urlRecord);
In practice, we expect this functionality only to be used by consumers like jsdom, which are using these packages at a very low level.