Product
Introducing Java Support in Socket
We're excited to announce that Socket now supports the Java programming language.
Split an email message stream into structured parts and join these parts back into an email message stream. If you do not modify the parsed data then the rebuilt message should be an exact copy of the original.
This is useful if you want to modify some specific parts of an email, for example to add tracking images or unsubscribe links to the HTML part of the message without changing any other parts of the email.
Supports both <CR><LF> and <LF> (or mixed) line endings. Embedded rfc822 messages are also parsed, in this case you would get two sequential 'node' objects with no 'data' or 'body' in between (first 'node' is for the container node and second for the root node of the embedded message).
In general this module is a primitive for building e-mail parsers/handlers like mailparser. Alternatively you could use it to parse other MIME-like structures, for example mbox files or multipart/form-data uploads.
See rewrite-html.js for an usage example where HTML content is modified on the fly (example script adds a link to every text/html node)
Install from npm
npm install mailsplit --save
Splitter
is a transformable stream where input is a byte stream and output is an object stream.
let Splitter = require('mailsplit').Splitter;
let splitter = new Splitter(options);
Where
'data' event emits the next parsed object from the message stream.
'node'
means that we reached the next mime node and the previous one is completely processed'data'
provides us multipart body parts, including boundaries. This data is not directly related to any specific multipart node, basically it includes everything between the end of one normal node and the header of next node'body'
provides us next chunk for the last seen 'node'
element'body'
and 'data'
partsElement with type 'node'
has a bunch of header related methods and properties, see below.
Example
let Splitter = require('mailsplit').Splitter;
let splitter = new Splitter();
// handle parsed data
splitter.on('data', data => {
switch (data.type) {
case 'node':
// node header block
process.stdout.write(data.getHeaders());
break;
case 'data':
// multipart message structure
// this is not related to any specific 'node' block as it includes
// everything between the end of some node body and between the next header
process.stdout.write(data.value);
break;
case 'body':
// Leaf element body. Includes the body for the last 'node' block. You might
// have several 'body' calls for a single 'node' block
process.stdout.write(data.value);
break;
}
});
// send data to the parser
someMessagStream.pipe(splitter);
If the data object has type='node'
then you can modify headers for that node (headers can be modified until the data object is passed over to a Joiner
)
You can manipulate specific header keys as well using the headers
object
["Subject: This is subject line"]
)"This is subject line"
)1
as the value, then it will update the second) or if no relative key index is specified, then it will remove all header value matches found for the key and append one at the last matching key index found with the specified value. If a relative key index is specified and it does not exist then it will be replaced (eg if there are two headers of X-Foo-Bar
and you pass 2
, meaning it will update the third, no updates will be made since the third did not exist)Additionally you can check the details of the node with the following properties automatically parsed from the headers:
'attachment'
, 'inline'
or false
if not setJoiner
is a transformable stream where input is the object stream form Splitter
and output is a byte stream.
let Splitter = require('mailsplit').Splitter;
let Joiner = require('mailsplit').Joiner;
let splitter = new Splitter();
let joiner = new Joiner();
// pipe a message source to splitter, then joiner and finally to stdout
someMessagStream
.pipe(splitter)
.pipe(joiner)
.pipe(process.stdout);
Rewriter
is a simple helper class to modify nodes that match a filter function. You can pipe a Splitter stream directly into a Rewriter and pipe Rewriter output to a Joiner.
Rewriter takes the following argument:
filterFunc
returns trueOnce Rewriter finds a matching node, it emits the following event:
data
data.node
includes the current node with headersdata.decoder
is the decoder stream that you can read data fromdata.encoder
is the encoder stream that you can write data to. Whatever you write into that stream will be encoded properly and inserted as the content of the current nodelet Splitter = require('mailsplit').Splitter;
let Joiner = require('mailsplit').Joiner;
let Rewriter = require('mailsplit').Rewriter;
let splitter = new Splitter();
let joiner = new Joiner();
let rewriter = new Rewriter(node => node.contentType === 'text/html');
rewriter.on('node', data => {
// manage headers with node.headers
node.headers.add('X-Processed-Time', new Date.toISOString());
// do nothing, just reencode existing data
data.decoder.pipe(data.encoder);
});
// pipe a message source to splitter, then rewriter, then joiner and finally to stdout
someMessagStream
.pipe(splitter)
.pipe(rewriter)
.pipe(joiner)
.pipe(process.stdout);
Streamer
is a simple helper class to stream nodes that match a filter function. You can pipe a Splitter stream directly into a Streamer and pipe Streamer output to a Joiner.
Streamer takes the following argument:
filterFunc
returns trueOnce Streamer finds a matching node, it emits the following event:
data
data.node
includes the current node with headers (informational only, you can't modify it)data.decoder
is the decoder stream that you can read data fromdata.done
is a function you must call once you have processed the streamlet Splitter = require('mailsplit').Splitter;
let Joiner = require('mailsplit').Joiner;
let Streamer = require('mailsplit').Streamer;
let fs = require('fs');
let splitter = new Splitter();
let joiner = new Joiner();
let streamer = new Streamer(node => node.contentType === 'image/jpeg');
streamer.on('node', data => {
// write to file
data.decoder.pipe(fs.createWriteStream(data.node.filename || 'image.jpg'));
data.done();
});
// pipe a message source to splitter, then streamer, then joiner and finally to stdout
someMessagStream
.pipe(splitter)
.pipe(streamer)
.pipe(joiner)
.pipe(process.stdout);
Parsing and re-building messages is not fast but it isn't slow either. On my Macbook Pro I got around 22 MB/second (single process, single parsing queue) when parsing random messages from my own email archive. Time spent includes file calls to find and load random messages from disk.
Streaming 20000 random messages through a plain PassThrough
Done. 20000 messages [1244 MB] processed in 10.095 s. with average of 1981 messages/sec [123 MB/s]
Streaming 20000 random messages through Splitter and Joiner
Done. 20000 messages [1244 MB] processed in 55.882 s. with average of 358 messages/sec [22 MB/s]
Dual licensed under MIT or EUPLv1.1+
FAQs
Split email messages into an object stream
We found that mailsplit demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Product
We're excited to announce that Socket now supports the Java programming language.
Security News
Socket detected a malicious Python package impersonating a popular browser cookie library to steal passwords, screenshots, webcam images, and Discord tokens.
Security News
Deno 2.0 is now available with enhanced package management, full Node.js and npm compatibility, improved performance, and support for major JavaScript frameworks.