Pond.js is a library build on top of immutable.js to provide time-based data structures, serialization and processing within our tools.
For data structures it unifies the use of time ranges, events and collections and time series. For processing it provides a chained pipeline interface to aggregate, collect and process batches or streams of events.
We are still developing Pond.js as it integrates further into our code, so it may change or be incomplete in parts. That said, it has a growing collection of tests and we will strive not to break those without careful consideration.
See the CHANGES.md document for version updates.
Rational
ESnet runs a large research network for the US Department of Energy. Our tools consume events and time series data throughout our network visualization applications and data processing chains. As our tool set grew, so did our need to build a Javascript library to work with this type of data that was consistent and dependable. The alternative for us has been to pass ad-hoc data structures between the server and the client, making all elements of the system much more complicated. Not only do we need to deal with different formats at all layers of the system, we also repeat our processing code over and over. Pond.js was built to address these pain points.
The result might be as simple as comparing two time ranges:
const timerange = timerange1.intersection(timerange2);
timerange.asRelativeString();
Or finding the average value in a timeseries:
timeseries.avg("sensor");
Or much higher level stream processing:
Pipeline()
.from(input)
.windowBy("1h")
.emitOn("eachEvent")
.aggregate({in: avg, out: avg})
.to(EventOut, event => {
result[`${event.index()}`] = event;
});
input.addEvents(incomingEvents);
What does it do?
Pond has three main goals:
- provide a robust set of time-related data structures, built on Immutable.js
- provide serialization of these structures for transmission across the wire
- provide processing operations to work with those structures
Here is a summary of what is provided:
- TimeRange - a begin and end time, packaged together.
- Index - A time range denoted by a string, for example "5m-1234" is a specific 5 minute time range, or "2014-09" is September 2014.
- Event - A timestamp and a data object packaged together.
- IndexedEvents - An Index and a data object packaged together. e.g. 1hr max value.
- TimeRangeEvents - A TimeRange and a data object packaged together. e.g. outage event occurred from 9:10am until 10:15am.
And forming together collections of events:
- Collection - A bag of Events
- TimeSeries - An ordered Collection of Events and associated meta data
And then high level processing via Event pipelines:
- Pipeline - Stream or batch processing of Events. Supports windowing, grouping and aggregation. Still under development.
Getting started
Pond will run in Node or in the browser (ideally via webpack).
Install from npm:
npm install pondjs --save
To explore via the node REPL:
node
> const pond = require("./lib/entry.js");
For further information see the Getting started guide.
Contributing
Read the contribution guidelines.
The library has Mocha tests. To run the tests interactively, use:
npm run start-tester
Then point your browser to:
http://localhost:9500/webpack-dev-server/tests
Or to run the tests (and linting) on the command line:
npm test
License
This code is distributed under a BSD style license, see the LICENSE file for complete information.
Copyright
ESnet Timeseries Library ("Pond.js"), Copyright (c) 2015, The Regents of the University of California, through Lawrence Berkeley National Laboratory (subject to receipt of any required approvals from the U.S. Dept. of Energy). All rights reserved.
If you have questions about your rights to use or distribute this software, please contact Berkeley Lab's Innovation & Partnerships Office at IPO@lbl.gov.
NOTICE. This software is owned by the U.S. Department of Energy. As such, the U.S. Government has been granted for itself and others acting on its behalf a paid-up, nonexclusive, irrevocable, worldwide license in the Software to reproduce, prepare derivative works, and perform publicly and display publicly. Beginning five (5) years after the date permission to assert copyright is obtained from the U.S. Department of Energy, and subject to any subsequent five (5) year renewals, the U.S. Government is granted for itself and others acting on its behalf a paid-up, nonexclusive, irrevocable, worldwide license in the Software to reproduce, prepare derivative works, distribute copies to the public, perform publicly and display publicly, and to permit others to do so.