What is avsc?
The avsc npm package is a library for encoding and decoding data in the Avro serialization format. It provides tools for working with Avro schemas, serializing and deserializing data, and performing schema evolution.
What are avsc's main functionalities?
Schema Definition
This feature allows you to define Avro schemas using JSON. The code sample demonstrates how to define a simple Avro schema for a record with 'name' and 'age' fields.
const avro = require('avsc');
const type = avro.Type.forSchema({
type: 'record',
fields: [
{name: 'name', type: 'string'},
{name: 'age', type: 'int'}
]
});
Serialization
This feature allows you to serialize JavaScript objects into Avro binary format. The code sample shows how to serialize an object with 'name' and 'age' fields into a buffer.
const avro = require('avsc');
const type = avro.Type.forSchema({
type: 'record',
fields: [
{name: 'name', type: 'string'},
{name: 'age', type: 'int'}
]
});
const buf = type.toBuffer({name: 'John Doe', age: 30});
Deserialization
This feature allows you to deserialize Avro binary data back into JavaScript objects. The code sample demonstrates how to deserialize a buffer back into an object.
const avro = require('avsc');
const type = avro.Type.forSchema({
type: 'record',
fields: [
{name: 'name', type: 'string'},
{name: 'age', type: 'int'}
]
});
const buf = type.toBuffer({name: 'John Doe', age: 30});
const obj = type.fromBuffer(buf);
Schema Evolution
This feature supports schema evolution, allowing you to read data written with an older schema using a newer schema. The code sample shows how to evolve a schema by adding a new field with a default value.
const avro = require('avsc');
const oldType = avro.Type.forSchema({
type: 'record',
fields: [
{name: 'name', type: 'string'}
]
});
const newType = avro.Type.forSchema({
type: 'record',
fields: [
{name: 'name', type: 'string'},
{name: 'age', type: 'int', 'default': 0}
]
});
const buf = oldType.toBuffer({name: 'John Doe'});
const obj = newType.fromBuffer(buf);
Other packages similar to avsc
avro-js
avro-js is another library for working with Avro data in JavaScript. It provides similar functionality for schema definition, serialization, and deserialization. However, avro-js is generally considered to be less performant compared to avsc.
node-avro-io
node-avro-io is a library for Avro serialization and deserialization in Node.js. It offers similar features to avsc but is less actively maintained and has fewer features related to schema evolution.
Avsc
Pure JavaScript implementation of the Avro
specification.
Features
Performance
Representative throughput rates (higher is better):
Libraries compared:
node-avsc
, this package.node-json
, built-in JSON serializer.node-protobuf
, most popular
Protocol Buffers implementation.node-pson
, alternative to JSON.node-msgpack
, official
MessagePack implementation.
These rates are for processing a realistic record schema,
modeled after a popular open-source API. You can find the raw numbers and more
details on the benchmarks page.
Installation
$ npm install avsc
avsc
is compatible with all versions of node.js since 0.11
and all major
browsers via browserify.
Documentation
Examples
Inside a node.js module, or using browserify:
var avsc = require('avsc');
-
Encode and decode values:
var type = avsc.parse({
name: 'Pet',
type: 'record',
fields: [
{name: 'kind', type: {name: 'Kind', type: 'enum', symbols: ['CAT', 'DOG']}},
{name: 'name', type: 'string'}
]
});
var buf = type.toBuffer({kind: 'CAT', name: 'Albert'});
var val = type.fromBuffer(buf);
-
Check whether a value fits a given schema:
var type = avsc.parse('./Person.avsc');
var person = {name: 'Bob', address: {city: 'Cambridge', zip: '02139'}};
var status = type.isValid(person);
-
Get a readable stream of decoded values from an Avro
container file:
avsc.createFileDecoder('./values.avro')
.on('metadata', function (type) { })
.on('data', function (val) { });
-
Respond to remote procedure calls over TCP:
var protocol = avsc.parse('./Ping.avpr')
.on('ping', function (req, ee, cb) { cb(null, 'pong'); });
require('net').createServer()
.on('connection', function (con) { protocol.createListener(con); })
.listen(8000);