Security News
Fluent Assertions Faces Backlash After Abandoning Open Source Licensing
Fluent Assertions is facing backlash after dropping the Apache license for a commercial model, leaving users blindsided and questioning contributor rights.
documentdb-utils
Advanced tools
Drop-in replacement+extensions for Azure's DocumentDB node.js client with auto-retry on 429 errors plus a lot more
Copyright (c) 2015, Lawrence S. Maccherone, Jr.
Drop-in replacement + extensions for DocumentDB node.js client with auto-retry on 429 errors plus a lot more
By providing functionality like automatic retries on 429 errors among other things, documentdb-utils makes it much easier to use Microsoft Azure DocumentDB from node.js.
Note, versions prior to 0.4.0 had a very different interface exposed as the function documentDBUtils(). That has now been removed in favor of this drop-in replacement + extensions approach.
All methods and functionality of the Microsoft Azure node.js client with the exact same signature so you can drop this into your code and get an instantaneos upgrade.
Automatically handles 429 throttling responses by retrying after the delay specified by the prior operation.
Sproc execution continuation. Automatically deals with early termination of stored procedures for exceeding resources. Just follow a simple pattern (memoization like that used in a reduce function) for writing your stored procedures. The state will be shipped back to the calling side and the stored procedure will be called again picking back up right where it left off. I want to upgrade this to serialize a sproc execution continuation in a document or attachment so we don't have to ship the state back and forth. The state for a call to documentdb-lumenize/cube can get pretty large.
All query<entity>s
methods now accept MongoDB-like queries which are atuomatically converted to SQL by sql-from-mongo. You can still send in strings or objects with parameters as before, but it senses that the query is a MongoDB like query and acts accordingly. It's much easier to compose queries like {value: 1}
than SELECT * FROM c WHERE value = 1
, IMHO.
<old-method>Array(..., callback)
as short-hand for .toArray()
calls. Example: readDocumentsArray(collectionLink, callback)
. This alone makes it easier to use higher order async functions, but I'm not done yet.
<old-method>Multi(linkArray, ..., callback)
and <old-method>ArrayMulti(linkArray, ..., callback)
as automatic fan-out to multiple collections, sprocs, etc. for each method whose first parameter is a link. If you want to run the same query against multiple collections or call a sproc by the same name in different collections, you can now do that with one line. The results are automatically aggregated into a single callback response. Example: executeStoredProcedureMulti(arrayOfCollecitonLinks, 'countDocuments', callback)
.
<old-method>AsyncJSIterator(item, callback)
wrapper of methods to enable use of async.js's higher order functions like map, filter, etc. This is used internally to provide the multi-link capability but you can use it yourself to compose your own. See test code for examples.
Aggregated stats on the number of round trips, request unit costs, retries, time lost to retries as well as total execution time for operations.
link and link array generator. Example: getLinkArray('myDB', [1, 2], 'mySproc')
results in ['dbs/myDB/colls/1/sprocs/mySproc', 'dbs/myDB/colls/2/sprocs/mySproc']
expandScript functionality allows you to "require" npm modules from within your stored procedures as well as DRY for utility functions in your sprocs
loadSprocs/loadUDFs automatically expands and loads every sproc/UDF in a directory to a list of collections
countDocuments, createSpecificDocuments, createVariedDocuments, deleteSomeDocuments, updateSomeDocuments sprocs to use as-is or as a starting point for your own sprocs
getUnderscore mixin for sprocs which returns an object that implements the underscore.js API (couldn't get lodash to work). Yes, this works inside of sprocs.
getAsync mixin for sprocs which returns an object that implements the async.js API. Again, you can use this from sprocs.
sql-from-mongo exported as sqlFromMongo
documentdb.Base.generateGuidId exported and aliased as getGUID in addition to generateGuidId
npm install -save documentdb-utils
If you put your urlConnection and masterkey in DOCUMENT_DB_URL and DOCUMENT_DB_KEY environment variables (recommended), you can simply new WrappedClient()
with no parameters.
{WrappedClient} = require('documentdb-utils')
client = new WrappedClient()
If you want to use environment variables for the first two parameters but specify a connectionPolicy and/or consistencyLevel, then simply pass in null for the first two parameters.
{WrappedClient} = require('documentdb-utils')
client = new WrappedClient(null, null, connectionPolicy, consistencyLevel)
You can also pass in the same parameters as the Azure client (urlConnection, auth, connectionPolicy, consistencyLevel) with the last two being optional.
{WrappedClient} = require('documentdb-utils')
client = new WrappedClient(urlConnection, auth, connectionPolicy, consistencyLevel)
Alternatively, if you've already created your own instance of the Azure DocumentClient, you can pass that in as the only parameter. Be sure to set your connectionPolicy, and consistencyLevel on the original client.
{WrappedClient} = require('documentdb-utils')
{DocumentClient} = require('documentdb')
_client = new DocumentClient(urlConnection, auth)
client = new WrappedClient(_client)
Now, you can use the same methods you would on DocumentClient except they are upgraded to have automatic 429 error delay/retry functionality. Further, there are some methods that are added that make it easier to use DocumentDB especially when you have multiple partitions.
Let's say you wrote this little stored procedure and saved it in hello.coffee.
module.exports = () ->
getContext().getResponse().setBody('Hello world!')
or if you prefer JavaScript saved in hello.js.
module.exports = function () {
getContext().getResponse().setBody('Hello world!');
}
Now let's write some CoffeeScript to send and execute this on two different collections:
hello = require('../sprocs/hello')
collectionLinks = getLinkArray('db1', ['coll1', 'coll2'])
sprocSpec = {id: 'hello', body: hello}
wrappedClient.upsertStoredProcedureMulti(collectionLinks, sprocSpec, (err, result, stats) ->
sprocLinks = getLinkArray(collectionLinks, 'hello')
wrappedClient.executeStoredProcedureMulti(sprocLinks, (err, result, stats) ->
console.log(result) # The output of the executions as an array
console.log(stats) # Cumulative RUs, round trips, and other stats
)
)
And in JavaScript:
var collectionLinks, hello, sprocSpec;
hello = require('../sprocs/hello');
collectionLinks = getLinkArray('db1', ['coll1', 'coll2']);
sprocSpec = {id: 'hello', body: hello};
wrappedClient.upsertStoredProcedureMulti(collectionLinks, sprocSpec, function(err, result, stats) {
var sprocLinks;
sprocLinks = getLinkArray(collectionLinks, 'hello');
return wrappedClient.executeStoredProcedureMulti(sprocLinks, function(err, result, stats) {
console.log(result);
return console.log(stats);
});
});
Look in the test directory for more examples including wrapped methods that are tailor-made for use with async.js.
In the example above we used getLinkArray to create an array of two collection links. We then used that in another call to getLinkArray to create an array of two sproc links. However, it can do a lot more than that.
Example | Output |
---|---|
getLink('myDB') | "dbs/myDB" |
getLink('I', 'A', 1) | "dbs/I/colls/A/sprocs/1" |
getLink(1, {users: 'myUser'}) | "dbs/1/users/myUser" |
collLink = getLink('myDB', 'myColl') sprocLink = getLink(collLink, 'mySproc') | "dbs/myDB/colls/myColl" "dbs/myDB/colls/myColl/sprocs/mySproc" |
getDocLink(collLink, 'myDoc') | "dbs/myDB/colls/myColl/docs/myDoc" |
getAttachmentLink('I', 'A', '1', 'myAtt') | "dbs/I/colls/A/docs/1/attachments/myAtt" |
getLinkArray(['db1', 'db2'], ['col1', 'col2']) | [ "dbs/db1/colls/col1", "dbs/db1/colls/col2", "dbs/db2/colls/col1", "dbs/db2/colls/col2" ] |
dbLinks = getLinkArray(['db1', 'db2']) collLinks = getLinkArray(dbLinks, 'myColl') | [ "dbs/db1/colls/myColl", "dbs/db1/colls/myColl" ] |
getLinkArray(['db1', 'db2'], {users: 'Joe'}) | [ "dbs/db1/users/Joe", "dbs/db2/users/Joe" ] |
I think you get the idea but there are a bunch of other combinations. Note the default for getLink and getLinkArray is dbs/colls/sprocs. The default for getDocLink and getAttachmentsLink is dbs/colls/docs/attachments. You can get multiple document and attachment links by using the {docs: 'myDoc'}
parameter format in a getLinkArray call but I use GUIDs almost all the time so I never have the same id in multiple collections.
Let's say you have this CoffeeScript source code in file sprocToExpand.coffee
module.exports = () ->
x = 1
mixinToInsert = require('../test-examples/mixinToInsert')
mixinToInsert2 = require('../test-examples/mixinToInsert2')
y = 2
This is in mixinToInsert.coffee
:
module.exports = () ->
return 3
This is in mixinToInsert2.coffee
:
f1 = () ->
mixinToInsert3 = require('../test-examples/mixinToInsert3')
return 300
f2 = (x) ->
return x * 2
module.exports = {f1, f2}
And last, this is in mixinToInsert3.coffee
module.exports = () ->
return 3000
The expandScript function will take the above and produce:
function () {
var mixinToInsert, mixinToInsert2, x, y;
x = 1;
mixinToInsert = function () {
return 3;
};
mixinToInsert2 = {
f1: function () {
var mixinToInsert3;
mixinToInsert3 = function () {
return 3000;
};
return 300;
},
f2: function (x) {
return x * 2;
}
};
return y = 2;
}
A few things to keep in mind when trying to use expandScript:
myVar = require('myPackage')
or b) myVar = require('myPackage').some.lower.reference
. If you use the first form, it will import everything so if you just want one function of a library, use the second form.require()
from within your CoffeeScript stub.Many npm packages work out of the box, but others need some cleanup before they'll work. It's just doing string manipulation after all. However, it does give you an oportunity to modularlize your code and use some npm packages in your sprocs and UDFs.
I've included mixins for these popular JavaScript utility packages that you can require()
and use when writing sprocs. See the test directory for examples. Normally I run lodash instead of underscore but I couldn't get it to run in a sproc after a bit of trying. Maybe next time.
I use lodash
(exposed as _
), async
, sqlFromMongo
, and generateGuidId
(aliased also as getGUID
) in my own code all the time and they are dependencies of documentdb-utils so I've exported them for your convenience. It will save you the effort of npm install
ing them and help avoid having multiple versions of the same depencies in your project.
I've included a number of example sprocs, UDFs, and mixins that you can use as-is or as starting points for your own. See the next section down for tips on writing sprocs that work well with the continuation functionality of WrappedClient.executeStoredProcedure.
The key to a general pattern for writing restartable stored procedures is to write them as if you were writing a reduce() function.
Note, if you follow this pattern, the upgraded executeStoredProcedure()
automatically deals with early termination of stored procedures for exceeding resources. Just follow this simple pattern. The state will be shipped back to the calling side and the stored procedure will be called again picking back up right where it left off.
Pehaps the most common use of stored procedures is to aggregate or transform data. It's very easy to think of these as "reduce" operations just like Array.reduce() or the reduce implementations in underscore, async.js, or just about any other library with aggregation functionality. It stretches the "reduce" metaphore a bit, but the pattern itself is perfectly usefull even for stored procedures that write data.
So:
memo
. Put any parameters for your sproc in that Object (e.g. `{parameter1: "my Value", parameter2: 100}memo
on the initial call.memo
object.continuation
field returned by readDocuments and queryDocuments into memo.continuation
. If you are doing only creates, updates, and deletes or even a set of readDocument() calls within your sproc and you want it to pause and resume for some reason, then set continuation
manually (value doesn't matter).memo
object.getContext().getResponse().setBody(memo)
regulary, particularly right after you kick off a collection operation (readDocuments, createDocument, etc.).false
is returned from the most recent call to a collection operation, don't issue any more async calls. Rather, perform any aggregations and wrap up the stored procedure quickly.Here is an example of a stored procedure that counts all the documents in a collection with an option to filter based upon provided filterQuery field in the initial memo. The source for this is included in this repository.
count = (memo) ->
collection = getContext().getCollection()
unless memo?
memo = {}
unless memo.count?
memo.count = 0
unless memo.continuation?
memo.continuation = null
memo.stillQueueing = true
query = (responseOptions) ->
if memo.stillQueueing
responseOptions =
continuation: memo.continuation
pageSize: 1000
if memo.filterQuery?
memo.stillQueueing = collection.queryDocuments(collection.getSelfLink(), memo.filterQuery, responseOptions, onReadDocuments)
else
memo.stillQueueing = collection.readDocuments(collection.getSelfLink(), responseOptions, onReadDocuments)
setBody()
onReadDocuments = (err, resources, options) ->
if err
throw err
count = resources.length
memo.count += count
if options.continuation?
memo.continuation = options.continuation
query()
else
memo.continuation = null
setBody()
setBody = () ->
getContext().getResponse().setBody(memo)
query()
exports.count = count
myVar = require('some_package').some.other.reference
cake compile
and cake clean
so the .js files are uploaded to npmrequire()
other bits of code or even appropriate npm modules
directly in your sprocs as well as automatic loading of all stored procedures in a given directory. This additional utility was
implemented as part of other projects but it's generally useful so it's now been moved here.stillQueueing
field in the body. This is just the last recorded value returned from
a collection operation. It's backward breaking because the key field is now
stillQueueing
whereas it was previously stillQueueingOperations
.cake publish
Because Microsoft uses JSDoc for its library, I've decided to use it also. that said, I don't yet have any documentation generation in place. That's pretty high on my list to do myself but it's also a good candidate for pull requests if anyone wants to help. Use this approach to document the CoffeeScript.
###*
# Sets the language and redraws the UI.
# @param {object} data Object with `language` property
# @param {string} data.language Language code
###
handleLanguageSet: (data) ->
outputs
/**
* Sets the language and redraws the UI.
* @param {object} data Object with `language` property
* @param {string} data.language Language code
*/
handleLanguageSet: function(data) {}
I use the relatively simplistic documentdb-mock for writing automated tests for my own stored procedures and I regularly exercise documentDBUtils in the course of running those stored procedures. I also have done extensive exploratory testing on DocumentDB's behavior using documentDBUtils... even finding some edge cases in DocumentDB's behavior. :-) However, you cannot run DoucmentDB locally and I don't have the patience to fully mock it out so there are currently less than full test coverage.
Copyright (c) 2015, 2016 Lawrence S. Maccherone, Jr.
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
FAQs
Drop-in replacement+extensions for Azure's DocumentDB node.js client with auto-retry on 429 errors plus a lot more
The npm package documentdb-utils receives a total of 35 weekly downloads. As such, documentdb-utils popularity was classified as not popular.
We found that documentdb-utils demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Fluent Assertions is facing backlash after dropping the Apache license for a commercial model, leaving users blindsided and questioning contributor rights.
Research
Security News
Socket researchers uncover the risks of a malicious Python package targeting Discord developers.
Security News
The UK is proposing a bold ban on ransomware payments by public entities to disrupt cybercrime, protect critical services, and lead global cybersecurity efforts.