Mmap for Node.js
mmap(2) / madvise(2) / msync(2) / mincore(2) for node.js revisited.
I needed shared memory mapping and came across @bnoordhuis module node-mmap, only to find that it didn't work with later versions of io.js, node.js and compatibles. So out of need I threw this together along with the functionality I found was missing in the node-mmap: advice and sync.
Strong temptations to re-order arguments to something more sane was kept at bay, and I kept it as mmap(2) and node-mmap for compatibility. Notable difference is the additional optional argument to pass a usage advise in the mapping stage. I've given advise and sync more practical arguments, out of a node.js perspective, compared to their C/C++ counterparts.
The flag constants have crooked names from C/C++ retained in order to make it straight forward for the user to search the net, and relate to man-pages.
This is my first node.js addon and after hours wasted reading up on V8 API I luckily stumbled upon Native Abstractions for Node. Makes life so much easier. Hot tip!
mmap-io is written in C++11 and LiveScript — although I love LS, it's more prudent to use TypeScript for a library, so I've rewritten that code.
It should be noted that mem-mapping is by nature potentially blocking, and should not be used in concurrent serving/processing applications, but rather has it's niche where multiple processes are working on the same giant sets of data (thereby sparing physical memory, and load times if the kernel does it's job for read ahead), preferably multiple readers and single or none concurrent writer, to not spoil the gains by shitloads of spin-locks, mutexes or such. And your noble specific use case of course.
News and Updates
2019-07-09/B: version 1.1.3, ..., 1.1.6
- rewritten the C++-code to catch up with V8/Nan breaking changes for node.js 12.*, which also removes all warnings in earlier versions.
- refactored in to wrapper functions for extracting values, so should new breaking changes come in later versions, it will be quicker to adjust.
- major "package.json"-fuckups. Unless running build manually, the js-files where never packaged, nor built. Now whitelisted in package.json.
- major fuckup 2: when they finally where packaged, they we're killed off when the C++ module was rebuilt on installment. So. Finally: js-files are now built to "dist", and the binary into "build". This way the TypeScript and LiveScript aren't required for end user
2019-07-09/A: version 1.1.1
- when replacing GNU Make, for some reason I used
yarn
in "package.json" — which may have failed builds for those not having it installed (and then not building "es-release"), and completely missing the point of getting rid of Make - updated README to reflect new build command (
npm run build
) (should only ever be needed if you clone from git and contribute) - added back the "main" entry in package.json. Hell of a blunder! Tests changed to import from root so they fail without it.
- the never before tested example code here in the README, has been ran and corrected, thanks to @LiamKarlMitchell
2019-03-08: version 1.1.0
- rewrote the es part of the lib code from LiveScript to TypeScript. The
prudent thing to do in a lib. The test remains in LS.
offs_t
changed to size_t
because of bitwidth goofyness. Thanks to @bmarkiv- removed dependency on GNU Make by adding build commands to "package.json".
Might help those on windows platform who didn't have it installed. However
they still rely on a horde of "common posix utils", so your setup might be
lacking anyway then.
- _note that there are some compile warnings because of changes in C++ API's in
NAN/V8, ignored for now in contemplation whether to switch to node-addon-api
(napi for C++), since call overhead isn't an issue in this library, everything
considered.
2018-01-16: version 1.0.0
- bumped the version to 1.0.0 since, well, why not.
- changed deprecated calls from
ForceSet
to DefineOwnProperty
- general source noise cleanups
- fixed compilation errors for newer nodejs. Thanks to @djulien
- windows-specific problems (#5, #6), and more, fixed. Thanks to @bkmartinjr
- updated README clarifying Contribution Guidelines
2017-03-08: version 0.11.1
- compilation fixes 10.8 OSX fix. Thanks to @arrayjam
2016-07-21: version 0.10.1
incore
fix for Mac OS. Thanks to @rustyconover
2016-07-14: version 0.10.0
incore
added. Thanks to @rustyconover
2015-10-10: version 0.9.4
- Compilation on Mac should work now. Thanks to @aeickhoff
2015-10-01: version 0.9.3
- Windows compatibility added. Thanks to @toxicwolf
- Rewrote the bindings to Nan 2.0.9 API version (V8/io/Node hell...)
- Had to remove the error codes to get it working in the time I had
available (or rather - didn't have..) —
error messages are still there — with code in message instead. Though,
usually nothing goes wrong, so only the test cares ;-)
- Added some helpful targets in Makefile
human_errors
, ls
only, build
only, etc. (useful if you wanna hack the module) - Since all functionality that can possibly be is in place, I bumped all the
way to 0.8. Not battle tested enough to warrant higher.
- Commented away experimental async read-ahead caching when readahead hint was on. It
hasn't broken, but it's an unnecessary risk. Plays safe. You can toy with it
yourself if you want to try to milk out some ms performance.
2015-03-04: version 0.1.3
- This is the first public commit, and the code has one day of development put into it as of now. More tests are needed so don't count on it being production ready just yet (but soon).
Install
Use npm or git.
npm install mmap-io
git clone https://github.com/ozra/mmap-io.git
cd mmap-io
npm build
Usage
Note: All code in examples are in LiveScript
# Following code is plastic fruit; not t[ae]sted...
mmap = require "mmap-io"
fs = require "fs"
some-file = "./foo.bar"
fd = fs.open-sync some-file, "r"
fd-w = fs.open-sync some-file, "r+"
# In the following comments:
# - `[blah]` denotes optional argument
# - `foo = x` denotes default value for argument
size = fs.fstat-sync(fd).size
rx-prot = mmap.PROT_READ .|. mmap.PROT_EXECUTE
priv = mmap.MAP_SHARED
# map( size, protection, privacy, fd [, offset = 0 [, advise = 0]] ) -> Buffer
buffer = mmap.map size, rx-prot, priv, fd
buffer2 = mmap.map size, mmap.PROT_READ, priv, fd, 0, mmap.MADV_SEQUENTIAL
w-buffer = mmap.map size, mmap.PROT_WRITE, priv, fd-w
# advise( buffer, advise ) -> void
# advise( buffer, offset, length, advise ) -> void
mmap.advise w-buffer, mmap.MADV_RANDOM
# sync( buffer ) -> void
# sync( buffer, offset, length ) -> void
# sync( buffer, is-blocking-sync[, do-page-invalidation = false] ) -> void
# sync( buffer, offset = 0, length = buffer.length [, is-blocking-sync = false [, do-page-invalidation = false]] ) -> void
mmap.sync w-buffer
mmap.sync w-buffer, true
mmap.sync w-buffer, 0, size
mmap.sync w-buffer, 0, size, true
mmap.sync w-buffer, 0, size, true, false
# incore( buffer ) -> [ unmapped-pages Int, mapped-pages Int ]
core-stats = mmap.incore buffer
Good to Know (TM)
- Checkout man pages mmap(2), madvise(2), msync(2), mincore(2) for more detailed intell.
- The mappings are automatically unmapped when the buffer is garbage collected.
- Write-mappings need the fd to be opened with "r+", or you'll get a permission error (13).
- If you make a read-only mapping and then ignorantly set a value in the buffer, all hell previously unknown to a JS'er breaks loose (segmentation fault). It is possible to write some devilous code to intercept the SIGSEGV and throw an exception, but let's not do that!
Offset
, and in some cases length
needs to be a multiple of mmap-io.PAGESIZE (which commonly is 4096)- Huge pages are only supported for anonymous / private mappings (well, in Linux), so I didn't throw in flags for that since I found no use.
- As Ben Noordhuis previously has stated: Supplying hint for a fixed virtual memory adress is kinda moot point in JS, so not supported.
- If you miss a feature - contribute! Or request it in an issue.
- If documentation isn't clear, make an issue.
Contribution Guidelines
- Please create a new branch like
fix-{ISSUE_NUMBER}-human-readable-descr-here
or feature-this-and-that
for your work - PR using that branch
- Have a look at surrounding code for style and follow that
- 4 spaces indent
- 12 spaces initial indent for function bodies in the C++ code
- Yeah it was a "feel it out" of the moment, I'll leave it there.
- Please, when writing an issue or making a PR: tag my handle
@ozra
, so that
GitHub adds a notification to the "participating" list (less risk I miss it)
`
Tests
npm test