
Security News
Axios Maintainer Confirms Social Engineering Attack Behind npm Compromise
Axios compromise traced to social engineering, showing how attacks on maintainers can bypass controls and expose the broader software supply chain.
rwkv-tokenizer-node
Advanced tools
0 dependency tokenizer for the RWKV project
Should also work for EleutherAI neox and pythia, as they use the same tokenizer
npm i rwkv-tokenizer-node
const tokenizer = require("RWKV-tokenizer-node");
// Encode into token int : [12092, 3645, 2]
const tokens = tokenizer.encode("Hello World!");
// Decode back to "Hello World!"
const decoded = tokenizer.decode(tokens);
Its primary purpose is for use in implementing RWKV-cpp-node , though it could probably be used for other use cases (eg. pure-JS implementaiton of gpt-neox or RWKV)
PS: Anyone who has any ideas on how to improve its performance, while not failing the test suite, is welcomed to do so.
# This run the sole test file test/tokenizer.test.js
npm run test
The python script used to seed the refence data (using huggingface tokenizer) is found at test/build-test-token-json.py This test includes a very extensive UTF-8 test file covering all major (and many minor) languages
@picocreator - is the current maintainer of the project, ping him on the RWKV discord if you have any questions on this project
@saharNooby - which the current implementation is heavily based on
@cztomsik @josephrocca @BlinkDL - for their various implementation, which is used as refence to squash out mismatching encoding with HF implementation.
FAQs
RWKV / gpt-NeoX / Pythia, 0-dep tokenizer library, for nodejs
We found that rwkv-tokenizer-node demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Axios compromise traced to social engineering, showing how attacks on maintainers can bypass controls and expose the broader software supply chain.

Security News
Node.js has paused its bug bounty program after funding ended, removing payouts for vulnerability reports but keeping its security process unchanged.

Security News
The Axios compromise shows how time-dependent dependency resolution makes exposure harder to detect and contain.