fasttext-node
A Node.js wrapper around the fastText library.
fastText is a library for efficient learning of word representations and sentence classification.
fastText builds on modern Mac OS and Linux distributions. Since it uses C++11 features, it requires a compiler with good C++11 support. These include:
Compilation is carried out using a Makefile, so you will need to have a working make. For the word-similarity evaluation script you will need:
This node module requires git and curl to be installed on your system. Installation will fail without these.
You can find the complete documentation of this module at https://jazzyarchitects.github.io/FastText/docs/FastText.html
To use this module in your code, import it directly:
const FastText = require('fasttext-node');
const fastText = new FastText( /* {} library configurations */);
The module exposes a train method which can be used to train a new model. The training methodology is supervised learning.
const trainFileUri = 'https://raw.githubusercontent.com/jazzyarchitects/fasttext-node/master/train.txt';
// `fastText` is the instance created above with `new FastText()`.
const trainResult = await fastText.train(trainFileUri, {
  /* options */
  epoch: 50,
  lr: 0.01
});
The first argument is the location of the training file. It can be a URL or a file path on the local machine.
The train function is asynchronous and resolves to true once training has finished.
The options argument is an object with the following properties:
Parameter | Type | Default Value | Description |
---|---|---|---|
epoch | number | 25 | Number of epochs |
lr | number | 0.1 | Learning rate |
lrUpdateRate | number | 100 | Change the rate of updates for the learning rate |
dim | number | 100 | Size of word vectors |
ws | number | 5 | Size of the context window |
neg | number | 5 | Number of negatives sampled |
wordNgrams | number | 2 | Max length of word ngram |
loss | enum | 'ns' | Loss function. Must be one of 'ns', 'hs', or 'softmax' |
thread | number | 12 | Number of threads |
model | string | training-model | The path to save the model |
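The defaults in the table above can be mirrored in your own code when you want to see the full set of options a training run will use. The sketch below is a hypothetical helper (`buildTrainOptions` and the constant names are not part of fasttext-node), and it assumes the module accepts exactly the properties listed in the table. Whether the module validates `loss` itself is not stated here, so the check is done on the caller's side.

```javascript
// Hypothetical helper, not part of fasttext-node: fills in the defaults
// documented in the options table and rejects an unknown loss function.
const TRAIN_DEFAULTS = Object.freeze({
  epoch: 25,
  lr: 0.1,
  lrUpdateRate: 100,
  dim: 100,
  ws: 5,
  neg: 5,
  wordNgrams: 2,
  loss: 'ns',
  thread: 12,
  model: 'training-model',
});

const LOSS_FUNCTIONS = ['ns', 'hs', 'softmax'];

function buildTrainOptions(overrides = {}) {
  // Caller-supplied values win over the documented defaults.
  const options = { ...TRAIN_DEFAULTS, ...overrides };
  if (!LOSS_FUNCTIONS.includes(options.loss)) {
    throw new RangeError(`loss must be one of: ${LOSS_FUNCTIONS.join(', ')}`);
  }
  return options;
}

// Override only the two values used in the training example.
const trainOptions = buildTrainOptions({ epoch: 50, lr: 0.01 });
console.log(trainOptions.epoch, trainOptions.lr, trainOptions.dim); // 50 0.01 100
```

The resulting object can be passed as the second argument to train in place of a hand-written literal.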
After the training has finished, the model can be used to predict the labels of new strings.
const options = {
  labelCount: 3
};
// Pass the inputs as an array...
const result = await fastText.predict([
  'Custard Pudding tasting like raw eggs',
  'Is Himalayan pink salt the same as the pink salt used for curing?',
], options);
// OR
// ...or as a single string with one input per line.
const result = await fastText.predict(`
Custard Pudding tasting like raw eggs
Is Himalayan pink salt the same as the pink salt used for curing?`,
  options
);
The predict function returns an array containing one prediction object per input. Inputs can be passed either as an array of strings or as a single string with one input per line.
The second argument to the predict function is an object with the following options:
Parameter | Type | Default Value | Description |
---|---|---|---|
labelCount | number | 3 | Number of labels to return per input string |
model | string | training-model | Path of the model to use for prediction. Do NOT include the model file extension (.bin or .vec) |
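Since predict accepts either an array or a newline-separated string, it can help to normalize both forms before logging or batching inputs. The following is a minimal sketch of such a helper for your own code. `toInputLines` is a hypothetical name, and the trimming and blank-line handling are assumptions, not a description of how the module parses its input internally.

```javascript
// Hypothetical helper, not part of fasttext-node: converts either accepted
// input form (array or multi-line string) into a clean array of lines.
function toInputLines(input) {
  const lines = Array.isArray(input) ? input : String(input).split(/\r?\n/);
  // Trim surrounding whitespace and drop blank lines (assumed behavior).
  return lines.map((line) => line.trim()).filter((line) => line.length > 0);
}

const fromString = toInputLines(`
Custard Pudding tasting like raw eggs
Is Himalayan pink salt the same as the pink salt used for curing?`);
console.log(fromString.length); // 2
```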
Example output:
[
{
"input": "Custard Pudding tasting like raw eggs",
"predictions":{
"eggs": 0.607422,
"egg-whites": 0.00390627,
"frying": 0.00390627
}
},
{
"input": "Is Himalayan pink salt the same as the pink salt used for curing?",
"predictions": {
"salt": 0.166016,
"flavor": 0.0136719,
"language": 0.0117188
}
}
]
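If you only need the single most likely label for each input, the result array can be reduced with a few lines of code. This sketch assumes the result has exactly the shape shown in the example output (an `input` string plus a `predictions` object mapping labels to scores). `topLabels` is a hypothetical helper and not part of the module.

```javascript
// Hypothetical helper, not part of fasttext-node: picks the highest-scoring
// label for each entry of a predict() result.
function topLabels(results) {
  return results.map(({ input, predictions }) => {
    let best = { label: null, score: -Infinity };
    for (const [label, score] of Object.entries(predictions)) {
      if (score > best.score) best = { label, score };
    }
    // An empty predictions object yields label and score of null.
    return { input, label: best.label, score: best.label === null ? null : best.score };
  });
}

// Data copied from the example output.
const sampleResult = [
  {
    input: 'Custard Pudding tasting like raw eggs',
    predictions: { eggs: 0.607422, 'egg-whites': 0.00390627, frying: 0.00390627 },
  },
  {
    input: 'Is Himalayan pink salt the same as the pink salt used for curing?',
    predictions: { salt: 0.166016, flavor: 0.0136719, language: 0.0117188 },
  },
];

for (const { label, score } of topLabels(sampleResult)) {
  console.log(label, score);
}
// eggs 0.607422
// salt 0.166016
```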
Each line of the training file should follow this format:
__label__food-safety __label__beans How long can I soak dried beans before they are considered inedible?
Each label must be prefixed with '__label__' (two underscores on each side of the word label). The labels come first on the line, followed by the text they apply to.
A single line can carry multiple labels.
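When the training data lives in another form (a database export, a JSON file), the lines can be generated programmatically. The sketch below shows one way to write and read the format described above. `formatTrainingLine` and `parseTrainingLine` are hypothetical helpers, not part of the module, and they assume that labels contain no whitespace and that each example occupies exactly one line.

```javascript
// Hypothetical helpers, not part of fasttext-node.
const LABEL_PREFIX = '__label__';

// Builds one training line: prefixed labels first, then the text.
function formatTrainingLine(labels, text) {
  if (labels.length === 0) {
    throw new Error('at least one label is required');
  }
  const prefix = labels.map((label) => `${LABEL_PREFIX}${label}`).join(' ');
  // Collapse internal whitespace and newlines so the example stays on one line.
  return `${prefix} ${text.replace(/\s+/g, ' ').trim()}`;
}

// Splits a training line back into its labels and text.
function parseTrainingLine(line) {
  const tokens = line.trim().split(/\s+/);
  const labels = [];
  while (tokens.length > 0 && tokens[0].startsWith(LABEL_PREFIX)) {
    labels.push(tokens.shift().slice(LABEL_PREFIX.length));
  }
  return { labels, text: tokens.join(' ') };
}

const trainingLine = formatTrainingLine(
  ['food-safety', 'beans'],
  'How long can I soak dried beans before they are considered inedible?'
);
console.log(trainingLine);
console.log(parseTrainingLine(trainingLine).labels); // [ 'food-safety', 'beans' ]
```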
MIT License
Copyright (c) 2017 Call-Em-All
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.