Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

softmax

Package Overview
Dependencies
Maintainers
1
Versions
14
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

softmax

A softmax multi-armed bandit algorithm

  • 0.5.0
  • Source
  • npm
  • Socket score

Version published
Maintainers
1
Created
Source

softmax

Build Status

A softmax algorithm for multi-armed bandit problems

This implementation is based on Bandit Algorithms for Website Optimization and related empirical research in "Algorithms for the multi-armed bandit problem".

Specification

This module conforms to the BanditLab/1.0 specification.

Quick start

First, install this module in your project:

npm install softmax --save

Then, use the algorithm:

  1. Create an optimizer with 3 arms and default annealing:

    var Algorithm = require('softmax');
    
    var algorithm = new Algorithm({
      arms: 3
    });
    
  2. Select an arm (for exploration or exploitation, according to the algorithm):

    algorithm.select().then(function (arm) {
      ...
    });
    
  3. Report the reward earned from a chosen arm:

    algorithm.reward(armId, value).then(function (n) {
      ...
    });
    

API

Algorithm([config])

Create a new optimization algorithm.

Arguments

  • config (Object, Optional): algorithm instance parameters

The config object supports three parameters:

  • arms: (Number:Integer, Optional), default=2, the number of arms over which the optimization will operate
  • gamma: the annealing (cooling) factor – defaults to 1e-7 (0.0000001)
  • tau: the temperature (scaling) factor – 0 to Infinity, higher leads to more exploration

By default, gamma of 1e-7 will cause the algorithm to explore less as more information is received. In this case, the underlying "temperature" is changing. If this behavior is not desired, set tau to instead employ an algorithm with a fixed temperature. If tau is provided then gamma is ignored.

Returns

An instance of the softmax optimization algorithm.

Example

> var Algorithm = require('softmax');
> var algorithm = new Algorithm();
> assert.equal(algorithm.arms, 3);
> assert.equal(algorithm.gamma, 0.0000001);

Or, with a passed config:

> var Algorithm = require('softmax');
> var algorithm = new Algorithm({arms: 4, tau: 0.000005});
> assert.equal(algorithm.arms, 4);
> assert.equal(algorithm.tau, 0.000005);
Algorithm#select()

Choose an arm to play, according to the specified bandit algorithm.

Arguments

None

Returns

A promise that resolves to a Number corresponding to the associated arm index.

Example

> var Algorithm = require('softmax');
> var algorithm = new Algorithm();
> algorithm.select().then(function (arm) { console.log(arm); });

0
Algorithm#reward(arm, reward)

Inform the algorithm about the payoff from a given arm.

Arguments

  • arm (Integer): the arm index (provided from algorithm.select())
  • reward (Number): the observed reward value (which can be 0, to indicate no reward)

Returns

A promise that resolves to a Number representing the count of observed rounds.

Example

> var Algorithm = require('softmax');
> var algorithm = new Algorithm();
> algorithm.reward(0, 1).then(function (n) { console.log(n); });

1
Algorithm#serialize()

Obtain a plain object representing the internal state of the algorithm.

Arguments

None

Returns

A promise that resolves to an Object representing parameters required to reconstruct algorithm state.

Example

> var Algorithm = require('softmax');
> var algorithm = new Algorithm();
> algorithm.serialize().then(function (state) { console.log(state); });

{
  arms: 2,
  gamma: 0.0000001,
  counts: [0, 0],
  values: [0, 0]
}
Algorithm#load(state)

Restore an instance of an algorithm to a previously serialized state. This method overrides any options parameters passed at instantiation.

Arguments

  • state (Object): a serialized algorithm state (provided from algorithm.serialize())

Returns

A promise that resolves to a Number representing the count of observed rounds.

Example

> var state = {arms: 2, gamma: 0.0000001, counts: [1, 2], values: [1, 0.5]};
> var Algorithm = require('softmax');
> var algorithm = new Algorithm();
> algorithm.load(state).then(function (n) { console.log(n); });

3

Tests

To run the unit test suite:

npm test

Or, to run the test suite and view test coverage:

npm run coverage

Note: tests against stochastic methods (e.g. algorithm.select()) are inherently tricky to test with deterministic assertions. The approach here is to iterate across a semi-random set of conditions to verify that each run produces valid output. So, strictly speaking, each call to npm test is executing a slightly different test suite. At some point, the test suite may be expanded to include a more robust test of the distribution's properties – though because of the number of runs required, would be triggered with an optional flag.

Contribute

PRs are welcome! For bugs, please include a failing test which passes when your PR is applied. Travis CI provides on-demand testing for commits and pull requests.

Caveat emptor

This implementation relies on the native Math.random() which uses a seeded "random" number generator. In addition, the underlying calculations often encounter extended floating point numbers. Arm selection is therefore subject to JavaScript's floating point precision limitations. For general information about floating point issues see the floating point guide.

While these factors generally do not impede common application, I would consider the implementation suspect in an academic setting.

Keywords

FAQs

Package last updated on 15 Feb 2016

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc