perceptual
Have you ever encountered a video or some music that was too loud and moved the volume slider down halfway, only to discover it was still basically as loud as before you moved the slider? The most common approach for developers building these sliders is to take the ratio of how far along the slider moved and apply that precise ratio as a multiplier on the sound's amplitude. Surprisingly, this approach doesn't actually mirror how we perceive loudness. Enter perceptual
.
Convert volume slider fractions to amplitudes and nothing more.
Theory
This is a utility to help make our volume controls feel more natural. Our hearing follows a logarithmic scale. We perceive less difference between loud sounds than we do between soft sounds. We want our volume controls to mirror how we perceive sound.
In order to make our controls more natural, we'll use an exponential scale. We'll present the user a scale between 0 and 100%, but the percentage will actually select a percentage of a reasonably-chosen range denominated in decibels. For example, if this range is 60dB, then 50% corresponds to 50% * 60 = 30dB. However, this range is negative; it starts at -60dB and increases toward 0dB so we subtract 60dB, so 30 - 60 = -30. From here, we just convert decibels to amplitude using an established formula, amplitude = 10(db/20), so for example 10(-30/20) = 10-1.5 = 0.03. So 50% perceived loudness yields about 3% of the total amplitude.
There are two more things to note. First, we'll express everything as percentages everywhere, so these functions take a percentage in and express a percentage out. In the previous example, perceptualToAmplitude(50%) = 3%. Second, we allow users to boost the volume of other users, and we use a different scale for perceived > 100%. We scale these to a different "boost" range, which by default is 6dB of boost.
Usage
Convert Interface Value to Amplitude
If you have a volume slider or other control which returns a Number from 0 to 1, the following will convert the slider's position to an amplitude.
> var perceptual = require('@discordapp/perceptual')
> perceptual.perceptualToAmplitude(0.5)
0.056234132519034905
This tells us that the corresponding amplitude that will be perceived as 50% as loud is about 5.6%. By default, this method uses a 50dB range.
From here we can apply this amplitude value (0.056234) to the source, e.g. to the volume
property of a HTMLMediaElement
. We can alternatively multiply sample amplitudes with this volume in a context where we have direct access to the samples.
As a second example,
> perceptual.perceptualToAmplitude(50, 100, 60)
3.162277660168379
This supplies a normalizedMax
argument to specify that we want a range from 0 to 100 instead. It also supplies a non-default dynamic range of 60dB. For these settings, 50% of the loudness corresponds to 3.2% of the amplitude.
We can also boost the original volume using a separate dynamic range,
> perceptual.perceptualToAmplitude(150, 100, 60, 6)
141.25375446227542
With our 6dB boost range, 150% loudness yields 141% amplitude.
Convert Amplitude to Interface Value
perceptual
also supplies a method to calculate from amplitudes to perceptual interface values.
> perceptual.amplitudeToPerceptual(0.5)
0.8795880017344075
This tells us that 50% amplitude corresponds to about 88% perceived loudness on our default 50dB range.