aflr - python SDK
aflr is the official api.audio Python 3 SDK. This SDK provides easy access to the api.audio API from applications written in python.
📝 Table of Contents
🧐 About
This repository is actively maintained by Aflorithmic Labs. For examples, recipes and api reference see the api.audio docs.
🏁 Getting Started
Installation
You don't need this source code unless you want to modify it. If you want to use the package, just run:
pip install aflr -U
Install from source with:
python setup.py install
Prerequisites
Python 3.6+
🚀 Hello World
Create a file hello.py
touch hello.py
Authentication
The library needs to be configured with your account's secret key which is available in your Aflorithmic Dashboard. Import the aflr package and set aflr.api_key
with the api-key you got from the dashboard:
import aflr
aflr.api_key = "your-key"
Create Text to audio in 4 steps
Let's create our first audio from text.
✍️ Create a new script:
script = aflr.Script.create(scriptText="Hello world", scriptName="hello")
print(script)
🎤 Create an speech audio file from the script using Joanna's voice:
response = aflr.Speech.create(scriptId=script["scriptId"], voice="Joanna")
print(response)
🎧 Now let's master the speech file with high quality and a nice background track.
response = aflr.Mastering.create(
scriptId=script.get("scriptId"),
backgroundTrackId="full__citynights.wav"
)
print(response)
🎉 Finally, get the urls of the audio files generated:
urls = aflr.Mastering.retrieve(scriptId=script["scriptId"])
print(urls)
Or download the files in your current folder:
filepath = aflr.Mastering.download(scriptId=script["scriptId"], destination=".")
print(filepath)
Easy right? 🔮 This is the hello.py
final picture:
import aflr
aflr.api_key = "your-key"
script = aflr.Script.create(scriptText="Hello world", scriptName="hello")
response = aflr.Speech.create(scriptId=script["scriptId"], voice="Joanna")
print(response)
response = aflr.Mastering.create(
scriptId=script.get("scriptId"),
backgroundTrackId="full__citynights.wav"
)
print(response)
urls = aflr.Mastering.retrieve(scriptId=script["scriptId"])
print(urls)
filepath = aflr.Mastering.download(scriptId=script["scriptId"], destination=".")
print(filepath)
Now let's run the code:
python hello.py
Once completed, check the files in the hello.py
root folder - you will see a new audio file. Play it!
📑 Documentation
Import
import aflr
Authentication
The library needs to be configured with your account's secret key which is available in your Aflorithmic Dashboard. Set aflr.api_key
with the api-key you got from the dashboard:
aflr.api_key = "your-key"
Authentication with environment variable (recommended)
You can also authenticate using aflr_key
environment variable and the aflr SDK will automatically use it. To setup, open the terminal and type:
export aflr_key=<your-key>
If you provide both environment variable and aflr.api_key
authentication, the aflr.api_key
will be used.
Resource Usage
There are two approaches to use the resources.
First approach is to import the resource classes you want to use first, then use resource methods. For example, to use Script
, we could do:
from aflr import Script
Script.create()
The second approach is to use it directly from aflr:
import aflr
aflr.Script.create()
Same logic applies for other resources (Speech
, Voice
, Sound
...)
Script
resource
The Script resource/class allows you to create, retrieve and list scripts. Learn more about scripts here.
Script methods are:
create()
- Create a new script.
- Parameters:
scriptText
* [Required] (string) - Text for your script. A script can contain multiple sections and SSML tags. Learn more about scriptText details hereprojectName
(string) - The name of your project. Default value is "default"moduleName
(string) - The name of your module. Default value is "default"scriptName
(string) - The name of your script. Default value is "default"scriptId
(string) - Custom identifier for your script. If scriptId parameter is used, then projectName, moduleName and scriptName are required parameters.
- Example:
script = aflr.Script.create(
scriptText="<<sectionName::hello>> Hello {{username|buddy}} <<sectionName::bye>> Good bye from {{location|barcelona}}",
projectName="myProject",
moduleName="myModule",
scriptName="myScript",
scriptId="id-1234"
)
retrieve()
- Retrieve a script by id.
list()
- List all scripts available in your organization.
get_random_text()
- Retrieve random text from a list of categories.
Speech
resource
Speech allows you to do Text-To-Speech (TTS) with our API using all the voices available. Use it to create a speech audio file from your script.
Speech methods are:
-
create()
Send a Text-To-Speech request to our Text-To-Speech service.
- Parameters:
scriptId
* [Required] (string) - The script IDvoice
(string) - Voice name. See the list of available voices using Voice resource. Default voice is "Joanna".speed
(string) - Voice speed. Default speed is 100.effect
(string) - Put a funny effect in your voice. You can try the following ones: dark_father
, chewie
, 88b
, 2r2d
, volume_boost_low
volume_boost_middle
volume_boost_high
(Volume boost allows you to adjust the volume of speech. NOTE! Volume boost effect only applies to speech creation and will be overwritten by the mastering process)silence_padding
(integer) - Add a silence padding to your speech tracks (in milliseconds). Default is 0 (no padding)audience
(dictionary) - List of dicts containing the personalisation parameters as key-value pairs. This parameter depends on the number of parameters you used in your script resource. For instance, if in the script resource you have scriptText="Hello {{name}} {{lastname}}"
, the audience should be: [{"username": "Elon", "lastname": "Musk"}]
sections
(dictionary) is a dictionary (key-value pairs), where the key is a section name, and the value is another dictionary with the section configuration ( valid parameters are: voice, speed, effect, silence_padding). If a section is not found here, the section will automatically inherit the voice, speed, effect and silence_padding values you defined above (or the default ones if you don't provide them). See an example below with 2 sections and different configuration parameters being used.
sections={
"firstsection": {
"voice": "Matthew",
"speed": 110,
"silence_padding": 100,
"effect": "dark_father"
},
"anothersection": {
"voice": "en-GB-RyanNeural",
"speed": 100
}
}
voiceName
(DEPRECATED, use voice
instead)scriptSpeed
(DEPRECATED, use speed
instead)
- Simple example:
response = aflr.Speech.create(
scriptId="id-1234",
voice="Joanna"
)
- Complete example:
response = aflr.Speech.create(
scriptId="id-1234",
voice="Matthew",
speed=100,
effect="dark_father",
silence_padding= 1000,
audience=[{"username": "Elon", "lastname": "Musk"}],
sections={
"firstsection": {
"voice": "Matthew",
"speed": 110,
"silence_padding": 100,
"effect": "dark_father"
},
"anothersection": {
"voice": "en-GB-RyanNeural",
}
}
)
-
retrieve()
Retrieve the speech file urls.
-
download()
Download the speech files in your preferred folder.
Voice
resource
Voice allows you to retrieve a list of the available voices from our API.
Voice methods are:
-
list()
List all the available voices in our API. The parameters are all optional, and can be used in combination to get the perfect voice for your usecase.
- Parameters:
provider
(string) - Try one of: google, polly, azure, msnrproviderFullName
(string) - Try with one of: amazon polly, google, microsoft azure, aflorithmic labslanguage
(string) - Try with one of: english, spanish, french, germanaccent
(string) - Try with one of: american, british, neutral, portuguese/brazilian, american soft, mexican, australiangender
(string) - Try with one of: male, femaleageBracket
(string) - Try with one of: adult, child, seniortags
(string) - Try with one or more (separated by commas) of: steady, confident, balanced, informative, serious, instructional, slow, storytelling, calm, clear, deep, formal, sad, thin, fast, upbeat, fun, energetic, tense, very fast, flat, low pitched, high pitched, low-pitched, sing-y, cooperative, kind, stable, monotonous, neutral, responsible, business man, straight to the point, knowledgeable, focused, newscastery, newsreader, interviewer, reliable, friendly, welcoming, good for handing out information, slightly friendlyindustryExamples
(string) - Try with one or more (separated by commas) of: fitness, business, commercial, fashion, travel, audiobook, real estate, faith, health industry, comercial, realestate, kids entertainment, games, customer service, education, storytelling, entertainment, kids, education audiobook
- Example:
all_voices = aflr.Voice.list()
- Example:
french_voices = aflr.Voice.list(language="french",tags="steady, fun")
-
list_parameters()
This endpoint lets you see which attributes you can filter the voices by, along with the allowed values for each attribute. You can later use these parameters and values to filter the voices you wish to list.
Sound
resource
Sound allows you to design your own sound template from a script and a background track. In order to get a sound template/project, make sure you requested speech for your script resource first.
Sound methods are:
create()
Creates a sound template, compresses the sound project into a zip file and returns the url.
retrieve()
Retrieve the url of the sound project zip file.
list_sound_templates()
List all the available sound templates in our api.
list()
List all the available background tracks in our API.
list_v2()
List all the available background tracks in our API including a 15 seconds audio snippet.
download()
Download the sound project zip file in your preferred folder.
Mastering
resource
Mastering allows you to create and retrieve a mastered audio file of your script. A mastered version contains the speech of the script, a background track, personalised parameters for your audience and a mastering process to enhance the audio quality of the whole track. In order to get a mastered audio file, make sure you requested speech for your script resource first.
Mastering methods are:
create()
Creates a mastered version of your script.
- Parameters:
scriptId
* [Required] (string) - The script resource ID.backgroundTrackId
* [Required] (string) - The background track file ID.audience
(list) - List of dicts containing the personalisation parameters. This parameter depends on the number of parameters you used in your script resource. In the script documentation example above, we used 2 parameters: username
and location
, and in the following example below we want to produce the script for username Antonio
with location Barcelona
.public
(boolean) - Boolean flag that allows to store the mastered file in a public s3 folder. Default value is False
. Warning - This will cause your mastered files to be public to anyone in the internet. Use this at your own risk.vast
(boolean) - Boolean flag that allows to create a VAST file of your mastered file. The vast
flag only works if public
is True
. Default value is False
.
- Example:
response = aflr.Mastering.create(
scriptId="id-1234",
backgroundTrackId="full__citynights.wav",
audience=[{"username":"antonio", "location":"barcelona"}]
)
retrieve()
Retrieves the mastered file urls.
download()
Download the mastered files in your preferred folder.
File
resource
File allows you to retrieve all the files available in api.audio for your organization.
Available soon.
Authors
License
This project is licensed under the terms of the MIT license.