
Company News
Socket Named Top Sales Organization by RepVue
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.
corpus-replicator
Advanced tools
Corpus Replicator is a corpus generation tool that enables the creation of multiple
unique output files based on templates. The primary intended use case is the
creation of a seed corpus that can be used by fuzzers. Support for additional output
formats can be added via the creation of Recipes. If a desired format is unsupported,
support can be added via the creation of a CorpusGenerator.
The goal is to create an efficient corpus that maximizes code coverage and minimizes file size. Small unique files that execute quickly are preferred.
Currently four media types can be generated animation, audio, image and
video.
Corpus Replicator relies on FFmpeg.
pip install corpus-replicator
This is an example recipe file.
# "base" contains required entries and default flags
base:
codec: "h264" # name of the codec
container: "mp4" # container/file extension
library: "libx264" # name of library
medium: "video" # supported medium
tool: "ffmpeg" # name of supported tool
default_flags:
encoder: # "encoder" flag group
["-c:v", "libx264"]
resolution: # "resolution" flag group
["-s", "320x240"]
# variations allow flags to be added and overwritten
# one file will be generated for each entry in a flag group
variation:
resolution: # flag group - overwrites default flag group in "base"
- ["-s", "640x480"]
- ["-s", "32x18"]
- ["-s", "64x64"]
monochrome: # flag group - adds new flag group
- ["-vf", "hue=s=0"]
Running the recipe will generate a corpus:
$ corpus-replicator example.yml video -t test
Generating templates...
1 recipe(s) will be used with 1 template(s) to create 4 file(s).
Generating 4 'video/libx264/h264/mp4' file(s) using template 'test'...
Optimizing corpus, checking for duplicates...
Done.
Resulting corpus:
$ ls generated-corpus/
video-h264-libx264-test-monochrome-00.mp4
video-h264-libx264-test-resolution-01.mp4
video-h264-libx264-test-resolution-00.mp4
video-h264-libx264-test-resolution-02.mp4
A more complex corpus can be generated by using multiple Recipes and Templates at
once.
Recipes are stored in src/corpus_replicator/recipes.
FAQs
Unknown package
We found that corpus-replicator demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Company News
Socket won two 2026 Reppy Awards from RepVue, ranking in the top 5% of all sales orgs. AE Alexandra Lister shares what it's like to grow a sales career here.

Security News
NIST will stop enriching most CVEs under a new risk-based model, narrowing the NVD's scope as vulnerability submissions continue to surge.

Company News
/Security News
Socket is an initial recipient of OpenAI's Cybersecurity Grant Program, which commits $10M in API credits to defenders securing open source software.