See how one ESPnet3 recipe holds configs, stages, dataset code, and outputs in one project.

$pip install espnetRecipes
Clone a recipe, run in minutes
Pick a built-in recipe and clone it locally — configs, data pipeline, and training script all included.
$ espnet3 clone librispeech/asr --project my_projectCreates my_project/ with the full recipe — edit & run immediately.
Guides
Scale training to multiple nodes, handle large datasets, and run batch inference.
Adapt a cloned recipe to your own dataset, model, and training loop.
Understand the YAML config slots for training, inference, dataset, resolvers, and publication.
Move an ESPnet2 recipe to ESPnet3 and understand the structure, config, and parallel changes.
Read the high-level migration notes if your reference point is Hugging Face, NeMo, or SpeechBrain.
Research Pipeline
Stages
Click a stage, to see the details
dataset:
train:
data_src: librispeech/asr
data_src_args:
split: train-clean-100
valid:
data_src: librispeech/asr
data_src_args:
split: dev-cleanmodel:
_target_: espnet3.systems.asr.task.ASRTask
token_type: bpe
ctc_weight: 0.3dataset:
test:
eval1:
data_src: librispeech/asr
data_src_args:
split: test-cleanmetrics:
- _target_: espnet3.systems.asr.metrics.wer.WER
normalize: true
remove_whitespace: falsepack_model:
out_dir: ${exp_dir}/model_pack
include:
- ${recipe_dir}/src
- ${recipe_dir}/src
exclude:
- last.ckptinputs:
- name: speech
type: audio
- name: prompt
type: textResearch at Scale
Key features
Built for the foundation model era — distributed, multilingual, reproducible.
Community
Contribute to ESPnet3
Add a recipe, extend the framework, or improve the docs — contributions of all sizes are welcome.
