espnet3.systems.asr.system.ASRSystem
espnet3.systems.asr.system.ASRSystem
class espnet3.systems.asr.system.ASRSystem(training_config=None, inference_config=None, metrics_config=None, **kwargs)
Bases: BaseSystem
ASR-specific system.
This system adds. : - Tokenizer training inside train()
Additional stage log paths. : - train_tokenizer -> training_config.tokenizer.save_path
Initialize the ASR system with ASR-specific stage mappings.
pack_model(*args, **kwargs)
Pack model artifacts into an espnet3 bundle.
train(*args, **kwargs)
Train the model, training the tokenizer first if needed.
This stage checks for a cached tokenizer model and runs tokenizer training before delegating to the base training routine.
- Raises:RuntimeError – If neither dataset references nor
dataset_direxist.
train_tokenizer(*args, **kwargs)
Train a SentencePiece tokenizer based on configured text.
The text builder configured in training_config.tokenizer.text_builder is used to generate training text, which is then saved and consumed by the SentencePiece trainer.
- Raises:RuntimeError – If required tokenizer config is missing or invalid.
