espnet2.asr_transducer.encoder.blocks.conv_input.ConvInput
Less than 1 minute
espnet2.asr_transducer.encoder.blocks.conv_input.ConvInput
class espnet2.asr_transducer.encoder.blocks.conv_input.ConvInput(input_size: int, conv_size: int | Tuple, subsampling_factor: int = 4, vgg_like: bool = True, output_size: int | None = None)
Bases: Module
ConvInput module definition.
- Parameters:
- input_size β Input size.
- conv_size β Convolution size.
- subsampling_factor β Subsampling factor.
- vgg_like β Whether to use a VGG-like network.
- output_size β Block output dimension.
Construct a ConvInput object.
forward(x: Tensor, mask: Tensor | None = None) β Tuple[Tensor, Tensor]
Encode input sequences.
- Parameters:
- x β ConvInput input sequences. (B, T, D_feats)
- mask β Mask of input sequences. (B, 1, T)
- Returns: ConvInput output sequences. (B, sub(T), D_out) mask: Mask of output sequences. (B, 1, sub(T))
- Return type: x
