espnet2.gan_svs.visinger2.visinger2_vocoder.create_fb_matrix
espnet2.gan_svs.visinger2.visinger2_vocoder.create_fb_matrix(n_freqs: int, f_min: float, f_max: float, n_mels: int, sample_rate: int, norm: str | None = None) → Tensor
Create a frequency bin conversion matrix.
- Parameters:
  - n_freqs (int) – Number of frequencies to highlight/apply
  - f_min (float) – Minimum frequency (Hz)
  - f_max (float) – Maximum frequency (Hz)
  - n_mels (int) – Number of mel filterbanks
  - sample_rate (int) – Sample rate of the audio waveform
  - norm (Optional[str]) – If "slaney", divide the triangular mel weights by the width of the mel band (area normalization). (Default: None)
- Returns: Triangular filter banks (fb matrix) of size (n_freqs, n_mels), i.e. the number of frequencies to highlight/apply by the number of filterbanks. Each column is a filterbank, so for a matrix A of size (…, n_freqs) the applied result would be A * create_fb_matrix(A.size(-1), …), where * denotes a matrix product.
- Return type: Tensor
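
To make the shape and the role of `norm` concrete, here is a minimal, dependency-free sketch of how a triangular mel filterbank of this kind can be built. It is not the ESPnet implementation: the name `sketch_fb_matrix`, the helpers `hz_to_mel` and `mel_to_hz`, the HTK-style mel formula, and the assumption that the `n_freqs` bins span 0 Hz to the Nyquist frequency are all illustrative choices that this page does not confirm. It returns nested lists rather than a Tensor.

```python
import math


def hz_to_mel(f_hz):
    # HTK-style mel scale (an assumption; this page does not name the formula).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def sketch_fb_matrix(n_freqs, f_min, f_max, n_mels, sample_rate, norm=None):
    """Hypothetical stand-in: returns an n_freqs x n_mels list of lists."""
    if norm not in (None, "slaney"):
        raise ValueError("norm must be None or 'slaney'")
    # Centre frequency of each input bin, assumed to span 0 Hz .. Nyquist.
    nyquist = sample_rate / 2
    bin_hz = [nyquist * i / (n_freqs - 1) for i in range(n_freqs)]
    # n_mels + 2 band edges, equally spaced on the mel scale.
    m_min, m_max = hz_to_mel(f_min), hz_to_mel(f_max)
    edges = [
        mel_to_hz(m_min + (m_max - m_min) * k / (n_mels + 1))
        for k in range(n_mels + 2)
    ]
    fb = [[0.0] * n_mels for _ in range(n_freqs)]
    for j in range(n_mels):
        lo, mid, hi = edges[j], edges[j + 1], edges[j + 2]
        # "slaney": divide the triangle by the band width (area normalization).
        scale = 2.0 / (hi - lo) if norm == "slaney" else 1.0
        for i, f in enumerate(bin_hz):
            rising = (f - lo) / (mid - lo)
            falling = (hi - f) / (hi - mid)
            fb[i][j] = scale * max(0.0, min(rising, falling))
    return fb


fb = sketch_fb_matrix(n_freqs=201, f_min=0.0, f_max=8000.0, n_mels=40, sample_rate=16000)
print(len(fb), len(fb[0]))  # 201 40

# Applying the filterbank to one spectrum frame of length n_freqs
# is a vector-matrix product over the frequency axis.
frame = [1.0] * 201
mel_frame = [sum(frame[i] * fb[i][j] for i in range(201)) for j in range(40)]
```

Each column of `fb` is one triangular filter, which is why a spectrogram with `n_freqs` as its last dimension is multiplied on the right by the matrix to obtain `n_mels` values per frame.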
