Changing kernel size in wav2vec pretrained model
|
|
3
|
823
|
September 22, 2022
|
Back-propagate through torch.stft and torch.istft
|
|
4
|
1141
|
September 20, 2022
|
Torchaudio backend change on Windows
|
|
0
|
881
|
September 5, 2022
|
CNN speech recognition model good train/test accuracy yet poor inference on outside data?
|
|
3
|
761
|
September 4, 2022
|
CTCDecoder on GPU
|
|
1
|
721
|
September 1, 2022
|
How can I classify the language of voice data?
|
|
0
|
411
|
August 30, 2022
|
How can I classify words spoken by one person?
|
|
1
|
449
|
August 24, 2022
|
Exploding loss in encoder/decoder model
|
|
1
|
649
|
August 23, 2022
|
Audio frame wise training
|
|
0
|
594
|
August 3, 2022
|
Can I use TimeStretch on Mel-frequency spectrograms?
|
|
0
|
522
|
August 2, 2022
|
How to format forward function for multiple inputs
|
|
5
|
2000
|
August 2, 2022
|
Converting architecture to multiple input, single output
|
|
0
|
542
|
July 31, 2022
|
Using nn.transformer for audio spectrograms?(speech recognition)
|
|
0
|
530
|
July 27, 2022
|
How to use torchauido load mp3 file
|
|
0
|
555
|
July 19, 2022
|
MFCC and LFCC in frame by frame processing
|
|
1
|
764
|
July 13, 2022
|
Numpy is not available error
|
|
7
|
8833
|
July 11, 2022
|
Random paramters value for torchaudio augmentation
|
|
1
|
428
|
July 1, 2022
|
How to use and finetune XLS-R wave2vec2 feature extractor in PyTorch?
|
|
1
|
738
|
June 25, 2022
|
CNN ASR getting nan and core dump after epoch 1 with custom dataset
|
|
2
|
482
|
June 20, 2022
|
WavNet Synthesis
|
|
0
|
565
|
June 20, 2022
|
Pre-trained Model on Spectrograms
|
|
1
|
734
|
May 16, 2022
|
PyTorch equivalent of tf.gather
|
|
7
|
2063
|
May 14, 2022
|
Raspberry pi 4 64-bit
|
|
1
|
835
|
May 14, 2022
|
Torchaudio.functional.lfilter VS scipy.signal.lfilter
|
|
3
|
1648
|
May 11, 2022
|
Pretrain Resnet34 based on mel spectrogram features
|
|
0
|
593
|
May 5, 2022
|
Implementing scriptable channel masking transform for multi-channel input
|
|
0
|
488
|
April 29, 2022
|
How to feed mfcc features into Resnet/VGG
|
|
2
|
783
|
April 28, 2022
|
CTC loss error after first epoch
|
|
1
|
1355
|
April 21, 2022
|
Unable to reproduce results in Emformer
|
|
8
|
730
|
April 20, 2022
|
RNN on a Spectrogram - Loss Isn't Going Down
|
|
4
|
594
|
April 19, 2022
|