Latest nlp topics - PyTorch Forums

Topic	Replies	Views	Activity
After LSTM, too much zero	1	321	May 8, 2023
How to make sure which masking arguments I need to provide for calling torch.nn.Transformer model?	0	774	May 7, 2023
Get logits with generation method	0	437	May 7, 2023
Good resource to learn TorchText	1	445	May 7, 2023
Reset CUDA_VISIBLE_DEVICES?	1	1098	May 5, 2023
Facebook BART Fine-tuning - Transformers CUDA error: CUBLAS_STATUS_NOT_INITIALIZE	10	1309	May 2, 2023
Expected input batch_size (3) to match target batch_size (162) for text classification	3	388	May 2, 2023
Training a generative model without teacher forcing	2	2377	April 30, 2023
nn.MultiheadAttention to get heatmap	2	850	April 30, 2023
Is this a right implementation of perplexity?	2	1367	April 30, 2023
Running out of memory training two layer biLSTM w batch size 32	7	548	April 28, 2023
How are results averaged during evaluation for Trainer class in HuggingFace	0	724	April 27, 2023
Exploding memory	11	997	April 27, 2023
Sigmoid function problem for NLP tasks	1	337	April 25, 2023
Input shape of target mask in nn.Transformer	1	1601	April 24, 2023
LSTM/GRU/RNN (dropout) Pytorch implementation following TensorFlow one	2	2177	April 23, 2023
Isn't nn.Transformer confusing with num_decoder_layers=0?	0	435	April 20, 2023
Attention block without copies for contiguity	0	476	April 19, 2023
TypeError: empty() received an invalid combination of arguments - got (tuple, dtype=NoneType, device=NoneType), but expected one of: * (tuple of ints size, *, tuple of names names, torch.memory_format memory_format, torch.dtype dtype, torch.layout layout	3	20127	April 19, 2023
Dataloader with attribute error	1	326	April 18, 2023
Need to use two pytorch_model.bin at the same time, can I renamed it?	1	582	April 15, 2023
Using different feature size between source and target nn.Transformer	4	2359	April 15, 2023
Is nn.MultiheadAttention attn_mask working differently in pytorch 2.0?	4	1114	April 12, 2023
How to convert Pytorch model (AlignTTS) to ONNX?	0	620	April 12, 2023
Expected hidden size different to actual hidden size	1	430	April 11, 2023
GPU memory usage increase with the large dataset	3	567	April 10, 2023
Transformer learning gets worse with more encoder blocks	0	365	April 10, 2023
Should I use GPU or TPU on Google Colab?	1	1649	April 10, 2023
How to train a classifier to return logits for three separate labels	1	513	April 9, 2023
Loss isn't changing with LSTM	2	421	April 7, 2023