Latest nlp topics - PyTorch Forums

Topic	Replies	Views	Activity
Different value between BertSDPA scratch and pretrained sequence classification	0	131	July 30, 2024
How do I apply Batch Normalization on a sequential data?	5	2637	July 28, 2024
OpenNMT beam.py beam search class for seq2seq language models	0	124	July 25, 2024
Which is better TorchServe Batching or ClientSide Batching for text classification	0	28	July 24, 2024
Discrepancy Between key_padding_mask and attn_mask in MultiheadAttention Layer	9	1952	July 23, 2024
Have a question on `W_ii @ x_t` in lstm	0	15	July 20, 2024
nn.TransformerEncoder for classification	9	8358	July 20, 2024
How to reduce TPU RAM usage when calculating large tensors	1	212	July 19, 2024
Can someone help me understand how to use a .pt file?	2	99	July 16, 2024
Why is the 0 vector still not returned after set padding_idx?	1	28	July 15, 2024
How to write my own LearningRate function	1	30	July 15, 2024
Llama-2 CUDA OOM during inference but not training	8	905	July 10, 2024
Error for not having the same size in the input and output in the seqtoseq algorithm	0	38	July 10, 2024
'is_sparse_any' from 'torch._cannot import namesubclasses.meta_utils'	2	211	July 9, 2024
T5 model training stops without any error	11	1628	July 4, 2024
Right loss function for VAE with word2vec (cbow and skipgram)	0	73	July 4, 2024
Why charater based LSTM are taking more time than word based LSTM while next word prediction	0	43	July 3, 2024
How to create a "Both" option in multiple choice model training to solve IndexError: Target out of bounds error?	0	72	June 27, 2024
Should Transformer's causal attention mask be upper-triangular or lower-triangular?	1	1459	June 27, 2024
Torchtext AG_NEWS dataset return (int, tensor)	4	197	June 27, 2024
Why does LSTM accept incorect input size	1	104	June 23, 2024
Models fail to train on simple classification problem, any hints?	0	117	June 18, 2024
How to combine additional features into word embeddings	0	170	June 16, 2024
How to Add New Classes to a Model that Already Has a Certain Number of Classes	1	138	June 14, 2024
Attention for RNN Decoder with multiple layers	2	1470	June 11, 2024
Nsight-compute profiling for torch?	0	309	June 10, 2024
AttributeError: Can't pickle local object 'setup_data_loader.<locals>.seed_worker'	2	888	June 10, 2024
NLP: Named Entity Recognition: Location: looking for a model supporting composite city names like like "Paris, TX"	0	161	May 31, 2024
Flash Attention with variable-length sequences	1	2008	May 27, 2024
Automatically cast input to Huggingface model’s device map	1	1686	May 26, 2024