Latest nlp topics - PyTorch Forums

Topic	Replies	Views	Activity
Log softmax probabilities all equal in rnn decoder because pointer network scores are all < -90.0	0	140	November 19, 2024
How to correct TypeError: zip argument #1 must support iteration training in multiple GPU	6	1122	November 13, 2024
Training starting again in sampling code	3	54	November 9, 2024
AutoModelForCausalLM dataset process	1	284	November 9, 2024
Can someone explain the benefits of Batches?	2	195	November 8, 2024
Teacher forcing ratio	0	250	November 8, 2024
Search in documents	2	184	November 7, 2024
Torch using two GPUs with NV link	8	711	November 5, 2024
Could not get the file at http://www.quest.dcs.shef.ac.uk/wmt16_files_mmt/training.tar.gz. [RequestException] None	6	2089	October 29, 2024
Regarding Scaled Dot Product Attention	4	220	October 25, 2024
Memory Leak with a simple code	3	84	October 22, 2024
Build Auto Tagging System	7	312	October 22, 2024
Why transformer model is predicting only one random word repetatively in every iteration	1	93	October 19, 2024
LogSoftmax vs Softmax	26	56361	October 15, 2024
Why transformer model is behaving like this?	1	64	October 14, 2024
Variable length time series data	1	168	October 12, 2024
I want to eliminate the accumulation of memory usage during the learning loop	0	30	October 7, 2024
The forward function of a multi-layer Elman RNN from tutorial has two errors	0	18	October 1, 2024
Hi everyone, I'm new in nlp, I'm trying to build a machine translation model using BERT and I'm having trouble training the model, my predicted tokens all return the id of the token <eos> ( 3) in the first epoch. How do I handle this. Note: I used label s	0	19	September 29, 2024
Transformer example: Position encoding function works only for even d_model?	4	2753	September 25, 2024
Is the nn.Transformer package missing nn.Generate	0	116	September 23, 2024
Flex Attention Extremely Slow	1	464	September 20, 2024
How tokens per second calculated for LLM training	0	43	September 18, 2024
Drop row from tensor in cuda	3	215	September 14, 2024
Unhashable list while training sbert	0	101	September 14, 2024
RuntimeError: CUDA error: device-side assert triggered, LayoutLM Fine-Tuning	10	935	September 10, 2024
Model predicted almost correct sentences at the time of training but is only predicting <START> token at the time of test	0	26	September 10, 2024
Self Self-attention implementation results are 'a bit' suprising	0	54	September 10, 2024
Extracting embeddings from log probabilities	0	116	September 9, 2024
Can transformer automatically learn the length of sequences?	0	38	September 9, 2024