PyTorch Forums

Topic	Replies	Views	Activity
I want to create a learnable parameter when aggregating several models together	7	718	August 7, 2023
Python slice value cannot be used as a value: jit	1	1102	August 7, 2023
Root Cause (first observed failure): distributed	6	3080	August 7, 2023
CUDA devices suddenly inaccessible	1	314	August 7, 2023
How to increase RTX 3090 GPU usage? C++	0	386	August 7, 2023
Evaluation but gradients needed	1	360	August 7, 2023
Does batched_nms in torch support backpropagation? vision	1	282	August 7, 2023
What is `sym_size` and how is it different from `size` in pytorch C++? C++	1	791	August 7, 2023
Different results for the same PyTorch model on PC and Android mobile phone? Mobile	0	607	August 7, 2023
Backpropagation for each dimension of output	5	325	August 7, 2023
How to apply Conv2D to [time_dim, batch_dim, C_out, H_out, W_out]? vision	3	438	August 6, 2023
GPU Performance Bottleneck: What are the possible causes? projects	4	1267	August 6, 2023
Correct way to build and get encodings from siamese using pretrained model	19	2505	August 6, 2023
Weighted average of model parameter with trainable scalar autograd	0	206	August 6, 2023
Activation maximization for ResNets vision	0	309	August 6, 2023
A question about the center parameter of torchaudio.MelSpectrogram audio	0	415	August 6, 2023
The attention mechanism is added to the model, but the results barely change? vision	2	666	August 6, 2023
UnpicklingError: STACK_GLOBAL requires str	9	6317	August 6, 2023
Degraded performance after upgrading to Windows 11	2	636	August 6, 2023
Expected target size [100, 24, 40, 40], got [100] vision	4	307	August 6, 2023
How to call torch.distributed.nn.all_gather on each node independently? distributed	2	941	August 6, 2023
Compiling the model results in 20x slowdown torch.compile	9	2651	August 6, 2023
No gradient update when using FSDP with Hugging Face Accelerate distributed	1	828	August 5, 2023
Size mismatch for model.diffusion_model.output_blocks.8.0.skip_connection.bias: copying a param with shape torch.Size([320]) from checkpoint, the shape in current model is torch.Size([640])	0	2067	August 5, 2023
Function signatures to implement new backend C++	2	451	August 5, 2023
Shouldn't LSTM and LSTMCell produce identical sequence of hidden states when both are fed one timestep at a time? nlp	2	367	August 5, 2023
Gradient Hessian product error autograd	1	555	August 5, 2023
Pruning channels of pretrained ResNet-50 model while preserving the unpruned channels' weight?	3	672	August 5, 2023
Model not giving any output after full fine-tuning (instruction-based fine-tuning) on DDP distributed	8	909	August 5, 2023