| Topic | Replies | Views | Activity |
|---|---|---|---|
| State of the Bazel build and updating the Bazel build files to support Bzlmod | 0 | 3 | February 23, 2026 |
| PR Review request | 0 | 10 | February 22, 2026 |
| [ROCm][CI] fp8 acceptable accuracy threshold | 2 | 58 | February 20, 2026 |
| Problem of freeze metrics after first epoch | 0 | 21 | February 20, 2026 |
| PyTorch + torchsparse compatibility for RTX 5090 | 1 | 40 | February 19, 2026 |
| Transforms are being applied to masks for some reason | 1 | 31 | February 19, 2026 |
| Backend-agnostic serialization / inference workflow | 4 | 34 | February 19, 2026 |
| Unable to allocate shared memory (shm) for file | 4 | 88 | February 18, 2026 |
| LLaVA Steering: Why does grounding fix hallucinations in captioning but not in Yes/No QA? | 0 | 15 | February 18, 2026 |
| Are MPS operators rewritten? | 2 | 38 | February 18, 2026 |
| Extracting int8 weights and other quant params after convert_pt2e | 1 | 35 | February 17, 2026 |
| XNNPACKQuantizer.set_module_name() not working as expected | 2 | 38 | February 17, 2026 |
| Limitations of Int8 QAT for Linear Layers | 1 | 29 | February 17, 2026 |
| Severe training slowdown after unfold-based patchification | 5 | 58 | February 17, 2026 |
| Question About Backward–ReduceScatter Overlap in FSDP Figure 5 | 2 | 27 | February 17, 2026 |
| Support libtorch for linux arm | 3 | 34 | February 17, 2026 |
| Inter-process sharing CUDA Tensor | 1 | 24 | February 17, 2026 |
| CUDA support for RTX 50x on Windows | 4 | 1161 | February 17, 2026 |
| RPN+ROI architecture (Faster R-CNN) | 1 | 838 | February 16, 2026 |
| Nested torch.compile-d function calls with different options / CUDA graph options | 0 | 20 | February 16, 2026 |
| CTCDecoder returns result.tokens longer than input emissions - why +2 extra tokens? | 5 | 28 | February 16, 2026 |
| Fp16_compress_hook casts to FP16 before dividing by world_size - causes NaN with large gradients | 0 | 13 | February 16, 2026 |
| How to pass float32 into specific model layer but keep the rest of weights float16 (deepspeed)? | 0 | 13 | February 15, 2026 |
| Utilising Serialized Torch Models in Apache Spark (scala) | 0 | 23 | February 14, 2026 |
| PyTorch dev 130 | 2 | 34 | February 13, 2026 |
| Is there any way to compute multiple jvps without repeatedly computing function value? | 5 | 41 | February 13, 2026 |
| Post training quantized model gets the error "Copying from quantized Tensor to non-quantized Tensor is not allowed" even though I'm not copying tensor | 5 | 57 | February 12, 2026 |
| Mark_dynamic does not work when the dimension is 1 | 0 | 24 | February 12, 2026 |
| Is torch Muon optimizer compatible with FSDP/HSDP? | 1 | 44 | February 12, 2026 |
| Flex_attention and torch.compile | 1 | 31 | February 12, 2026 |