For anyone running CSM-1B (Sesame’s TTS model) on RTX 5090 — I got it working at 0.46x RTF with CUDA graph replay using nightly cu128. Had to patch HF Transformers in 4 places: StaticCache index_copy_ → slice assignment, 3x arange fixes in modeling_csm.py, and cudagraph_mark_step_begin calls in the generate loop. Full pipeline + auto-patcher here: https://github.com/D3velop-llc/csm-rtx5090
this worked on my rtx 5050 too. Thank you very much.
RTX 5050 + CUDA 12.8 + PyTorch (Working Setup)
Hi everyone,
I recently set up PyTorch on a machine with an NVIDIA RTX 5050 Laptop GPU, and wanted to share a working configuration since older/stable builds didn’t properly support this GPU architecture.
Working Setup
-
GPU: NVIDIA GeForce RTX 5050 Laptop GPU
-
CUDA Toolkit: 12.8
-
Python: 3.11
-
PyTorch: Nightly build (CUDA 12.8)
Installation Command
pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu128
Verification
import torch
print("Torch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
print("GPU:", torch.cuda.get_device_name(0))
Notes
- Nightly builds with cu128 worked correctly and enabled full GPU acceleration.
Hope this helps anyone trying to get newer RTX GPUs working with PyTorch ![]()
That’s wrong as all of our binaries using CUDA >= 12.8 support the Blackwell architecture.
yes, i updated the mistake.
Hi Sean_V, you need for this the version of pytorch 2.11.0+cu128, This is because previous version didn’t have this version