RTX 5090 compatibility with PyTorch

Dear all,
I have installed an NVIDIA RTX 5090 in my PC for deep reinforcement learning with PyCharm. My PC has 16 GB of RAM and a 12th Gen Intel(R) Core™ i7-12700 CPU. However, when I run my code, the system freezes and the screen goes black, even though neither the GPU nor the RAM has reached maximum capacity. I would like to know whether the hardware components are incompatible, or whether PyTorch does not support the RTX 5090 and I have a software issue. The RTX 5090 uses CUDA 12.9, and I have installed the CUDA 12.9 nightly version.

Thanks.

All of our PyTorch binaries built with CUDA 12.8+ support Blackwell GPUs. You might need to check system logs to see why your system seemingly crashes, which could indicate e.g. an underpowered PSU.
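
To rule out the software side, the installed build can be checked directly. The following is a minimal sketch, not an official diagnostic: the helper names `cuda_version_tuple`, `build_supports_blackwell`, and `report` are made up for illustration, and the 12.8 threshold is simply the one stated above. It only uses `torch.version.cuda`, `torch.cuda.is_available()`, `torch.cuda.get_device_name()`, `torch.cuda.get_device_capability()`, and `torch.cuda.get_arch_list()`.

```python
import re

# Threshold taken from the statement above: CUDA 12.8+ builds support Blackwell.
MIN_BLACKWELL_CUDA = (12, 8)


def cuda_version_tuple(version):
    """Turn a string such as '12.9' into (12, 9); return None if it cannot be parsed."""
    if not version:
        return None  # CPU-only builds report no CUDA version
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def build_supports_blackwell(version):
    """True if the CUDA version the binary was built with is at least 12.8."""
    parsed = cuda_version_tuple(version)
    return parsed is not None and parsed >= MIN_BLACKWELL_CUDA


def report():
    """Print what the installed PyTorch build sees (hypothetical helper)."""
    import torch  # imported here so the helpers above work without PyTorch

    print("PyTorch:", torch.__version__)
    print("Built with CUDA:", torch.version.cuda)
    print("CUDA 12.8+ build:", build_supports_blackwell(torch.version.cuda))
    if torch.cuda.is_available():
        print("Device:", torch.cuda.get_device_name(0))
        print("Compute capability:", torch.cuda.get_device_capability(0))
        print("Compiled architectures:", torch.cuda.get_arch_list())
        # A small matmul confirms that kernels actually launch on the device.
        x = torch.randn(1024, 1024, device="cuda")
        print("Matmul finite:", torch.isfinite(x @ x).all().item())
    else:
        print("CUDA is not available to this build.")
```

Calling `report()` in the same environment PyCharm uses should show a CUDA 12.8+ build and list the GPU. If it does and the small matmul runs, the binaries are most likely fine and the freeze is more likely to come from elsewhere.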

So the observed freezing is unlikely to stem from software-related causes or from my code?

Yes, I would recommend checking the system logs to see what kind of issues were detected that could cause the black screen and system freeze.

Thanks a lot for your help!

Sure! Let me know if you find something interesting in dmesg or any other system logs that could explain the issues you are seeing.

Hello!
My system log is shown below. Is it a hardware issue, and how can I find the problem?

TimeCreated : 7/25/2025 2:23:54 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-DistributedCOM
Id : 10016
Message : The application-specific permission settings do not grant Local Activation permission for
the COM Server application with CLSID
{2593F8B9-4EAF-457C-B68A-50F6B8EA6B54}
and APPID
{15C20B67-12E7-4BB6-92BB-7AFF07997402}
to the user UCLM\Ali.Noutash SID (S-1-12-1-1865765806-1139200683-376727440-516299602) from address
LocalHost (using LRPC) running in the application container Unavailable SID (Unavailable). This
security permission can be modified using the Component Services administrative tool.

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:36 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:36 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:36 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:

TimeCreated : 7/25/2025 2:23:36 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port
               Error Source: Advanced Error Reporting (PCI Express)

               Primary Bus:Device:Function: 0x0:0x1:0x0
               Secondary Bus:Device:Function: 0x0:0x0:0x0
               Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
               Secondary Device Name:
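
With this many near-identical entries, it can help to tally them before reading each one. The following is a rough sketch, assuming the log was saved as text in the `Key : Value` layout shown above; `parse_events`, `count_events`, and the shortened `SAMPLE` excerpt are illustrative only and not part of any Windows or PyTorch tooling.

```python
from collections import Counter

# Top-level fields that appear in the log layout above.
FIELDS = {"TimeCreated", "LevelDisplayName", "ProviderName", "Id", "Message"}

# Shortened excerpt of the log above, used only to demonstrate the parser.
SAMPLE = """\
TimeCreated : 7/25/2025 2:23:54 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-DistributedCOM
Id : 10016

TimeCreated : 7/25/2025 2:23:37 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.

               Component: PCI Express Root Port

TimeCreated : 7/25/2025 2:23:36 PM
LevelDisplayName : Warning
ProviderName : Microsoft-Windows-WHEA-Logger
Id : 17
Message : A corrected hardware error has occurred.
"""


def parse_events(text):
    """Split the text into one dict per event; a TimeCreated line starts a new event."""
    events, current = [], {}
    for line in text.splitlines():
        key, sep, value = line.partition(" : ")
        key = key.strip()
        if not sep or key not in FIELDS:
            continue  # detail lines such as 'Component: ...' are skipped
        if key == "TimeCreated" and current:
            events.append(current)
            current = {}
        current[key] = value.strip()
    if current:
        events.append(current)
    return events


def count_events(events):
    """Count events per (provider, event id) pair."""
    return Counter((e.get("ProviderName"), e.get("Id")) for e in events)


for (provider, event_id), n in count_events(parse_events(SAMPLE)).most_common():
    print(f"{n:3d} x {provider} (Id {event_id})")
```

Run against the full log, this kind of tally makes it easy to see that the WHEA-Logger Id 17 warnings on the PCI Express Root Port dominate, all within about a second of each other, while the DistributedCOM warning appears once.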