This is my code base and I want to add pytorch profiler to the fit() function. Can anyone let me know how to embed the pytorch profiler?
I also raised the issue in the verl github, but was not resolved. Any help or insights will be appreciated.
It only print the cpu events without CUDA events. But, the GPU utilization and VRAM is almost 80 to 100% over the course of training.