Diffusion Models Beat GANs - implementation

I'm fairly inexperienced here. I would like to implement something along the lines of the paper [2105.05233] Diffusion Models Beat GANs on Image Synthesis, applying the same concepts but in a simpler implementation.
The code from OpenAI is quite complex and computationally demanding.
How would you suggest I approach this project?