Repeating Values/Same output across Episodes Qnetworks

I have run into an issue with a dual Q deep neural network implementing the Bell equation to compute loss. I currently am not entirely sure whats causing the issue but every time I run the code the output is the same additionally the output is the same across every episode where it would almost seem like the network is not learning as it should and the end result just repeats over and over. I am not sure if its how im computing my loss and teaching the network, or if it is related to the setup of the neural network itself.

Hello! If you could post a minimal reproducible snippet someone might be able to assist.