I have a question about DQN. Say that for the nth state I get these Q-values:
qnthvalues = [1,2,3]
Here the max Q-value is selected, which is at index 2 (the 3rd value), so I take action 3 and get the Q-values for the (n+1)th state. Should I now apply the Bellman equation only for that action, i.e. only to the 3rd value of the (n+1)th Q-values, and leave the other values unchanged in the target?
qn+1value = [2,3,4]
targetq_values = [2,3,bellmaneq(4)]
or
targetq_values = bellmaneq(qn+1values)  # for all Q-values of that state
So, do we apply the Bellman equation to all the Q-values, or only to the Q-value of the action taken?
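To make the two options concrete, here is a minimal NumPy sketch. The `reward` and `gamma` values are assumptions added for illustration (they are not given above), and `bellmaneq` is taken to mean the usual one-step target, reward + gamma * max of the next state's Q-values. The sketch also assumes that, in the action-only option, the untouched entries are copied from the current state's predictions rather than from the next state's as written above.

```python
import numpy as np

# Q-values from the question: current state n and next state n+1.
q_n = np.array([1.0, 2.0, 3.0])
q_n1 = np.array([2.0, 3.0, 4.0])

# Assumed for illustration only; not given in the question.
reward = 1.0
gamma = 0.9

# Greedy action for state n: index 2, i.e. the 3rd value.
action = int(np.argmax(q_n))

# One-step Bellman target for the action actually taken.
bellman_target = reward + gamma * np.max(q_n1)

# Option 1: update only the entry of the taken action.
# The other entries are copied from the current prediction,
# so they produce zero error in the loss.
target_action_only = q_n.copy()
target_action_only[action] = bellman_target

# Option 2: apply the same formula to every entry of the next state's Q-values.
target_all = reward + gamma * q_n1

print("action index:", action)
print("option 1 target:", target_action_only)
print("option 2 target:", target_all)
```

In the usual DQN setup only one action is taken per transition, so a reward is observed for that action alone. Option 1 is therefore the construction most implementations use, and the other entries are left equal to the network's own output so they do not affect the gradient.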