RL Water Tank example by MATLAB does not converge

11 visualizaciones (últimos 30 días)

Alp hace alrededor de 13 horas

0
Enlazar

Enlace directo a esta pregunta

https://es.mathworks.com/matlabcentral/answers/2181119-rl-water-tank-example-by-matlab-does-not-converge

Editada: Alp hace alrededor de 11 horas

I am following the RL water tank control tutorial by MATLAB: https://www.mathworks.com/help/reinforcement-learning/ug/control-water-level-using-ddpg-agent.html (MATLAB R2025b)

However, even the model is learning at the beginning, towards the end of the training, Q0 value explodes and the reward drops from almost maximum to below zero. I need to obtain stable and good results with the official DDPG water tank control to use it as a baseline in my research, and hence, I prefer not to modify hyperparameters of the network, the reward function and the stopping criteria.

Is anyone able to reproduce good results using the given RL water tank example? Or is it okay if it is not stable in its default configuration?=

Here are my results: