How does the type of solver (fixed vs variable step) affect reinforcement learning in Simulink environment?
32 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Vincent
el 13 de Ag. de 2025 a las 17:10
Respondida: Chuguang Pan
el 14 de Ag. de 2025 a las 2:14
So I successfully trained a robot manipulator to grab a object with a TD3 agent with the variable solver ode45. But when I changed the solver to a fixed solver (backward euler) with step size 0.01 (I also used rate transition block between the agent and the state,reward,done signal but the block read no operation since the agent and fixed solver was both at sample time = 0.01 so I'm assuming it had no affect) my agent performed worse and did not learn.
The reason why I used the rate transition block and changed to a fixed solver is because I'm trying to use a custom RL agent that I programmed on my own but I can't seem to use a variable solver otherwise the model doesn't compile when I try to train it. But it seems that fixed solver has negative effects on training.
Has anyone successfully trained a custom agent with a fixed solver or had similar issues? Any adivce or insight is greatly appreicated
3 comentarios
Torsten
el 13 de Ag. de 2025 a las 20:05
Since my solver step size is 10 times smaller than the agent sample time., is it the case my solver will update 10 times and then on the 10th update pass the resulting state as the next state into my agent?
I have no experience with Simulink. Maybe you should ask this as a new question in the forum and/or ask MATLAB Support.
Respuestas (1)
Chuguang Pan
el 14 de Ag. de 2025 a las 2:14
You can use Simulink Debug Functionality to step through the simulation and examine the output of agent.
0 comentarios
Ver también
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!