I see a zero mean reward for the first agent in multi-agent RL Toolbox


ali farid on 11 Sep 2023


Hello, I have extended MATLAB's PPO coverage path planning example to 5 agents. I can see that the first agent always receives a reward, but the toolbox always shows a zero mean reward for the first agent, as in the following image, which is not correct. Do you have any idea what is happening there?