PPO agent with continuous action example

I went through the example that uses PPO to train an agent to land a rocket (https://www.mathworks.com/help/reinforcement-learning/ug/train-ppo-agent-to-land-rocket.html?s_tid=blogs_rc_4). However, the action space in that example is discrete. If I change it to a continuous action space, how do I create the actor network, given that numact is one of the parameters of actor_network?

Answers (1)

Emmanouil Tzorakoleftherakis on 22 Jul 2020


Hello,
If you want to use PPO, i.e. a stochastic actor with continuous action space, you can follow the structure shown here.
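The linked structure is not reproduced in this thread, so the following is only a language-agnostic sketch of the usual idea, written in Python rather than with the toolbox API. It assumes a Gaussian policy: instead of one output per discrete action, the actor produces 2*numact values, a mean and a standard deviation for each action dimension. The function names, the tanh scaling of the mean, the softplus on the standard deviation, and the `action_limit` parameter are all illustrative assumptions, not something confirmed by the answer above.

```python
import math
import random


def softplus(x):
    """Numerically stable log(1 + exp(x)); maps any real to a non-negative value."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def split_actor_output(raw, num_act, action_limit=1.0):
    """Split a raw actor output of length 2*num_act into means and stds.

    Assumed layout (hypothetical): the first num_act entries parameterize
    the means, the last num_act entries parameterize the standard deviations.
    """
    if len(raw) != 2 * num_act:
        raise ValueError("expected 2*num_act outputs")
    # tanh bounds each mean to [-action_limit, action_limit]
    means = [action_limit * math.tanh(v) for v in raw[:num_act]]
    # softplus keeps each standard deviation non-negative
    stds = [softplus(v) for v in raw[num_act:]]
    return means, stds


def sample_action(means, stds, rng=random):
    """Draw one continuous action from independent Gaussians, one per dimension."""
    return [rng.gauss(m, s) for m, s in zip(means, stds)]


# Example: two action dimensions, so the actor output has four entries.
means, stds = split_actor_output([0.0, 2.0, -1.0, 0.5], num_act=2, action_limit=3.0)
action = sample_action(means, stds, random.Random(0))
```

The point for the original question is that numact still sets the size of the action, but the final layer of a continuous stochastic actor is twice that wide, and the agent samples from the resulting distribution instead of picking one of numact discrete choices.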


Asked: 22 Jul 2020
