PPO agent with continuous action example

22 Jul. 2020



I went through the example that uses PPO to train an agent to land a rocket (https://www.mathworks.com/help/reinforcement-learning/ug/train-ppo-agent-to-land-rocket.html?s_tid=blogs_rc_4). However, the action space in that example is discrete. If I change to a continuous action space, how do I create the actor network, given that numact is one of the parameters in actor_network?


Answers (1)

Emmanouil Tzorakoleftherakis on 22 Jul. 2020


Hello,

If you want to use PPO with a continuous action space, i.e. a stochastic actor with continuous actions, you can follow the structure shown here.
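The structure linked in the answer is not preserved in this thread. As a rough, language-agnostic illustration (plain Python, not Reinforcement Learning Toolbox code), the sketch below shows what a stochastic actor for a continuous action space commonly computes: instead of one probability per discrete action, it outputs a mean and a standard deviation for each action dimension, and the action is sampled from the resulting Gaussian. All names here (`gaussian_actor_head`, `w_mean`, `w_std`, `upper_limit`, `min_std`) are hypothetical, and the tanh/softplus choices are assumptions rather than the method from the linked page.

```python
import math
import random


def softplus(x):
    """Numerically stable log(1 + exp(x)); maps any real number to a non-negative one."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def gaussian_actor_head(features, w_mean, w_std, upper_limit, min_std=1e-6):
    """Map shared features to a (mean, std) pair per continuous action dimension.

    w_mean and w_std are hypothetical weight matrices with one row per action
    dimension; a real actor would learn them during training.
    """
    means, stds = [], []
    for row_m, row_s in zip(w_mean, w_std):
        pre_mean = sum(w * f for w, f in zip(row_m, features))
        pre_std = sum(w * f for w, f in zip(row_s, features))
        # tanh bounds the mean to (-1, 1); scaling maps it to the action range.
        means.append(upper_limit * math.tanh(pre_mean))
        # softplus plus a small floor keeps the standard deviation strictly positive.
        stds.append(softplus(pre_std) + min_std)
    return means, stds


def sample_action(means, stds, rng):
    """Draw one action from independent Gaussians, one per action dimension."""
    return [rng.gauss(m, s) for m, s in zip(means, stds)]


def log_prob(action, means, stds):
    """Log-density of an action under the Gaussian policy (used in the PPO ratio)."""
    return sum(
        -0.5 * ((a - m) / s) ** 2 - math.log(s) - 0.5 * math.log(2.0 * math.pi)
        for a, m, s in zip(action, means, stds)
    )


# Illustrative values only: 3 shared features, 2 continuous action dimensions.
num_act = 2
features = [0.5, -1.0, 0.25]
w_mean = [[0.1, 0.2, -0.3], [0.4, -0.1, 0.0]]
w_std = [[0.0, 0.1, 0.2], [-0.2, 0.3, 0.1]]

means, stds = gaussian_actor_head(features, w_mean, w_std, upper_limit=2.0)
action = sample_action(means, stds, random.Random(0))
```

In this sketch the number of actions still matters, but it sets the number of mean and standard deviation outputs (2 × `num_act` values in total) rather than the number of discrete choices.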

