How Does MATLAB Internally Format Actions as dlarray in DDPG with Recurrent Networks (LSTM)?

Question

Farid on 10 Mar 2025


https://es.mathworks.com/matlabcentral/answers/2174967-how-does-matlab-internally-format-actions-as-dlarray-in-ddpg-with-recurrent-networks-lstm

Commented: Farid on 13 Mar 2025

In MATLAB's Reinforcement Learning Toolbox, when using DDPG with LSTM-based actors/critics, the conversion of actions to dlarray is handled automatically. Since users cannot directly control this process:

Are actions formatted with 'T' (time) or 'C' (channel) dimensions when passed between the actor and critic networks?

How does MATLAB structure actions for compatibility with recurrent layers (e.g., aligning sequences for LSTM time steps)?


Answer 1

praguna manvi on 13 Mar 2025


https://es.mathworks.com/matlabcentral/answers/2174967-how-does-matlab-internally-format-actions-as-dlarray-in-ddpg-with-recurrent-networks-lstm#answer_1561724


Hi @Farid,

In the "getAction" and "getValue" functions of the actor and critic, respectively, the inputs (observations) are reshaped and labeled in "CBT" format (channel, batch, time) when the network takes sequence inputs, for example when it contains an "lstm" layer. This puts the data in the format that such networks expect. To explore this further, you can open the example below:

openExample('rl/CreateDDPGAgentUsingRecurrentNeuralNetworksExample')

Stepping into these functions while running the example gives more insight into how the data is structured and processed within these networks.
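
To make the "CBT" layout mentioned above concrete, here is a minimal sketch that does not call any Reinforcement Learning Toolbox internals. It only builds a labeled dlarray by hand with Deep Learning Toolbox functions. The sizes and variable names (numActions, batchSize, seqLength, rawAction, action) are assumptions chosen for illustration, and the answer above only confirms the "CBT" labeling for inputs/observations, so treating an action the same way is an assumption too.

```matlab
% Hypothetical sizes, chosen only for illustration.
numActions = 2;   % channel dimension (C)
batchSize  = 1;   % batch dimension (B)
seqLength  = 5;   % time dimension (T)

% Plain numeric array laid out as channels x batch x time.
rawAction = rand(numActions, batchSize, seqLength);

% Attach dimension labels so that sequence layers can tell
% which dimension is time and which is channel.
action = dlarray(rawAction, "CBT");

disp(dims(action))            % dimension labels of the dlarray
disp(size(action))            % size along each labeled dimension
tIdx = finddim(action, "T");  % index of the time dimension
disp(tIdx)
```

In this layout each column along the third dimension is one time step of the sequence, and the first dimension holds the individual action channels.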


Farid on 13 Mar 2025

Thank you for your time and your help.
