The definition of the Target update frequency in Reinforcement Learning Designer.

Question

Xian Zheng Hong el 7 de Mzo. de 2024

0
Enlazar

Enlace directo a esta pregunta

https://es.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer

Comentada: Xian Zheng Hong el 16 de Mzo. de 2024

Respuesta aceptada: UDAYA PEDDIRAJU

In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.

The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.

Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

UDAYA PEDDIRAJU el 12 de Mzo. de 2024

1
Enlazar

Enlace directo a esta respuesta

https://es.mathworks.com/matlabcentral/answers/2091631-the-definition-of-the-target-update-frequency-in-reinforcement-learning-designer#answer_1424086

Hi Xian,

No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Xian Zheng Hong el 16 de Mzo. de 2024

Thanks for answering. Here is my another question.

Are the Online policy and Online Q updated at every time step in Reinforcement Learning Designer Toolbox?

Iniciar sesión para comentar.

The definition of the Target update frequency in Reinforcement Learning Designer.

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Community Treasure Hunt

The definition of the Target update frequency in Reinforcement Learning Designer.

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

1 comentario Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos