ExperienceBufferLength in Reinforcement Learning Toolbox

qun wang on 15 Nov 2021

https://es.mathworks.com/matlabcentral/answers/1587044-experiencebufferlength-in-reinforcement-learning-toolbox

Commented: Francisco Serra on 2 May 2024

Hello, everyone,

I have a question about the 'ExperienceBufferLength' property in 'rlDDPGAgentOptions', which is used when specifying options for RL agents.

This property is usually set to 1e6 in the examples in the Help documentation, such as here.

In this example, every episode has 600 (60/0.1) steps. Does the agent start to train only once the experience buffer has been filled with experiences (S,A,R,S')? If so, it would take at least 1667 (1000000/600) episodes before the agent starts to improve.

So I want to know how to choose this value.

Accepted Answer

Ari Biswas on 17 Nov 2021

The agent does not wait for the buffer to fill: it starts training as soon as at least one minibatch can be sampled from the buffer. If your minibatch size is 64, the first learning step occurs after the buffer has stored 64 experiences. The experience buffer is circular, i.e., it overwrites the oldest experiences when full. The buffer size therefore matters: if it is too small, you may lose important experiences.
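As a rough check of the numbers in this thread, the following base-MATLAB sketch compares when the first learning step can happen with how long the buffer takes to fill, and then illustrates circular overwriting on a toy array. The variable names and the 5-slot toy buffer are illustrative assumptions only; this is not how the toolbox implements its buffer internally.

```matlab
% Values taken from the thread; miniBatchSize = 64 is the answer's example.
bufferLength    = 1e6;            % ExperienceBufferLength
miniBatchSize   = 64;             % MiniBatchSize
stepsPerEpisode = round(60/0.1);  % simulation time / sample time = 600 steps

% Learning can begin as soon as one minibatch is available.
firstLearnEpisode = ceil(miniBatchSize/stepsPerEpisode);

% Episodes needed before the buffer is full and starts overwriting.
episodesToFill = ceil(bufferLength/stepsPerEpisode);

fprintf('First learning step in episode %d; buffer full after %d episodes.\n', ...
    firstLearnEpisode, episodesToFill);

% Toy illustration of a circular buffer (hypothetical, 5 slots).
toyLength = 5;
toyBuffer = zeros(1, toyLength);
for k = 1:7
    slot = mod(k-1, toyLength) + 1;  % wrap around once the buffer is full
    toyBuffer(slot) = k;             % newest experience replaces the oldest
end
disp(toyBuffer)  % slots 1 and 2 now hold experiences 6 and 7
```

With these values, learning starts within the first episode, while the buffer only begins discarding old experiences after roughly 1667 episodes.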

Arman Ali on 27 Sep 2022

What if we want to fill the buffer first and only then start sampling minibatches? How can this be implemented in MATLAB?

Francisco Serra on 2 May 2024

For that you can set:

agent.AgentOptions.NumWarmStartSteps=experience_buffer_length

By default, this is set to the minibatch size, but setting it to the experience buffer length forces the algorithm to wait until the buffer is full before it starts learning.
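For context, a minimal sketch of the same setting applied to an options object before the agent is created. It assumes Reinforcement Learning Toolbox is installed and that your release exposes NumWarmStartSteps on rlDDPGAgentOptions, as the comment above indicates. The sample time of 0.1 and minibatch size of 64 are simply the values mentioned earlier in this thread.

```matlab
% Requires Reinforcement Learning Toolbox. NumWarmStartSteps is assumed
% to be available in your release (check the rlDDPGAgentOptions doc page).
bufferLength = 1e6;

agentOpts = rlDDPGAgentOptions( ...
    'SampleTime', 0.1, ...
    'MiniBatchSize', 64, ...
    'ExperienceBufferLength', bufferLength);

% Delay the first learning step until bufferLength experiences are stored.
agentOpts.NumWarmStartSteps = bufferLength;
```

Note that with 600 steps per episode this means roughly 1667 episodes of pure data collection before any learning happens, so a smaller warm-start value may be a more practical compromise.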
