Saving Trained RL Agent after Training

Question

PB75 el 29 de Abr. de 2021

0
Enlazar

Enlace directo a esta pregunta

https://es.mathworks.com/matlabcentral/answers/816585-saving-trained-rl-agent-after-training

Comentada: Zaid Jaber el 14 de Nov. de 2023

Respuesta aceptada: Emmanouil Tzorakoleftherakis

Hi All,

I trained a RL agent, the environment output was acceptable, my plan was to initially validate the agent in the simulation after training finished with the following code.

As i was concerned that I would restart training on the agent when I ran the script to run the 'sim' function, my IsDone flag in the simulation was manually set to 1 (previously 0 to permit training) and additionally commented out the 'training' function.

%trainingStats = train(agentSS,env,trainingOpts)
rng(0) 
simOptions = rlSimulationOptions('MaxSteps',maxsteps);
experience = sim(env,agentSS,simOptions);

There was no ouput from the simulation, with no warnings, I then reset the IsDone flag back to 0, and reran the script, now the ouput was 0 on all scopes.

Did I lose the trained agent data when I set the IsDone flag to 1 after training?.

My next step was to try to save the trained agent with adding the following code found in the documentation, but still joy. My thoughts are I have overwritten and lost the trained data!

save("initialAgent.mat","agentSS")
load('initialAgent.mat')
rng(0) 
simOptions = rlSimulationOptions('MaxSteps',maxsteps);
experience = sim(env,agentSS,simOptions);

How can I add code to ensure the trained agent data is saved automatically via 'RLTrainingOptions' after training has been completed, such as when maxepisodes are reached? Do not want to make the same mistake.

Is this correct?

trainingOpts = rlTrainingOptions(...
    'MaxEpisodes',maxepisodes, ...
    'MaxStepsPerEpisode',maxsteps, ...
    'StopTrainingCriteria','AverageReward',...
    'StopTrainingValue',-100,... 
    'ScoreAveragingWindowLength',100,...
    'SaveAgentCriteria',"EpisodeCount",...
    'SaveAgentValue',maxepisodes,...
    'SaveAgentDirectory',"savedAgents")

Thanks

Patrick

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Emmanouil Tzorakoleftherakis el 29 de Abr. de 2021

0
Enlazar

Enlace directo a esta respuesta

https://es.mathworks.com/matlabcentral/answers/816585-saving-trained-rl-agent-after-training#answer_687975

Editada: Emmanouil Tzorakoleftherakis el 29 de Abr. de 2021

Setting the IsDone flag to 1 does not erase the trained agent - it actually makes sense that the sim was not showing anything because it was immediately stopped by the IsDone flag.

To save the final agent, simply add the save command you have right after when you call 'train'.

My guess is that when you reran the whole script, you created a new agent from scratch and saved it again to a mat file, which replaced the already trained agent. This is why it's good practive to always have sections in your (live) script, so that you can pick exactly what lines you want to run.

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Apoorv Pandey el 27 de Feb. de 2023

How to save multiple agents like in this example

https://in.mathworks.com/help/reinforcement-learning/ug/train-agents-for-path-following.html

Zaid Jaber el 14 de Nov. de 2023

@Emmanouil Tzorakoleftherakis

Hi

How i can use that file to resume training with more number of episodes ?

Iniciar sesión para comentar.

Saving Trained RL Agent after Training

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Saving Trained RL Agent after Training

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

3 comentarios Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo