Receiving different training results while running the same code
I ran the training of my RL model but forgot to save it, so I thought I would run the same script again,
but I am getting slightly different results.
Shouldn't I get the same training results?
Also, what is the relation between the different sample times, e.g. of the actor and the agent?
3 Comments
Emmanouil Tzorakoleftherakis
on 2 Jun 2023
Can you clarify what you mean by different sample times between agent and actor? Each agent has its own sample time, which indicates how often you want to do inference/get an action. Did you see a different sample time option used for the actor?
Sourabh
on 2 Jun 2023
Emmanouil Tzorakoleftherakis
on 5 Jun 2023
Max steps will depend on your agent sample time. If it's 100, it means that the total episode duration will be 100*ts, where ts is the agent sample time.
Also, a smaller sample time does not necessarily mean better control. As a rule of thumb, your sample time should only be as small as needed to get good results; going smaller than that wastes computational resources.
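To make the relation between max steps and sample time concrete, here is a minimal sketch in plain MATLAB. The numeric values are hypothetical and chosen only for illustration; in Reinforcement Learning Toolbox the two quantities would typically correspond to the agent options' SampleTime property and the MaxStepsPerEpisode option of rlTrainingOptions.

```matlab
% Hypothetical values, for illustration only
ts = 0.25;        % agent sample time in seconds (assumed)
maxSteps = 100;   % maximum number of steps per episode

% Total simulated time covered by one full-length episode
episodeDuration = maxSteps * ts;   % 100 * 0.25 = 25 seconds

% Conversely, the number of steps needed to cover a desired duration Tf
Tf = 20;                      % desired episode duration in seconds (assumed)
stepsNeeded = ceil(Tf / ts);  % 20 / 0.25 = 80 steps

fprintf('Episode duration: %g s\n', episodeDuration);
fprintf('Steps needed for %g s: %d\n', Tf, stepsNeeded);
```

Halving ts while keeping maxSteps fixed halves the simulated time per episode, so the two settings are usually chosen together.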