Custom environment in Deep reinforcement learning

Bha Pr on 1 Apr 2020
Commented: SULAKSHNA DEVI on 13 May 2020
I am currently trying to build a custom environment for a deep reinforcement learning implementation. My environment has 4 states (low, med, high, severe), represented by 1, 2, 3, 4 respectively, and the available actions are 1, 2, 3. Rewards are decided on the basis of context such as temperature, pressure, and humidity, which vary with time. How can I define a reward that changes with time in my step function?

Answers (1)

Ari Biswas on 20 Apr 2020
One way to solve this is by introducing a property to keep track of elapsed time in your custom MATLAB environment. You can use this property to compute rewards and increment this as needed in the step function.
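As a minimal sketch of that idea, assuming the class-based approach where the environment derives from rl.env.MATLABEnvironment: the class name ContextEnv, the properties ElapsedSteps and MaxSteps, the sinusoidal temperature signal, the reward rule, and the random state transition are all hypothetical placeholders, not part of the original question. Replace them with your own context data and dynamics.

```matlab
classdef ContextEnv < rl.env.MATLABEnvironment
    % Hypothetical sketch: 4 severity states, 3 actions, time-varying reward.

    properties
        State = 1         % 1 = low, 2 = med, 3 = high, 4 = severe
        ElapsedSteps = 0  % elapsed time, counted in calls to step
        MaxSteps = 50     % assumed episode length (placeholder)
    end

    methods
        function this = ContextEnv()
            % Discrete observation and action sets taken from the question.
            obsInfo = rlFiniteSetSpec([1 2 3 4]);
            actInfo = rlFiniteSetSpec([1 2 3]);
            this = this@rl.env.MATLABEnvironment(obsInfo, actInfo);
        end

        function [obs, reward, isDone, loggedSignals] = step(this, action)
            % Advance the clock first so the reward sees the current time.
            this.ElapsedSteps = this.ElapsedSteps + 1;

            % Placeholder context: a temperature that varies with time.
            temperature = 25 + 10*sin(2*pi*this.ElapsedSteps/24);

            % Placeholder reward rule: the preferred action depends on the
            % context, so the same action earns different rewards over time.
            if temperature > 30
                reward = -abs(action - 3);
            else
                reward = -abs(action - 1);
            end
            reward = reward - 0.1*this.State;  % penalize severe states

            % Placeholder dynamics: random walk clipped to the range 1..4.
            this.State = min(4, max(1, this.State + randi([-1 1])));

            obs = this.State;
            isDone = this.ElapsedSteps >= this.MaxSteps;
            loggedSignals.ElapsedSteps = this.ElapsedSteps;
        end

        function initialObs = reset(this)
            % Restart the clock at the beginning of every episode.
            this.ElapsedSteps = 0;
            this.State = 1;
            initialObs = this.State;
        end
    end
end
```

Save this as ContextEnv.m, create the environment with env = ContextEnv, and check it with validateEnvironment(env) before training. If you build the environment from standalone step and reset functions with rlFunctionEnv instead, the same counter could probably be carried in the LoggedSignals structure that is passed into and returned from the step function.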
1 Comment
SULAKSHNA DEVI on 13 May 2020
Does "property" here refer to a function? Can you please explain this?
