Custom environment in Deep reinforcement learning

Bha Pr

1 Abr. 2020

1 Respuesta

Actualizado a las 13 Mayo 2020

4 Visualizaciones (30 días)

Iniciar sesión para responder a esta pregunta.

Follow Question

Iniciar sesión para responder a esta pregunta.

Follow Question

Mostrar comentarios más antiguos

0 votos

I am currently trying to buid to a custom environment for the implementation of deep reinforcement learning. My considered environment has 4 states low, med, high, severe represented by 1,2,3,4 respectively and the actions to be taken are 1,2,3 and rewards are decided on the basis of context like temperature, pressure,humidity which varies with time. So how i can define my reward that changes with time in mystepfunction?

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question

Respuestas (1)

Ari Biswas el 20 de Abr. de 2020

0 votos

One way to solve this is by introducing a property to keep track of elapsed time in your custom MATLAB environment. You can use this property to compute rewards and increment this as needed in the step function.