How do I define a continuous reward function for RL environment?

Prashanth Chivkula

5 Oct. 2020

1 Respuesta

Respuesta aceptada

Actualizado a las 12 Oct. 2020

7 Visualizaciones (30 días)

Iniciar sesión para responder a esta pregunta.

Follow Question

Iniciar sesión para responder a esta pregunta.

Follow Question

Mostrar comentarios más antiguos

0 votos

I am trying to follow the double integrator example for giving a continuous reward function. When I used the custom template, and defined the reward using the QR cost function, I get an error stating that the reward should be a scalar value. Where can I find the property of reward and change it to accept vector values?

3 comentarios
Mostrar 1 comentario más antiguo Ocultar 1 comentario más antiguo

Prashanth Chivkula el 12 de Oct. de 2020

Yes I did that, thank you, Just to confirm the output of the cost function will always be a scalar value, right? So in the double integrator continuous example there are two states but the output reward at each step is a scalar value, right?