photo

Sayak Mukherjee


Last seen: alrededor de 2 años hace Con actividad desde 2020

Followers: 0   Following: 0

Programming Languages:
Python, MATLAB
Spoken Languages:
Bengali, English, Hindi

Estadística

  • Revival Level 1
  • Thankful Level 1

Ver insignias

Feeds

Ver por

Pregunta


Mirror symmetry in actions in reinforcement learning
I am training a RL control problem to perforem neck kinematics. I want the action space to have mirror symmetry as explained in ...

alrededor de 2 años hace | 0 respuestas | 0

0

respuestas

Pregunta


Control the exploration in soft actor-critic
What is the best way to control the exploration in SAC agent. For TD3 agent I used to control the exploration by adjusting the v...

más de 2 años hace | 1 respuesta | 1

1

respuesta

Pregunta


Reinforcement learning agent not being saved during training
I am trying to train my model using TD3 agent. During the training process I am trying to save the agent above a certain episode...

casi 3 años hace | 1 respuesta | 0

1

respuesta

Pregunta


Dont need to save 'savedAgentResultStruct' with RL agent
When I am saving agents during RL iterations using 'EpisodeReward' criteria, matlab is also saving 'savedAgentResultStruct' alon...

casi 4 años hace | 0 respuestas | 0

0

respuestas

Pregunta


Change revolute joint parameter in env.ResetFcn during reinforcement learning
What is the best way to randomize the initial revolute joint angle during eacg episode of reinforcement learning right now I am...

alrededor de 4 años hace | 0 respuestas | 0

0

respuestas

Pregunta


What is the best activation function to get action between 0 and 1 in DDPG network?
I am using DDPG network to run a control algorithm which has inputs (actions of RL agent, 23 in total) varying between 0 and 1. ...

alrededor de 4 años hace | 1 respuesta | 0

1

respuesta

Pregunta


Expected reward blows up while training (DDPG agent, reinforcement learning)
I am training a DDPG network and after training for around 5000 iterations, the model seems doesnot seem to converge while the e...

alrededor de 4 años hace | 1 respuesta | 0

1

respuesta

Pregunta


Use saved reinforcement learning DDPG agent
I have saved DDPG agent using the optiopn rlTrainingOptions.SaveAgentValue = 3000 During the simulations number of agents are ...

alrededor de 4 años hace | 1 respuesta | 0

1

respuesta