Answered
reinforcement learning PMSM-code
You can find the example here.

almost 5 years ago | 0

| accepted

Answered
How to deal with a large number of state and action spaces?
Even if the NX3 inputs are scalars, I would reorganize them into an "image" and use imageInput layer for the first layer as oppo...

almost 5 years ago | 0

Answered
Q learning algorithm in image processing using matlab.
Hello, Finding an example that exactly matches what you need to do may be challenging. If you are looking for the "deep learnin...

almost 5 years ago | 0

| accepted

Answered
Need help with Model based RL
Hello, If you want to use the existing C code to train with Reinforcement Learning Toolbox, I would use the C caller block to b...

almost 5 years ago | 1

| accepted

Answered
How to set the reinforcement learning block in Simulink to output 9 actions
Hello, the example you are referring to does not output 3 values for the pid gains. The PID gains are "integrated" into the neu...

almost 5 years ago | 0

Answered
Where to update actions in environment?
Reinforcement Learning Toolbox agents expect a static action space, so fixed number of options at each time step. To create a dy...

almost 5 years ago | 0

Answered
How to check the weight and bias which taked by getLearnableParameters?
Can you provide some more details? What does 'wrong answer' mean? How do you know the weights you are seeing are not correct? Ar...

almost 5 years ago | 0

Answered
Gradient in RL DDPG Agent
If you put a break point right before 'gradient' is called in this example, you can step in and see the function implementation....

almost 5 years ago | 0

| accepted

Answered
Soft Actor Critic deploy mean path only
Hello, Please take a look at this option here which was added in R2021a to allow exactly the behavior you mentioned. Hope this...

almost 5 years ago | 0

| accepted

Answered
How to pretrain a stochastic actor network for PPO training?
Hello, Since you already have a dataset, you will have to use Deep Learning Toolbox to get your initial policy. Take a look at ...

almost 5 years ago | 1

Answered
Failure in training of Reinforcement Learning Reinforcement Learning Onramp
Hello, We are aware and working to fix this issue. In the meantime, can you take a look at the following answer? https://www....

about 5 years ago | 0

Answered
DQN Agent with 512 discrete actions not learning
I would initially revisit the critic architecture for 2 reasons: 1) Network seems a little simple for a 3->512 mapping 2) This...

about 5 years ago | 0

Answered
How does the Q-Learning update the qTable by using the reinforcement learning toolbox?
Can you try critic.Options.L2RegularizationFactor=0; This parameter is nonzero by default and likely the reason for the discre...

about 5 years ago | 0
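
To illustrate why a nonzero L2 regularization factor could make a table entry differ from a hand calculation, here is a minimal Python sketch of a tabular Q-learning update with an optional weight-decay term. The function name `q_update` and the way the decay term enters the update are assumptions made for illustration; the snippet above does not show how the toolbox applies the factor internally.

```python
def q_update(q_sa, reward, max_q_next, alpha, gamma, l2_factor=0.0):
    """One tabular Q-learning step with an optional L2 (weight-decay) term.

    ASSUMPTION: the regularizer is modeled as 0.5 * l2_factor * q_sa**2,
    whose gradient l2_factor * q_sa is applied with the same learning rate.
    """
    target = reward + gamma * max_q_next    # bootstrapped TD target
    td_step = alpha * (target - q_sa)       # textbook Q-learning step
    decay_step = alpha * l2_factor * q_sa   # extra pull toward zero
    return q_sa + td_step - decay_step


# Hand calculation (no regularization) versus a regularized update.
plain = q_update(1.0, reward=1.0, max_q_next=2.0, alpha=0.1, gamma=0.9)
decayed = q_update(1.0, reward=1.0, max_q_next=2.0, alpha=0.1, gamma=0.9,
                   l2_factor=1e-4)
print(plain, decayed)
```

With the factor set to zero the update reduces to the textbook rule, which is what a hand calculation would normally use.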

Answered
File size of saved reinforcement learning agents
Hello, Is this parameter set to true? If yes, then it makes sense that mat files are growing in size as the buffer is being pop...

about 5 years ago | 0

| accepted

Answered
Saving Trained RL Agent after Training
Setting the IsDone flag to 1 does not erase the trained agent - it actually makes sense that the sim was not showing anything be...

about 5 years ago | 0

| accepted

Answered
How to Train Multiple Reinforcement Learning Agents In Basic Grid World? (Multiple Agents)
Training multiple agents simultaneously is currently only supported in Simulink. The predefined Grid World environments in Reinf...

about 5 years ago | 1

| accepted

Answered
How to create a neural network for Multiple Agent with discrete and continuous action?
If you want to specify the neural network structures yourself, there is nothing specific you need to do - simply create two acto...

about 5 years ago | 0

| accepted

Answered
Is it possible apply Reinfocrement Learning to classify data?
If you already have a labeled dataset, supervised learning is the way to go. Reinforcement learning is more for cases where data...

about 5 years ago | 0

| accepted

Answered
Combining two deep neural networks to train simultaneously
Hello, You can do this in Simulink - see the following examples for reference. https://www.mathworks.com/help/reinforcement-l...

about 5 years ago | 1

| accepted

Answered
DQN learns at first but then worsens.
To confirm that this is an exploration issue, can you try setting the EpsilonMin param to a high value? e.g. 0.99. If after doin...

about 5 years ago | 0
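
The exploration check suggested above can be pictured with a small Python sketch of an epsilon schedule. It assumes a multiplicative decay rule clamped at a floor; the function name `epsilon_schedule` and the numeric values are hypothetical and chosen only to show the effect of a high floor such as 0.99.

```python
def epsilon_schedule(epsilon0, epsilon_min, decay, steps):
    """Return the exploration rate used at each of `steps` time steps.

    ASSUMPTION: epsilon shrinks by a factor (1 - decay) per step and is
    never allowed to drop below epsilon_min.
    """
    eps = epsilon0
    history = []
    for _ in range(steps):
        history.append(eps)
        if eps > epsilon_min:
            eps = max(eps * (1.0 - decay), epsilon_min)
    return history


# A high floor keeps the agent exploring almost all the time.
high_floor = epsilon_schedule(1.0, epsilon_min=0.99, decay=0.005, steps=1000)
# A low floor lets the agent become almost fully greedy.
low_floor = epsilon_schedule(1.0, epsilon_min=0.01, decay=0.005, steps=1000)
print(min(high_floor), low_floor[-1])
```

If performance stops degrading once the floor is raised, that points to too little late-stage exploration as the likely cause.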

Answered
How to resume train a trained agent?about Q learning agents.
Hello, To see how to view the table values, take a look at the answer here. Also, you don't have to do anything specific to con...

about 5 years ago | 1

| accepted

Answered
Reinforcement learning action getting saturated at one range of values
Your scaling layer is not set up correctly. You want to scale to (upper limit-lower limit)/2 and then shift accordingly. scali...

about 5 years ago | 1

| accepted
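
The scale-and-shift rule in the answer above can be sketched numerically in Python. The sketch assumes the actor's last nonlinearity is a tanh with outputs in [-1, 1]; the helper names `scale_and_bias` and `bounded_action` are hypothetical and are not toolbox functions.

```python
import math


def scale_and_bias(lower, upper):
    """Return (scale, bias) mapping a value in [-1, 1] onto [lower, upper]."""
    if upper <= lower:
        raise ValueError("upper must be greater than lower")
    scale = (upper - lower) / 2.0  # half-width of the action range
    bias = (upper + lower) / 2.0   # midpoint of the action range
    return scale, bias


def bounded_action(pre_activation, lower, upper):
    """Squash with tanh (ASSUMED final activation), then scale and shift."""
    scale, bias = scale_and_bias(lower, upper)
    return scale * math.tanh(pre_activation) + bias


# For an action range of [0, 10] the scale is 5 and the shift is 5.
print(scale_and_bias(0.0, 10.0))
print(bounded_action(0.0, 0.0, 10.0))
```

Scaling by the half-width without the midpoint shift would center the output on zero, so part of an asymmetric range would be unreachable and the action would appear saturated.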

Answered
How can I provide constraints to the actions provided by the Reinforcement Learning Agent?
Hard constraints are not typically supported during training in RL. You can specify limits/constraints as you mention above, but...

about 5 years ago | 0

| accepted

Answered
Exporting data only works as pdf. Axis labels are getting small and unreadable
You cannot save as .fig from the episode manager plot. If you have the training data though (it's good practice to save this dat...

about 5 years ago | 1

| accepted

Answered
Reinforcement Learning multiple agent validation: Can I have a Simulink model host TWO agents and test them
That should be possible. Did you follow the multi-agent examples? Since the agents are trained already you may want to check the...

about 5 years ago | 0

| accepted

Answered
Do the actorNet and criticNet share the parameter if the layers have the same name?
No, each network has its own parameters. Shared layers are not supported out of the box, you would have to implement custom trai...

about 5 years ago | 0

| accepted

Answered
Any RL Toolbox A3C example?
Hello, To get an idea of what an actor/critic architecture may look like, you can use the 'default agent' feature that creates ...

about 5 years ago | 1

| accepted

Answered
After training my DDPG RL agent and saving it, unexpected simulation output
See answer here

about 5 years ago | 0

| accepted

Answered
Saved agent always gives constant output no matter how or how much I train it
The problem formulation is not correct. I suspect that even during training, you are seeing a lot of bang-bang actions. The bigg...

about 5 years ago | 1

| accepted

Answered
How can I create a Reinforcement Learning Agent representation based on Recurrent neural network (RNN, LSTM, among others)
Hello, Which release are you using? R2020a and R2020b support LSTM policies for PPO and DQN agents. Starting in R2021a you can ...

about 5 years ago | 2

| accepted