Why doesn't the Artificial Neural Network stop training at the point where it achieves the best performance indicated by the lowest MSE on the performance plot?

Question

Sunita el 22 de Dic. de 2023

0
Enlazar

Enlace directo a esta pregunta

https://es.mathworks.com/matlabcentral/answers/2063317-why-doesn-t-the-artificial-neural-network-stop-training-at-the-point-where-it-achieves-the-best-perf

Editada: Matt J el 25 de Dic. de 2023

Why doesn't the Artificial Neural Network stop training at the point where it achieves the best performance indicated by the lowest Mean Squared Error (performance) on the performance plot?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Ganesh el 22 de Dic. de 2023

1
Enlazar

Enlace directo a esta respuesta

https://es.mathworks.com/matlabcentral/answers/2063317-why-doesn-t-the-artificial-neural-network-stop-training-at-the-point-where-it-achieves-the-best-perf#answer_1376422

Editada: Ganesh el 24 de Dic. de 2023

Hi @Sunita,

The ANN cannot stop at the lowest point, because at the time of training there is no lowest point. Ideally, the lowest loss that a model can achieve is 0. However, during training, the model has no idea of whether the loss will increase or decrease further down the line.

A model training works in the following way:

The model starts off with initial weights.
The model makes a prediction and assesses the loss attained for a data point.
The model corrects the weight according to the loss attained.
The model makes a prediction using the corrected weight and assesses the loss for next data point.
The process continues until all data points are exhausted.
The performance plot shows the loss attained at each step.

The model thus, has no way of knowing it’s performance on data it has not seen.

It is to be noted that the lowest mean squared error does not mean that the model is performing the best at that point. It means that the model performs well for the data it has already seen. Allowing it to train over more data would ensure that the model makes accurate predictions for unseen data.

Hope this answer helps!

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Sunita el 23 de Dic. de 2023

Thank you for your response. My question pertains to the following data obtained during model training:

The model stopped at epoch 22 with an MSE of 0.0242
The best performance was observed at epoch 16 with an MSE of 0.031
However, the final MSE between actual and predicted values is 0.0308

Shouldn't the third data align with the best performance (second) value ?

Matt J el 25 de Dic. de 2023

Editada: Matt J el 25 de Dic. de 2023

The best performance was observed at epoch 16 with an MSE of 0.031

An MSE of 0.031 is worse than an MSE of 0.0242. MSE is something you want to minimize.

Shouldn't the third data align with the best performance (second) value ?

No, see here:

https://www.mathworks.com/matlabcentral/answers/2063317-why-doesn-t-the-artificial-neural-network-stop-training-at-the-point-where-it-achieves-the-best-perf#comment_3007012

Iniciar sesión para comentar.

Why doesn't the Artificial Neural Network stop training at the point where it achieves the best performance indicated by the lowest MSE on the performance plot?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Ver también

Categorías

Etiquetas

Community Treasure Hunt

Why doesn't the Artificial Neural Network stop training at the point where it achieves the best performance indicated by the lowest MSE on the performance plot?

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

3 comentarios Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo

Ver también

Categorías

Etiquetas

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

3 comentarios
Mostrar 1 comentario más antiguoOcultar 1 comentario más antiguo