Why does training perfomance change when a validation set is considered?

Question

0 votos

Hello!

This question is related with this http://www.mathworks.com/matlabcentral/answers/49140-is-validation-set-being-used-for-training-in-nn.

For example, I considered the input and output:

input=1:1:10 output=[1:2:15 24 24]

and then I try 3 different options:

OPTION 1 rand('twister',1) net = feedforwardnet(4); net.trainParam.epochs =3; net.divideFcn='divideind'; [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(10,1:10); [net,tr,Y1,E1] = train(net,input,output);

OPTION 2 rand('twister',1) net = feedforwardnet(4); net.trainParam.epochs =3; net.divideFcn='divideind'; [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(10,1:8,9:10); %net.divideParam.trainRatio=1;net.divideParam.valRatio=0;net.divideParam.testRatio=0; [net,tr,Y1,E1] = train(net,input,output);

OPTION 3 rand('twister',1) net = feedforwardnet(4); net.trainParam.epochs =3; net.divideFcn='divideind'; [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(8,1:8); [net,tr,Y1,E1] = train(net,input(:,1:8),output(:,1:8));

The initialisations are similar, the all 3 options stopped because they reached the maximum epoch. I checked epoch=0 and the weights and bias are similar but the (training) performance isn't. And from epoch=0, everything is different when comparing the 3 options. If I don't change divideFcn and I consider the same experiments as before, using the same indices for training, I have the same problem. So it isn't because of divideind! I'd like to understand why this is happening. I checked the functions step by step. Could anyone help me? Thank you very much. Ana

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Greg Heath el 5 de Oct. de 2012

I took a prelimiary look. Something subtle is going on.

1. Option 1 is irrelevant.

2. I chose Nepochs = 1 and and rng(0) initialization.

3. The final weights for Options 2 & 3 are different (They shouldn't be).

I'll be baahk.

Aahnold.

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question

Answer 1

Greg Heath el 28 de Nov. de 2012

0 votos

The difference in the last two results was completely caused by using

1) ... = train(net,input(:,1:8),output(:,1:8));

instead of

2) ... = train(net,input,output);

Verification: For each of these 2 syntaxes I ran 3 trials for one epoch with

a. divideind(10,1:8,9:10);

b. divideind(10,1:8);

c. divideind(8,1:8);

For each syntax the 3 trials yielded identical results.

The reason why probably lies in the code of train:

type train

Hope this helps.

Thank you for officially accepting my answer.

Greg

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Answer 2

Zeeshan el 27 de Nov. de 2012

0 votos

Hi,

I think because the data is divided randomly to check for validation of model, therefore some network may get trained better than the other because it was trained on a different set of data (randomly chosen training data).

I am also working on a comparison of architectures and I am going to fix the time points for each dataset for training and validation to compare them.

Regards,

Shan

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Greg Heath el 28 de Nov. de 2012

Incorrect. See my answer below.

Greg

Iniciar sesión para comentar.

Why does training perfomance change when a validation set is considered?

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Respuesta aceptada

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Más respuestas (1)

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Categorías

Productos

Etiquetas

Community Treasure Hunt

Why does training perfomance change when a validation set is considered?

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Respuesta aceptada

0 comentarios Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Más respuestas (1)

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Categorías

Productos

Etiquetas

Ver también

Community Treasure Hunt

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos