
Different training results for neural network when using full dataset versus partial dataset

2 views (last 30 days)
I'm training a network using 'narxnet' and 'train'.
My training data is a part of a larger dataset. These are the two scenarios in which I get different results.
  1. Trim the dataset so the entire input data is the training data. 'trainInd' = the entire dataset; no validation or test indices are provided
  2. Use the entire dataset, but specify the training data by 'trainInd' (using the indices of the exact data from scenario 1); no validation or test indices are provided
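A minimal sketch of the two scenarios (the variable names `X`, `T`, and `trainIdx`, plus the delay and hidden-layer sizes, are placeholders, not taken from the original post):

```matlab
% Scenario 1: trim the dataset so the entire input is the training data.
Xtrim = X(trainIdx);
Ttrim = T(trainIdx);
net1 = narxnet(1:2, 1:2, 10);            % example delays / hidden size
net1.divideFcn = 'divideind';
net1.divideParam.trainInd = 1:numel(trainIdx);
net1.divideParam.valInd   = [];          % no validation indices
net1.divideParam.testInd  = [];          % no test indices
[Xs, Xi, Ai, Ts] = preparets(net1, Xtrim, {}, Ttrim);
net1 = train(net1, Xs, Ts, Xi, Ai);

% Scenario 2: keep the full dataset but restrict training by index.
net2 = narxnet(1:2, 1:2, 10);
net2.divideFcn = 'divideind';
net2.divideParam.trainInd = trainIdx;    % same samples as scenario 1
net2.divideParam.valInd   = [];
net2.divideParam.testInd  = [];
[Xs, Xi, Ai, Ts] = preparets(net2, X, {}, T);
net2 = train(net2, Xs, Ts, Xi, Ai);
```

Note that `preparets` drops the first samples to fill the tapped delay lines, so indices in scenario 2 are shifted relative to the raw data by the number of delays, which is the adjustment the question mentions trying.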
The training terminates under the same conditions, and I'm using the same data, but I get different results. I've also experimented with shifting the training-data indices in scenario 2 to account for the number of delays specified, with no luck.
Does anyone have any insight into what might be causing this? (I'm aware of the issues with not specifying validation data; I'm just trying to replicate the behavior at the moment.)

Accepted Answer

Mrutyunjaya Hiremath
Mrutyunjaya Hiremath on 21 Jul 2023
The difference in results between scenario 1 and scenario 2 could be due to the different order of data samples seen during training. When you trim the dataset so that the entire input data is used for training (scenario 1), the network sees the data in the same order as it appears in the dataset. However, when you specify the training data using the indices (scenario 2), the network sees the data in a different order based on the selected indices.
In a neural network, the order in which data samples are presented during training can have an impact on the convergence and final performance of the model. Different orders of data samples can lead to different weight updates during training, potentially resulting in slightly different results.
To address this issue and ensure more consistent results, you can try the following:
  1. Shuffle the dataset: Before creating the neural network and specifying the trainInd in scenario 2, shuffle the entire dataset. This randomizes the order of data samples and can lead to more consistent training.
  2. Set the random seed: If you are using a random number generator during training (e.g., weight initialization or mini-batch shuffling), set a fixed random seed before running both scenarios. This ensures that the randomization process during training is the same for both scenarios, leading to more reproducible results.
By shuffling the dataset and setting the random seed, you should get more consistent results between scenario 1 and scenario 2. Keep in mind that neural networks are still sensitive to other factors such as network architecture, learning rate, and training parameters, so it's possible to see slight differences even with these measures in place. However, the consistency should be improved.
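The second point can be sketched as follows: resetting the random number generator to the same state before each run makes weight initialization identical across the two scenarios (the seed value and network sizes here are arbitrary examples, not from the original post):

```matlab
% Fix the RNG state so both scenarios start from identical weights.
rng(0, 'twister');                  % same seed before each run
net = narxnet(1:2, 1:2, 10);
net = configure(net, X, T);         % weights drawn with the fixed seed
```

With the RNG fixed, any remaining difference between the two scenarios must come from the data actually presented to `train`, which helps isolate the indexing question.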
  4 comments


More Answers (0)
