How does TreeBagger handle NaN values?

7 Feb. 2020

1 Answer

Updated 27 Dec. 2020

I am building a random forest classifier, and some of my features contain a large number of NaN values. It is not clear to me how TreeBagger handles these NaNs. I've seen quite a bit of documentation on how this is handled in other high-level programming languages, but I can't find an explicit description of how it is done in MATLAB. Can anyone point me in the right direction so I can understand the default behavior and any user-specified settings?

Answers (1)

Puru Kathuria on 27 Dec. 2020

General rules that are followed when NaN or missing values are encountered:

Rule 1: The algorithm discards data points in which all of the features are NaN and does not use them for training.

Rule 2: If a data point has NaN values in only a few features, the algorithm keeps it and finds each split on the basis of the valid (non-NaN) values.
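
To see the two rules on a small example, here is a minimal sketch, not a description of TreeBagger's internals. The toy data and variable names are made up for illustration. The all-NaN rows are dropped by hand so that Rule 1 is visible rather than implicit. The 'Surrogate','on' setting is my understanding of the documented option for surrogate splits, which let a tree route observations whose value for the best split predictor is missing; please check the documentation for your release before relying on it.

```matlab
rng(0);                                   % reproducible toy data (hypothetical)
X = randn(200, 3);
Y = double(X(:,1) + X(:,2) > 0);          % binary labels built from clean features

X(rand(200, 1) < 0.3, 3) = NaN;           % feature 3: many missing values
X(1:5, :) = NaN;                          % rows 1-5: every feature is missing

% Rule 1: rows in which all features are NaN carry no usable information.
allMissing = all(isnan(X), 2);
fprintf('%d rows have NaN in every feature\n', nnz(allMissing));

% Rule 2: rows with NaN in only some features stay in the training set.
partlyMissing = any(isnan(X), 2) & ~allMissing;
fprintf('%d rows have NaN in some features\n', nnz(partlyMissing));

% Train without the all-NaN rows. 'Surrogate','on' is an assumption here:
% it requests surrogate splits for observations with missing split values.
Mdl = TreeBagger(50, X(~allMissing, :), Y(~allMissing), ...
    'Method', 'classification', 'Surrogate', 'on');

% Predict for the partially missing rows; labels is a cell array of class names.
labels = predict(Mdl, X(partlyMissing, :));
```

If you would rather not depend on how missing values are treated during training, another option is to impute or remove them yourself before calling TreeBagger, for example with rmmissing.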

Products

Statistics and Machine Learning Toolbox

Release

R2017b
