What is the reference category for mnrfit when response variable is a categorical array

I'm using mnrfit to estimate a binary logit model. The documentation says
Y can be one of the following:
  • An n-by-k matrix, where Y(i,j) is the number of outcomes of the multinomial category j for the predictor combinations given by X(i,:). In this case, the number of observations are made at each predictor combination.
  • An n-by-1 column vector of scalar integers from 1 to k indicating the value of the response for each observation. In this case, all sample sizes are 1.
  • An n-by-1 categorical array indicating the nominal or ordinal value of the response for each observation. In this case, all sample sizes are 1.
and "The estimates for the kth category are taken to be zero as mnrfit takes the last category as the reference category". But if I have a categorical response variable (say H and L), how do I know which one MATLAB takes as the reference category without converting them to 1 and 2 first?

Accepted Answer

Alok Nimrani on 21 Feb 2019
In general, the reference category will be the last category listed by the "categories" function.
For example, in the first example in the "mnrfit" documentation, executing the following code:
>> cats=categories(sp)
results in:
cats =
'setosa'
'versicolor'
'virginica'
Here, the "mnrfit" function will use the last category in "cats" (in this case, "virginica") as the reference category.
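A minimal sketch of how to check and change the reference category, assuming the Statistics and Machine Learning Toolbox and its bundled fisheriris data (the same data as the documentation example). The variable names refDefault, spReordered, and refReordered are illustrative only.

```matlab
load fisheriris                 % provides meas (150-by-4) and species (cell array of labels)
sp = categorical(species);      % categories are created in sorted order by default

cats = categories(sp);
refDefault = cats{end};         % last listed category, which mnrfit treats as the reference

% Move 'setosa' to the end so that it becomes the reference instead.
spReordered = reordercats(sp, {'versicolor', 'virginica', 'setosa'});
catsReordered = categories(spReordered);
refReordered = catsReordered{end};

% Default nominal model: B has one column per non-reference category,
% in the order given by categories(sp); column j compares category j
% with the reference category.
B = mnrfit(meas, sp);
disp(size(B))                   % (number of predictors + 1)-by-(k - 1)
```

With k categories, B has k - 1 columns, so the column order can be read directly from all but the last entry of categories(sp).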
4 Comments
Varshini Murugesh on 23 Jun 2023
I was wondering how you know this is correct. categories() seems to list categorical values alphabetically.
So, to make one category my reference, I renamed it so that it starts with a letter later in the alphabet. This didn't change the absolute value of the coefficients compared to the original model.
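For a two-category response, unchanged absolute values are what one would expect: swapping the reference turns log(P(H)/P(L)) into log(P(L)/P(H)), which only flips the sign of each coefficient. A rough sketch on synthetic data (the data, the coefficients 0.5 and 1.2, and all variable names are made up for illustration, not taken from this thread):

```matlab
rng(0);                                   % reproducible synthetic data
n = 200;
x = randn(n, 1);
pH = 1 ./ (1 + exp(-(0.5 + 1.2*x)));      % hypothetical probability of 'H'
labels = repmat({'L'}, n, 1);
labels(rand(n, 1) < pH) = {'H'};
y = categorical(labels);                  % categories {'H'; 'L'}, so 'L' is the reference

bRefL = mnrfit(x, y);                     % models log(P(H)/P(L))

yFlipped = reordercats(y, {'L', 'H'});    % 'H' is now last, hence the reference
bRefH = mnrfit(x, yFlipped);              % models log(P(L)/P(H))

disp([bRefL, bRefH])                      % second column should be roughly the negation of the first
```

Using reordercats avoids renaming categories just to change their alphabetical position.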


More Answers (1)

Saeed Babanajad on 18 Sep 2021
I am trying to use mnrfit to fit a model to my data. My data has about five continuous numeric variables, and my output is an integer between 2 and 9. I have a couple of very important questions and would be very grateful for your help:
1) Should I use a hierarchical multinomial regression model, given the nature of my output?
2) When I use the above type (and define my output using the categorical function), how do I know which output category each of my B values (the model coefficients) corresponds to? In other words, B is a matrix with 8 columns. How can I know which column corresponds to which output category?
Your help is greatly appreciated!
