Dear Professor Bierlaire,
Thank you for your response, and I apologize for the lack of clarity in my previous questions. I have successfully used Biogeme to estimate the coefficients and class probabilities for a latent class logit model. I would like to classify the dataset based on the estimated class membership probabilities in order to better understand the characteristics of each class. In my case, the model includes two latent classes and two alternatives (choice 1 and choice 2). Equation are as follows:
P(predict choice 1) = P(class 1)*P(predict choice 1 | class 1) + P(class 2)*P(predict choice 1 | class 2)
P(predict choice 2) = P(class 1)*P(predict choice 2 | class 1) + P(class 2)*P(predict choice 2 | class 2)
So, class membership probability should be
P(posterior class 1 of each traveler) = P(class 1)*P(predict choice 1 | class 1) / P(predict choice 1) if traveler choose choice 1
or
P(posterior class 1 of each traveler) = P(class 1)*P(predict choice 2 | class 1) / P(predict choice 2) if traveler choose choice 2
My question is whether the traveler's choice indicated in the red text should be based on the actual observed choice from the data or the predicted choice from the model calculations.