Dear Neeraj Sir and fellow scholars,
Greetings from Chandigarh!
I have a doubt regarding the coding aspect in a case of Logistic Regression.
Here, Dependent variable is – Out of Pocket Expenses, coded as; 1-Low expenditure, 2-High Expenditure.
Following are 5 Independent Variables:
1. Age Category, coded as;
1-Under five age group,
2-Child,
3-Adolescents,
4-Adults, and
5-Senior Citizen
2. Monthly Consumer Expenditure, coded as;
1-Lowest monthly group,
2-Lower middle,
3-Upper middle, and
4- Highest monthly exp group
3. Nature of Treatment, coded as:
1-Allopathy
2-Ayush
9-Others
4. Pregnancy, coded as;
1-Pregnant
2- Not Pregnant.
Now, my question is that – Is this coding correct especially in the case of last two independent variables i.e. Nature of Treatment and Pregnancy?
Earnestly waiting for some response.
Thanks and Regards,
Dr. Sukhmani.
--
Protocols of this Group:
1. Plz search previous post in group before posing the question.
2. Don't write query in someone's post. Always use the option of New topic for the new question. You can do this by writing to dataanalys...@googlegroups.com
3. Its better to give a proper subject to your post/query. It'll help others while searching.
4. Never write Open ended queries. This group intend to help research scholars NOT FOR WORK THEM.
5. Never write words like URGENT in ur posts. People will help them when they are free.
6. Never upload any info about National Seminars/Conferences. Send such info on personal emails. And feel free to share any RESEARCH related info.
7. No Happy New Year, Happy Diwali, Happy Holi, Happy B'day, Happy Anniversary etc. allowed on this group.
8. Few months back there was a facility for asking & sharing the Research Papers. Now there is no provision of asking for the research paper here.
Let’s make a better research environment.
---
You received this message because you are subscribed to the Google Groups "DataAnalysis" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataanalysistrai...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataanalysistraining/ecbeca08-f3d7-42cf-a4e7-426b8dd789b8n%40googlegroups.com.
Dear Sir,
Thankyou for your feedback and suggestion about Crosstabulation.
Sorry to bother you again, but I still need clarity on the following:
1. In crosstabulation, we will check the Association between each IV with the DV, and for this, we have to check the Significance of Chi-Square. Am I right sir?
2. But in case, if we have to check the Prediction of IV on DV, then also we can use crosstabulation OR regression is required here?
3. Lastly, you are right…Logistic regression is getting complex here. But I still need to know that…normally we give same kind of coding to “Control/No effect/low effect” category in every variable, i.e. in this case:
Dependent Variable (Out of Pocket Expenses) coded as; 1-Low expenditure, 2-High Expenditure. Here 1 is being assigned to “Control/Low effect” category.
Independent variables:
1. Age Category, coded as; 1-Under five age group, 2-Child, 3-Adolescents, 4-Adults, and 5-Senior Citizen. Here 1 is being assigned to “Control/Low effect” category.
2. Monthly Consumer Expenditure, coded as; 1-Lowest monthly group, 2-Lower middle, 3-Upper middle, 4-Highest monthly exp group. Here 1 is being assigned to “Control/Low effect category”.
3. Nature of Treatment, coded as: 1-Allopathy, 2-Ayush, 9-Others. Here 1 is being assigned to Allopathy. But in actual data 85% respondents are using Allopathy and only 5% are using Others. Now, here - Is it correct to assign 1 to Allopathy which means “Control/No/Low effect” category?
4. Pregnancy, coded as; 1-Pregnant, 2- Not Pregnant. Here also 1 is being assigned to Pregnant, thereby meaning “Control/Low effect” category. But how can be “Pregnant” considered as Control/No/Low effect category” especially in conjunction with other IVs.
And moreover, specifically in SPSS in Logistic Regression, while Defining the Categorical Covariates, we select Reference Category as FIRST where we have given lowest number to code the control category.

As per SPSS requirement also, How can we assign 1 to Allopathy and 1 to Pregnancy, thereby making them as Control category as similar to 1-Under five age and 1-Lowest monthy exp which conveys Control/No/Effect category.
I hope I am clear sir. Please guide on this as I’m very confused on this aspect.
Thanks and Regards,
Dr. Sukhmani.

To view this discussion on the web visit https://groups.google.com/d/msgid/dataanalysistraining/706d66d9-79a8-4d04-bb45-b48b7b740649n%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataanalysistraining/CAAd%3Dc8PDB7F4f_qqG6Q5Q0vVHYFcRE_eqysh2MM74cbrGHB59w%40mail.gmail.com.
--


To view this discussion on the web visit https://groups.google.com/d/msgid/dataanalysistraining/fe0b9657-d879-4bce-8114-4e0a4c4846adn%40googlegroups.com.
Dear Neeraj Sir,
I am greatly indebted for your inputs and moreover, your valuable time that you are putting into this discussion.
Thank you for this wonderful explanation (2 tables) explaining the Basis of Technique applied to Non-metric (categorical) variables.
In the first table, in the footnote, you have mentioned - *Results of Non-Metric Variables will be computed from the perspective of the base category.
Now, here is my Main confusion…regarding the Coding of Base (Control) category
In your second message, you have pointed that in my case Logistic Regression can be applied but it will be complex to interpret the output. Totally agree….
But, in general I am asking, in case of Logistic Regression,
We should or shouldn’t code the Base (Control) category of all the Independent
Variables by giving the same number (for e.g., 0 or 1)?
SPSS itself codes the base (control) category as 0 while calculating the
result, right sir?
So, it becomes very necessary for the researcher to carefully code the
categories of IDVs, especially in case of Logistic Regression. Moreover, while entering
data in SPSS for LR, we have to specify which of our IDVs are Categorical variables
along with the Reference Category (Last or First) (kindly see the screenshot
below). “Last” is chosen when we have used highest number to code the control
(base) category and “First” is chosen when we have used the lowest number to
code the control (base) category, in all the Categorical IDVs, Okay sir?

Now, in my case…
Dependent Variable is (Out of Pocket Expenses) coded as; 1-Low expenditure, 2-High Expenditure. Here 1 is being assigned to “Control/Low effect” category.
Independent variables:
1. Age Category, coded as; 1-Under five age group, 2-Child, 3-Adolescents, 4-Adults, and 5-Senior Citizen. Here 1 is being assigned to “Control/Low effect” category.
2. Monthly Consumer Expenditure, coded as; 1-Lowest monthly group, 2-Lower middle, 3-Upper middle, 4-Highest monthly exp group. Here 1 is being assigned to “Control/Low effect category”.
3. Nature of Treatment, coded as: 1-Allopathy, 2-Ayush, 9-Others. Here 1 is being assigned to Allopathy. But in actual data 85% respondents are using Allopathy and only 5% are using Others. Now, here - Is it correct to assign 1 to Allopathy which means “Control/No/Low effect” category?
4. Pregnancy, coded as; 1-Pregnant, 2- Not Pregnant. Here also 1 is being assigned to Pregnant, thereby meaning “Control/Low effect” category. But how can be “Pregnant” considered as Control/No/Low effect category” especially in conjunction with other IVs.
Sir, I am extremely sorry, but I am still not getting the clarity in Coding aspect i.e. How can “Allopathy” and “Pregnancy” be considered as Base (Control) category in Logistic Regression (In my opinion they cannot be regarded as having No or Less effect) whereas in other IDVs “Under five age group” and “Lowest income group” are regarded as Base (Control) category?
Sir, kindly explain this as per your statement “*Results of Non-Metric Variables will be computed from the perspective of the base category.” OR, Pl Pardon my asking, How would you code them while calculating LR?
Lastly, may be coding is not that important aspect in case of Chi-Square and Log-linear analysis But I think it is an important aspect in case of LR.
Please guide, sir!
Thankyou and Regards,
To view this discussion on the web visit https://groups.google.com/d/msgid/dataanalysistraining/054633cb-0505-4e16-8b7c-9265748a8eb4n%40googlegroups.com.