What is the meaning of the denominator P[E] in the 3rd video:

33 views
Skip to first unread message

Sourabh Sinha

unread,
Mar 22, 2014, 2:09:50 PM3/22/14
to wekamooc...@googlegroups.com
Is it the "joint conditional probabilities of the attributes" i.e. = P[sunny, cool, humid, windy]? Why we can't calculate it?

Thanks

Sourabh Sinha

unread,
Mar 22, 2014, 2:14:32 PM3/22/14
to wekamooc...@googlegroups.com
I meant section 3.3 video lecture: "Using probabilities" 

Huihui Duan

unread,
Mar 23, 2014, 8:54:17 PM3/23/14
to wekamooc...@googlegroups.com
Hi Sourabh: 

You are right, we CAN calculate P[E] = P[(sunny, cool, humid, windy) | yes]*P[yes] + P[(sunny, cool, humid, windy) | no]*P[no]. Actually, P[E] is the sum of 0.0053 and 0.0206. However, due to P[E] will be always a fixed number, we don't need to calculate it; because we will normalize the probabilities as discusses in the video.

Hope this helps. 

Best,
Huihui

Fernando DM

unread,
Mar 24, 2014, 6:59:34 PM3/24/14
to wekamooc...@googlegroups.com
Hi Sourabh, 

Just in case, keep in mind that "sunny" is a category and not a variable, and "windy" is a variable and not a category, E (evidence) is the combination of the values for the new day (Sunny,Cool,High,True), when normalized the Pr (yes) is to get the Pr of YES, without knowing about E (then it is not considered in this point).  

***** ie (A new day) *******
Outlook Temp. Humidity Wind =>Play
Sunny   Cool  High     True =>?

Likelihood of the two classes
For “yes” = 2/9 * 3/9 * 3/9 * 3/9 * 9/14 = 0.0053
For “no” = 3/5 * 1/ * 4/5 * 3/5 * 5/14 = 0.0206

Finally:
Conversion into a probability by normalization:
P(“yes”) = 0.0053 / (0.0053 + 0.0206) = 0.205
P(“no”) = 0.0206 / (0.0053 + 0.0206) = 0.795

Since the probability of play=NO, is greater than the probability of play=YES, this new record (new day) is classified as Play = NO.

However, check out: Lesson 3.3 - page 21 and the "CHAPTER 4 Algorithms: The Basic Methods" from the course book.

For any further information, please let me know.

Best Regards,
Ing. Fernando García
Data Mining Specialist
Reply all
Reply to author
Forward
0 new messages