Exam review

jhooks1

unread,

Oct 4, 2013, 9:17:39 PM10/4/13

to cs...@googlegroups.com

Regarding KNN on the review (Sept 10):

"Make the case the Knn suffers from the following problems:

They are computationally expensive (in terms of run time and memory)
They are intolerant of attribute noise.
They are intolerant of irrelevant attributes.
They are sensitive to the choice of the algorithm's similarity function.
They provide little usable information regarding the structure of the data
There is no natural way to work with nominal-valued attributes or missing attribute"

If we get this question on the exam, are we expected to just list these problems, or do we need to prove them somehow?

Tim Menzies

unread,

Oct 5, 2013, 9:49:15 AM10/5/13

to cs...@googlegroups.com

thanks for asking this question on the forum. that way, we can all see the question and help with the answer.

I'd add one "for example" to each line. e.g. "They provide little usable information regarding the structure of the data". For example, while other learners offer some statement about patterns in the data (such as the frequency counts of Naive Bayes), all a standard kNN does is store the training examples in a box with no summary information.

--
You received this message because you are subscribed to the Google Groups "csx73" group.
To unsubscribe from this group and stop receiving emails from it, send an email to csx73+un...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--

:: there are some who call me... tim.m...@gmail.com

:: prof @ cs.ai.se.csee.wvu.usa.sol.virgo.all.nil

:: +1-304-376-2859

:: http://menzies.us (skype = menzies.tim)

vita= http:// goo.gl/8eNhY

pubs= http:// goo.gl/8KPKA

stats= http:// goo.gl/vggy1

wow = http:// goo.gl/2Wg9A

</hubris>

jhooks1

unread,

Oct 5, 2013, 11:34:34 PM10/5/13

to cs...@googlegroups.com

Thanks.

I have some more questions:

"computer the variance of 6 tens and 2 gives." Are gives zeros (and a zero variance)?

From the comparing different learners lecture:

one :mu 0.3 :rank 1
two :mu 0.8 :rank 2

We are displaying these results in a percentile chart from .45 to .85.
How is .3 not getting cut off?

Tim Menzies

unread,

Oct 6, 2013, 3:31:42 PM10/6/13

to cs...@googlegroups.com

On Sat, Oct 5, 2013 at 11:34 PM, jhooks1 <jho...@mix.wvu.edu> wrote:
> Thanks.
>
> I have some more questions:
>
> "computer the variance of 6 tens and 2 gives." Are gives zeros (and a zero
> variance)?

6 6 6 6 6 6 6 2

mu = (6*6 +2)/6 = 38/7 = 5.42

sum (x_i - mu)^2 = 6*(6-6.33333)^2 + (2-6.333)^2 = 6*0.333^2 +
-4.333^2 = 0.666+ 16 = 13.73

sd = sqrt(sum/(n-1)) = sqrt(13.73/(7-1)) = 1.51

> From the comparing different learners lecture:
>
> one :mu 0.3 :rank 1
> two :mu 0.8 :rank 2
>
> We are displaying these results in a percentile chart from .45 to .85.
> How is .3 not getting cut off?

my bad. typo. the range should be wider

t

jhooks1

unread,

Oct 7, 2013, 3:49:29 PM10/7/13

to cs...@googlegroups.com

Are we allowed to use calculators on this exam (e.g. calculating logarithms)?

Tim Menzies

unread,

Oct 7, 2013, 8:07:37 PM10/7/13

to cs...@googlegroups.com

you can use calculators or leave your answers as fractions.

t

On Mon, Oct 7, 2013 at 3:49 PM, jhooks1 <jho...@mix.wvu.edu> wrote:

Are we allowed to use calculators on this exam (e.g. calculating logarithms)?

--

You received this message because you are subscribed to the Google Groups "csx73" group.
To unsubscribe from this group and stop receiving emails from it, send an email to csx73+un...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Jason Hooks

unread,

Oct 8, 2013, 12:29:24 AM10/8/13

to cs...@googlegroups.com

Thanks for answering my questions.

Can you go over

  happy? age       hair
  ------ --------  ------
  y      teenager  black
  y      teenager  bald
  y      teenager  blond
  y      teenager  blond
  y      old man   black
  n      old man   blond
  n      old man   bald

In the following example, what is the entropy of class variable happy? in the entire data set? (see the algorithm later in the section

I am a little confused on the algorithm and what the answer is to the entire data set. Entropy is listed as E(r) = sum Pi*log(Pi). should it be sum -Pi*log(Pi)?

Jeffrey Yancey

unread,

Oct 8, 2013, 2:38:18 PM10/8/13

to cs...@googlegroups.com

Thanks for all the examples, they've been helpful while studying.

I would just like to echo JHook's question concerning the following: "computer the variance of 6 tens and 2 gives.". Specifically asking what a "give" is. I have no questions regarding the computation, that is quite clear.

Again, I really appreciate this group's discussion.

Brian Dunar

unread,

Oct 8, 2013, 4:13:41 PM10/8/13

to cs...@googlegroups.com

I think he means calculate the entropy of...

10,10,10,10,10,10, 2

-Brian

Brian Dunar

unread,

Oct 8, 2013, 4:15:36 PM10/8/13

to cs...@googlegroups.com

Sorry, I meant variance.

Tazin Afrin

unread,

Oct 8, 2013, 4:18:38 PM10/8/13

to cs...@googlegroups.com

I think it makes more sense if it is "computer the variance of 6 tens and 2 fives"

then it would be the variance of 10 10 10 10 10 10 5 5

:)

Jeffrey Yancey

unread,

Oct 8, 2013, 4:32:15 PM10/8/13

to cs...@googlegroups.com

Tazin, you are a genius.

Reply all

Reply to author

Forward