kicking myself :)

18 views
Skip to first unread message

Ted Pedersen

unread,
Apr 5, 2013, 6:23:50 PM4/5/13
to semeval-2013-ws...@googlegroups.com
I had about 10 systems or variations that I was running during the
evaluation, and I ended up picking three for submission. Here's how
they did.

sys1 average F1 = 0.56828125
sys7 average F1 = 0.5878125
sys9 average F1 = 0.57015625

One of the ones I didn't submit did like this :

sys6.pk1 average F1 = 0.73828125
sys6.pk2 average F1 = 0.6859375
sys6.pk3 average F1 = 0.65859375

Sigh. :)

Ah well, that's how it goes. Mostly wanted to mention this as a way of
saying thanks for the gold standard data, this is the sort of
fun/frustrating thing one finds as a result of having that available,
along with a very nice evaluation program!

And I *really* need to figure out what I didn't like about sys6 at the
time of submission. My instincts may need a bit of tuning. :)

More as it develops.
Ted

Ted Pedersen

unread,
Apr 5, 2013, 10:15:18 PM4/5/13
to semeval-2013-ws...@googlegroups.com
Well my excitement has subsided a bit. :) It seems like F1 is
correlates pretty strongly to the number of clusters found, so if the
number of clusters is larger F1 is generally larger (although not
always). I guess we can see that in the results with the singleton
baseline getting an F1 of 1.000. But,

I am running some random baselines to try and understand things a bit,
and at least in that case as the number of clusters increases, F1
likewise increases, leading me to wonder how much F1 is actually
showing us...

1 of 2 random senses assigned (balanced distribution)

average F1 = 0.54890625
average Rand Index = 0.5006200396825399
average Adj Rand Index = -3.7523664631867494E-4
average Jaccard Index = 0.2698606769455676
============ average number of created clusters: 2.0
============ average cluster size: 32.0

1 of 5 random senses assigned (balanced distribution)

average F1 = 0.56734375
average Rand Index = 0.5612549603174602
average Adj Rand Index = 0.0011575900428277936
average Jaccard Index = 0.14526247612439977
============ average number of created clusters: 5.0
============ average cluster size: 12.799999999999976

1 of 10 random senses assigned (balanced distribution)

average F1 = 0.59671875
average Rand Index = 0.5810218253968255
average Adj Rand Index = 1.7524984876489096E-4
average Jaccard Index = 0.08182754500715404
============ average number of created clusters: 9.98
============ average cluster size: 6.41422222222221

1 of 25 random senses assigned (balanced distribution)

average F1 = 0.66890625
average Rand Index = 0.5924454365079366
average Adj Rand Index = -0.0015843012763914102
============ average number of created clusters: 23.2
============ average cluster size: 2.769600269448325

1 of 50 random senses assigned (balanced distribution)

average F1 = 0.761875
average Rand Index = 0.5972817460317459
average Adj Rand Index = 0.0010363484473483394
average Jaccard Index = 0.01997375346134799
============ average number of created clusters: 35.88
============ average cluster size: 1.790505301429098
Reply all
Reply to author
Forward
0 new messages