Task results - statistical significance

8 views

Skip to first unread message

Lucia Specia

unread,

Apr 7, 2012, 8:02:11 AM4/7/12

to semeval2012_lexi...@googlegroups.com

Dear all,

Please see below attached the statistical significance tests between all pairs of systems (including baselines) using a randomization test with 1000 iterations and a p-value threshold of 0.05 to be considered statistically significant.

As the table shows, all results are statistically significant except the differences between systems 'baseline-Simple Freq', ' UNT-SimpRank' and 'annlor-simple'.

Best,

Lucia, Sujay, Rada

---------- Forwarded message ----------
From: Cyril Grouin <cyril....@limsi.fr>
Date: 6 April 2012 10:37
Subject: Re: short description of your system - little urgent
To: semeval2012_lexi...@googlegroups.com

Hi Lucia,

Thank you for the results provided in the Excel file. Could it be possible to compute the statistical significance between all participants, especially between the three first one? Indeed, results are very close.

Best regards,
Cyril (on behalf of "annlor" team).

Le jeudi 5 avril 2012 19:51:45 UTC+2, Lucia Specia a écrit :

Dear all,

We are preparing the description paper for Task 1. Could you please send us a short description of your system(s)? 1-2 paragraphs (max 1/2 page) would be great. Please specify the method, whether you used the trial data for training, any external resource you used and anything else you judge particularly relevant. If you are not submitting a paper to SemEval describing your system and already have a publication for it, please send us the reference.

If you can send this by April 12 it will be great!

Best,

Lucia