Hi Abishek,
I just had a look at your evaluation file and saw why you receive a different score. I have to admit you couldn't know that our evaluation is slightly different from what is described on Codalab in order to get a more accurate, message focused similarity score.
What you are doing is basically creating one vector for all headline scores, and one vector for all gold standard scores to then calculate the cosine similarity of both.
With that approach you're missing 3 vital parts which are different to our evaluation:
What we did is (for both datasets, the GS and the submission), we filtered the data in order to find out which headlines have multiple entities. Then, we created one vector for each instance (the vectors are having different lengths according to the number of entities related to it). Having a vector for each instance, we calculated the cosine similarity for each instance. Those similarity scores have been summed up and divided by the number of instances in order to receive an average cosine similarity score for all instances which has been multiplied by the cosine weight at the end.
Is that clear for you?
Best,
Tobias
We’ve been looking into this and I think we found a good extension to the current way of evaluation.
We are going to treat vectors with a length of 1 differently than vectors including the scores of multiple entities.
This will allow us to still take the relation of entities into consideration while creating an overall score, as well as it is handling the “single score” problem. Scores with different signs (+/- or vice versa) are still going to be 0 since the sentiment is totally opposite. But for having a positive score for a positive prediction (or negative & negative), we are using an additional measure in order to include the distance in the overall score. This will be the distance of both scores ( 1 - | GSi - PSi | ) which gives us a similar score between 0 and 1 as the cosine similarity does. In addition, to not overweight single scores (for a 1 in the cosine similarity you need to have predicted multiple sentiments correctly while the single score is derived from only one prediction) we are weighting the cosine similarity scores in accordance with the length of the given input vector.
Putting all scores in one GS vector and one Input vector to then use the cosine similarity (or something similar) is no solution for this since the task was to score the entities and not instances (or documents if you wish). By creating one vector for all scores, each entity would be treated in the same way. The entity (cashtag) - instance (tweet) relation would totally be ignored.
I'm looking forward to hearing your thoughts on this.
Best,
Tobias
--You received this message because you are subscribed to the Google Groups "SemEval 2017: Task 5" group.To unsubscribe from this group and stop receiving emails from it, send an email to semeval-2017-ta...@googlegroups.com.To post to this group, send email to semeval-2...@googlegroups.com.To view this discussion on the web visit https://groups.google.com/d/msgid/semeval-2017-task-5/2b63cebf-c75f-4f04-8ae1-b4bfdb1c34a1%40googlegroups.com.For more options, visit https://groups.google.com/d/optout.
Hello Manel and Tobias,I do agree that predicting sentiment of opposite polarity is not totally good but then the question is - Is this metric fair enough? As I have understood from the working of this metric, it heavily penalizes classification error. Fair Enough. But is this what the cosine similarity function does too? Cosine similarity would only penalize by same the amount if the gold score and the predicted score are exactly opposite in nature. Example - if the gold score is [ 0.9, 0.5, 0.8] and the predicted score is [-0.9, -0.5, -0.8] then only cosine similarity would penalize by same amount i.e it would output 0. But sadly this is not the case with your proposed metric. Please look into this!Regards,Abhishek Kumar
To unsubscribe from this group and stop receiving emails from it, send an email to semeval-2017-task-5+unsub...@googlegroups.com.
To post to this group, send email to semeval-2...@googlegroups.com.To view this discussion on the web visit https://groups.google.com/d/msgid/semeval-2017-task-5/2b63cebf-c75f-4f04-8ae1-b4bfdb1c34a1%40googlegroups.com.For more options, visit https://groups.google.com/d/optout.
--You received this message because you are subscribed to the Google Groups "SemEval 2017: Task 5" group.To unsubscribe from this group and stop receiving emails from it, send an email to semeval-2017-ta...@googlegroups.com.To post to this group, send email to semeval-2...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/semeval-2017-task-5/743a6154-7122-4889-8d67-139f22384c70%40googlegroups.com.
On 24 Feb 2017, at 19:37, Abhishek Kumar <abhiro...@gmail.com> wrote:
Hello Manel and Tobias,I do agree that predicting sentiment of opposite polarity is not totally good but then the question is - Is this metric fair enough? As I have understood from the working of this metric, it heavily penalizes classification error. Fair Enough. But is this what the cosine similarity function does too? Cosine similarity would only penalize by same the amount if the gold score and the predicted score are exactly opposite in nature. Example - if the gold score is [ 0.9, 0.5, 0.8] and the predicted score is [-0.9, -0.5, -0.8] then only cosine similarity would penalize by same amount i.e it would output 0. But sadly this is not the case with your proposed metric. Please look into this!Regards,
Abhishek Kumar
To unsubscribe from this group and stop receiving emails from it, send an email to semeval-2017-task-5+unsub...@googlegroups.com.
To post to this group, send email to semeval-2...@googlegroups.com.To view this discussion on the web visit https://groups.google.com/d/msgid/semeval-2017-task-5/2b63cebf-c75f-4f04-8ae1-b4bfdb1c34a1%40googlegroups.com.For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "SemEval 2017: Task 5" group.
To unsubscribe from this group and stop receiving emails from it, send an email to semeval-2017-ta...@googlegroups.com.
To post to this group, send email to semeval-2...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/semeval-2017-task-5/743a6154-7122-4889-8d67-139f22384c70%40googlegroups.com.