Hi Barak,
sorry for my late answer!
First of all,
precision and recall are normalized to the interval [0, 1]
where 0 means worst quality and 1 best/perfect quality. Granularity is
not normalized and it should be minimized, i.e., a granularity of 1
means best quality; the higher the granularity the worse the quality.
then:
if i don't detect any plagiairism (which is great) i get:
This
would not be great since there is definitely plagiarism to be found.
So, if you report nothing you will receive an overall score of 0.
recall = 0/0
precision = 0/0
granularity = 0/0
The
above explanation in mind, you see that recall and precision are the
worst possible values. Granularity is in this case an extreme value, and
for the calculation of the overall performance it will be set to 1, i.e., granularity = min(1, granularity).
Although this means you achieved best granularity you will still get an overall 0
because of the bad precision and recall values.
if i do declare some passage (i.e of size 1000 chars) i get:
recall = 0/0
precision = 0/1000
granularity = 0/1
I presume that with "some passage" you
refer to a passage which has not actually been plagiarized, i.e., a
false-positive. The overall performance would therefore be the same as
above, 0.
Anyway, note that you miscalculated the granularity: Since you
reported a 1000 char-passage which was not actually plagiarized the
denominator of the granularity would be 0, too.
I hope I could clarify the evaluation measures, but if not, don't hesitate to get back to me.
Best,
Martin