Hey there,
I got a little bit confused by the WSI Java evaluation tool and what implicit information my input actually contains. I provide the following clustering:
subTopicID resultID
1.1 1.1
1.4 1.2
1.5 1.3
1.4 1.4
1.1 1.5
1.7 1.6
1.1 1.7
1.6 1.8
1.6 1.9
1.2 1.10
1.4 1.11
1.5 1.12
1.1 1.13
1.6 1.14
1.4 1.15
1.10 1.16
... ....
When I swap line two and three (1.1 1.1 with 1.4 1.2), but do not change anything else, the WSI evaluation tool output changes (see below "Resulting list"), as cluster 1 and 2 swap lables, and thus result in a different 'resulting list'.
Why is that the case? As I understand, the line position of a cluster-document-mapping in the file has only an influence on the position of a document within each cluster.
And: How can the information about the order of the clusters themselves be encoded? When swapping cluster labels (eg. making 1.1 to 1.10) , it seems to have no effect on the resulting document order.
Thanks a lot, Michael
============= Query 134 : "aida" ==============
================== 10 snippet clusters: ====================
The cluster 1 contains the snippets: [1, 5, 7, 13, 25, 26, 28, 31, 32, 38, 40, 46, 48, 53, 55, 58, 59, 70, 71, 74, 76, 77, 85, 86, 91, 92, 98]
The cluster 3 contains the snippets: [3, 12, 23, 34, 36, 41, 47, 50, 51, 54, 57, 62, 68, 75, 81, 83, 94]
The cluster 2 contains the snippets: [2, 4, 11, 15, 27, 29, 33, 42, 52, 60, 61, 67, 69, 82, 88, 99]
The cluster 6 contains the snippets: [10, 24, 56, 63, 78, 84, 90, 93, 95, 100]
The cluster 5 contains the snippets: [8, 9, 14, 17, 21, 39, 43, 66, 87]
The cluster 7 contains the snippets: [16, 18, 35, 45, 64, 65]
The cluster 4 contains the snippets: [6, 20, 79, 80]
The cluster 8 contains the snippets: [19, 30, 37, 49]
The cluster 10 contains the snippets: [72, 89, 96, 97]
The cluster 9 contains the snippets: [22, 44, 73]
[ INFO ] WSIEvaluator -
================== Starting Evaluation ==================
Resulting list: [1, 2, 3, 6, 8, 10, 16, 19, 22, 72, 5, 4, 12, 20, 9, 24, 18, 30, 44, 89, 7, 11, 23, 79, 14, 56, 35, 37, 73, 96, 13, 15, 34, 80, 17, 63, 45, 49, 97, 25, 27, 36, 21, 78, 64, 26, 29, 41, 39, 84, 65, 28, 33, 47, 43, 90, 31, 42, 50, 66, 93, 32, 52, 51, 87, 95, 38, 60, 54, 100, 40, 61, 57, 46, 67, 62, 48, 69, 68, 53, 82, 75, 55, 88, 81, 58, 99, 83, 59, 94, 70, 71, 74, 76, 77, 85, 86, 91, 92, 98]
============= Query 134 : "aida" ==============
================== 10 snippet clusters: ====================
The cluster 2 contains the snippets: [1, 5, 7, 13, 25, 26, 28, 31, 32, 38, 40, 46, 48, 53, 55, 58, 59, 70, 71, 74, 76, 77, 85, 86, 91, 92, 98]
The cluster 3 contains the snippets: [3, 12, 23, 34, 36, 41, 47, 50, 51, 54, 57, 62, 68, 75, 81, 83, 94]
The cluster 1 contains the snippets: [2, 4, 11, 15, 27, 29, 33, 42, 52, 60, 61, 67, 69, 82, 88, 99]
The cluster 6 contains the snippets: [10, 24, 56, 63, 78, 84, 90, 93, 95, 100]
The cluster 5 contains the snippets: [8, 9, 14, 17, 21, 39, 43, 66, 87]
The cluster 7 contains the snippets: [16, 18, 35, 45, 64, 65]
The cluster 4 contains the snippets: [6, 20, 79, 80]
The cluster 8 contains the snippets: [19, 30, 37, 49]
The cluster 10 contains the snippets: [72, 89, 96, 97]
The cluster 9 contains the snippets: [22, 44, 73]
[ INFO ] WSIEvaluator -
================== Starting Evaluation ==================
Resulting list: [2, 1, 3, 6, 8, 10, 16, 19, 22, 72, 4, 5, 12, 20, 9, 24, 18, 30, 44, 89, 11, 7, 23, 79, 14, 56, 35, 37, 73, 96, 15, 13, 34, 80, 17, 63, 45, 49, 97, 27, 25, 36, 21, 78, 64, 29, 26, 41, 39, 84, 65, 33, 28, 47, 43, 90, 42, 31, 50, 66, 93, 52, 32, 51, 87, 95, 60, 38, 54, 100, 61, 40, 57, 67, 46, 62, 69, 48, 68, 82, 53, 75, 88, 55, 81, 99, 58, 83, 59, 94, 70, 71, 74, 76, 77, 85, 86, 91, 92, 98]