How to calculate krippendorff alpha with MASI distance

elgesto elge

unread,

Sep 23, 2015, 12:41:11 PM9/23/15

to nltk-users

Hello,

I have a task to calculate inter-annotator agreement in multi-label classification, where for each example more than one label can be assigned. I found that NLTK can measure agreement based on distance metric.

I am looking for an example of calculating krippendorff alpha with MASI distance.

This is what I have.

import nltk
from nltk.metrics import masi_distance


toy_data = [['1', 5723, [1,2]],['2', 5723, [2,3]]]

task = nltk.metrics.agreement.AnnotationTask(data=toy_data, distance=masi_distance)
print task.alpha()

This code fails with

TypeError: unhashable type: 'list'

Obviously, the following doesn't work to.

toy_data = [['1', 5723, set([1,2])],['2', 5723, set([2,3])]]

Do you have a working example?

Thank you!

Steven Bird

unread,

Sep 23, 2015, 12:50:27 PM9/23/15

to nltk-users

From the documentation, it looks like each item needs to be immutable, i.e. your `toy_data` shouldn't consist of lists but tuples.

http://www.nltk.org/_modules/nltk/metrics/agreement.html

It would be great if someone could contribute more documentation, e.g. here:

http://www.nltk.org/_modules/nltk/metrics/agreement.html#AnnotationTask.alpha

--
You received this message because you are subscribed to the Google Groups "nltk-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to nltk-users+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--

--
Sent from my phone, please excuse miscorrected typos

elgesto elge

unread,

Sep 24, 2015, 3:40:24 AM9/24/15

to nltk-users

Tuples are not appropriate.

It fails with error message

AttributeError: 'tuple' object has no attribute 'intersection'

Alexis Dimitriadis

unread,

Sep 25, 2015, 7:19:48 AM9/25/15

to nltk-...@googlegroups.com

Sounds like the code expects `frozenset` objects: They have an intersection() method and are immutable.

It also sounds like it is badly in need of more documentation...

Alexis

--

elgesto elge

unread,

Sep 26, 2015, 1:02:43 AM9/26/15

to nltk-users

You are right, it's frozenset. For some reason it still doesn't work.

toy_data = [['1', 5723, frozenset([2,3])],['2', 5723, frozenset([2,3])]]

fails with

ret = 1.0 - (self.Do_alpha() / De)
ZeroDivisionError: float division by zero

Reply all

Reply to author

Forward