Hi, I had a problem attempting to retrain SEMAFOR. I am using the version available here: (
https://github.com/Noahs-ARK/semafor/)
When running 3_1_idCreateAlphabet.sh, I get a Number Format Exception "for input string "3:4" "
I am using the naacl2012 splits, this string is part of the role span pairs in cv.train.sentences.frame.elements.
Is this step assuming the data will be formatted differently? In the training/data/README it describes the data as it is in the naacl2012 directory, however it seems like the colon is causing problems here.
I tried editing cv.train.sentences.frame.elements to only include the first token, rather than a span (so 3 instead of 3:4), just to see if it would run through like that, but this provides another error. (IndexOutOfBoundsExceptions: index (2) must be less than size (1).
Thanks in advance!