Hi,
My name is Greg; I'm a grad student at UC Berkeley working with Dan Klein. We're trying to use AuToBI to predict break indices for CTS data, currently Switchboard. We used the Games models to try to run on the attached files:
java -jar AuToBI.jar -input_file=text/sw2018.A.alignments.TextGrid \
-wav_file=audio/wav/sw02018.A.wav \
-pitch_accent_detector=models/games.acc.detection.model \
-pitch_accent_classifier=models/games.acc.classification.model \
-intonational_phrase_boundary_detector=models/games.inton_phrase.detection.model \
-intermediate_phrase_boundary_detector=models/games.inter_phrase.detection.model \
-phrase_accent_classifier=models/games.phrase_acc.classification.model \
-boundary_tone_classifier=models/games.pabt.classification.model \
-out_file=text/sw2018.A.breaks
The output is as follows:
1 [main] INFO edu.cuny.qc.speech.AuToBI.util.AuToBIUtils - Reading words from: text/sw2018.A.alignments.TextGrid
edu.cuny.qc.speech.AuToBI.core.AuToBIException: No feature extractor registered for feature: nominal_PitchAccentType
at edu.cuny.qc.speech.AuToBI.AuToBI.initializeReferenceCounting(AuToBI.java:308)
at edu.cuny.qc.speech.AuToBI.AuToBI.extractFeatures(AuToBI.java:263)
at edu.cuny.qc.speech.AuToBI.AuToBI.run(AuToBI.java:1390)
at edu.cuny.qc.speech.AuToBI.AuToBI.main(AuToBI.java:1298)
We did some digging around and tried to manually register the PitchAccentTypeFeatureExtractor, but that just led to another error. It looked like a bunch of required feature extractors were not registered, including for features like max[subregionC[delta[rnormC[I]],psyl]] where it was not clear what should be added (the deprecated registerAllFeatureExtractors didn't work). We got the same error from using today's version of the code from GitHub.
What are we doing wrong? Any help would be appreciated!
Thanks,
Greg