Dear Oliver!
First, thanks a lot for updating the script; we are now able to run the tool on your HTK lattice example. However, when I try to run the tool on lattices that I generated myself, I get the following error:
./LatticeWordSegmentation -KnownN 1 -UnkN 2 -PruneFactor 16 -InputFilesList short_test_list.txt -InputType fst -LatticeFileType htk -NumIter 100 -OutputDirectoryBasename Results_SVite/ -ReferenceTranscription short_test_list.txt.ref -CalculateLPER -CalculatePER -CalculateWER -AmScale 1 -EvalInterval 1 -WordLengthModulation -1 -UseViterby 151 -DeactivateCharacterModel 175 -HTKLMScale 0
-----------------------------------------------------------------------------------------------------------
LatticeWordSegmentation: build date: Apr 19 2016 - time: 10:42:59
-----------------------------------------------------------------------------------------------------------
Reading nBest file [0/1] from HTK FST 011_011c0201.lat
142459 States | 7698208 Arcs (7442 States | 19851 Arcs after pruning)
WARNING: Reference transcription contains more characters than input transcription!
Calculating LPER with pruning from inf with step size 0 to inf!
Pruning factor: inf
Lattice phoneme error rate:
PER: -nan %, Precision: -nan %, Recall: -nan %, F-score: -nan %
Ins: 0, Del: 0, Sub: 0, Corr: 0, NFound: 0, NRef: 0
Starting word segmentation!
Initializing empty language model with KnownN=1, UnkN=2!
Iteration: 1 of 100
Segmentation fault
I ran it on just one lattice. The lattice and reference transcription can be found here: https://drive.google.com/folderview?id=0B6Dx-6Uhwb_jVVB6amh6aWpHY1k&usp=sharing. To me they look exactly like the example lattices and transcriptions. Do you know what might be the problem?
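One observation, in case it helps: all of the LPER counts in the log are 0, including NRef, which would explain the -nan values if the rates are computed as ratios over those counts. Below is a minimal sketch of that reasoning. It assumes the usual textbook definitions (PER = (Ins + Del + Sub) / NRef, Precision = Corr / NFound, Recall = Corr / NRef), which may not be exactly what the tool computes; the function and helper names are hypothetical and not taken from LatticeWordSegmentation.

```python
import math


def ratio_percent(num, den):
    """Return 100 * num / den, or NaN when the denominator is zero.

    This mimics floating-point 0/0 in C++, which yields NaN instead of
    raising an error as Python's division would.
    """
    if den == 0:
        return math.nan
    return 100.0 * num / den


def error_rates(ins, dele, sub, corr, n_found, n_ref):
    """Compute PER, precision, recall and F-score (in percent) from counts.

    Assumed definitions, not confirmed against the tool's source code.
    """
    per = ratio_percent(ins + dele + sub, n_ref)
    precision = ratio_percent(corr, n_found)
    recall = ratio_percent(corr, n_ref)
    # Harmonic mean of precision and recall; NaN inputs propagate to NaN.
    denom = precision + recall
    f_score = math.nan if denom == 0 else 2.0 * precision * recall / denom
    return {"PER": per, "Precision": precision, "Recall": recall, "F-score": f_score}


# Counts copied from the log above: Ins, Del, Sub, Corr, NFound, NRef are all 0.
rates = error_rates(ins=0, dele=0, sub=0, corr=0, n_found=0, n_ref=0)
for name, value in rates.items():
    print(f"{name}: {value} %")
```

With every count at 0, all four values come out as NaN under these assumptions, so it looks as if neither the reference nor the lattice contributed any symbols to the comparison.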
Thanks and kind regards,
Kate