Hello again, sorry to bother you, but something still seems to be wrong. I tried to run the log-probability computation and got the following error:
java -ea -mx5g -server -cp ./src/ edu.berkeley.nlp.lm.io.ComputeLogProbabilityOfTextStream google.binary brown.txt
Reading LM Binary google.binary {
} [19s]
Scoring file -; current log probability is 0.0 {
Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: 2
    at edu.berkeley.nlp.lm.map.CompressedNgramMap.getValueAndOffset(CompressedNgramMap.java:71)
    at edu.berkeley.nlp.lm.StupidBackoffLm.getLogProb(StupidBackoffLm.java:59)
    at edu.berkeley.nlp.lm.ArrayEncodedNgramLanguageModel$DefaultImplementations.getLogProb(ArrayEncodedNgramLanguageModel.java:70)
    at edu.berkeley.nlp.lm.StupidBackoffLm.getLogProb(StupidBackoffLm.java:129)
    at edu.berkeley.nlp.lm.io.ComputeLogProbabilityOfTextStream.computeProb(ComputeLogProbabilityOfTextStream.java:84)
    at edu.berkeley.nlp.lm.io.ComputeLogProbabilityOfTextStream.main(ComputeLogProbabilityOfTextStream.java:64)
brown.txt contains one sentence per line, as described.