Hi, sorry for the delay.
Have you tried the official Kaldi example? You should try running it.
I might be wrong, though, so please keep that in mind.
Also, as a general rule in these recipes, no one cared much about the performance of the monophone system; its only purpose was to provide an initialization for the triphone systems.
So, if for some reason you do care, you can play with the number of Gaussians (--totgauss), which defaults to 1000, to improve the score.
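For example, a rough sketch of trying a few values of --totgauss with steps/train_mono.sh. This assumes you are in the recipe directory with path.sh in place and data/train and data/lang already prepared; the values tried, the --nj and --cmd settings, and the exp/mono_g* directory names are only placeholders I picked for illustration, not what the recipe itself uses:

```shell
#!/usr/bin/env bash
# Sketch only: train one monophone system per --totgauss value.
# Assumes the usual recipe layout (path.sh, steps/, data/train, data/lang).
. ./path.sh

for totgauss in 1000 1500 2000; do   # placeholder values, 1000 is the default
  steps/train_mono.sh --nj 4 --cmd run.pl --totgauss "$totgauss" \
    data/train data/lang "exp/mono_g${totgauss}"   # hypothetical output dirs
done
```

Each model would then need its own graph and decode pass, done the same way your run.sh does it for the monophone system, before you can compare scores.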
I cannot provide more help right now (I cannot even run TIMIT at the moment), but hopefully someone else will be able to.
y,