Hello,
I have observed that I have been missing some errors as stdout/stderr is buffered by default in python (before 3.7).
I am using a machine with python versions 2.7.17, 3.6.9 and I see many people/institutions using similar python configs. Therefore, I think that making sure kaldi scripts running python unbuffered will help in catching some errors which might be invisible due to the buffered stdout/stderr. I can come up with a specific example if needed where this happens. This would also be beneficial for more real-time tracking of logs.
For that reason, I believe setting the
PYTHONUNBUFFERED environment variable in the main recipes like WSJ, TEDLIUM would be helpful.
Let me know if I am on the right path. If so, what would be a good place to add the `export PYTHONUNBUFFERED=1` line? Is the beginning of
run.sh a good place to add this and submit a PR?
Regards,
Sourya