I think your question is based on a misunderstanding of what Kaldi is all about.
Kaldi isn't a single speech recognition model that we sometimes "update", it is a set of tools and we provide example scripts for different setups. And it doesn't really make sense to evaluate the accuracy of the Aspire model on Librispeech, when kaldi actually has scripts to build models trained with Librispeech and we report results for it.
E.g. just look at the file
# local/chain/compare_wer.sh exp/chain_cleaned/tdnn_1b_sp exp/chain_cleaned/tdnn_1c_sp
# System tdnn_1b_sp tdnn_1c_sp
# WER on dev(fglarge) 3.77 3.35
# WER on dev(tglarge) 3.90 3.49
# WER on dev(tgmed) 4.89 4.30
# WER on dev(tgsmall) 5.47 4.78
# WER on dev_other(fglarge) 10.05 8.76
# WER on dev_other(tglarge) 10.80 9.26
# WER on dev_other(tgmed) 13.07 11.21
# WER on dev_other(tgsmall) 14.46 12.47
# WER on test(fglarge) 4.20 3.87
# WER on test(tglarge) 4.28 4.08
# WER on test(tgmed) 5.31 4.80
# WER on test(tgsmall) 5.97 5.25
# WER on test_other(fglarge) 10.44 8.95
# WER on test_other(tglarge) 11.05 9.41
# WER on test_other(tgmed) 13.36 11.52
# WER on test_other(tgsmall) 14.90 12.66