I created a lattice file for an audio file using the commands below
(before this I also ran fix_dir and compute_cmvn):
#extract ivectors
steps/online/nnet2/extract_ivectors.sh --nj 1 --cmd run.pl data/test_clean_hires data/lang exp/nnet3_cleaned/extractor exp/nnet3_cleaned/ivectors_test_clean_hires
#make a graph
utils/mkgraph.sh --self-loop-scale 1.0 --remove-oov data/lang exp/nnet3_cleaned/tdnn_sp exp/nnet3_cleaned/tdnn_sp/graph_tgsmall
#decode using the graph (this creates the lattice file lat.1.gz)
steps/nnet3/decode.sh --acwt 1.0 --post-decode-acwt 10.0 --nj 1 --online-ivector-dir exp/nnet3_cleaned/ivectors_test_clean_hires exp/nnet3_cleaned/tdnn_sp/graph_tgsmall data/test_clean_hires exp/nnet3_cleaned/tdnn_sp/decode_test_tgsmall
#to get the transcript
../../../src/latbin/lattice-best-path ark:'gunzip -c exp/nnet3_cleaned/tdnn_sp/decode_test_tgsmall/lat.1.gz |' ark,t:| utils/int2sym.pl -f 2- data/lang/words.txt > out.txt
Now I have the transcript, but I want the phonemes with their start and end times, as in a normal CTM file.
I have the lattice file, and I have looked through most of the scripts for converting a lattice to phonemes, but I could not find one that fits my use case.
FYI, I tried lattice-to-phone-lattice, but its output is another lattice file, not a text or CTM file.
Please let me know if there is a single script, or a sequence of scripts, that produces the required output.
Thanks,
Sai