Sir, You were absolutely correct. Our total RAM size is 156GB and all of it is being used up as can be seen in the snippet below.
fstcomposecontext --context-size=3 --central-position=1 --read-disambig-syms=data/lang/phones/
disambig.int --write-disambig-syms=data/lang/tmp/
disambig_ilabels_3_1.int data/lang/tmp/ilabels_3_1.4809
ERROR: FstHeader::Read: Bad FST header: standard input
mv: cannot stat 'data/lang/tmp/ilabels_3_1.4809': No such file or directory
fstisstochastic data/lang/tmp/CLG_3_1.fst
ERROR: FstHeader::Read: Bad FST header: data/lang/tmp/CLG_3_1.fst
ERROR (fstisstochastic[5.1.0~1-eba4]:ReadFstKaldi():kaldi-fst-io.cc:35) Reading FST: error reading FST header from data/lang/tmp/CLG_3_1.fst
[ Stack-Trace: ]
kaldi::MessageLogger::HandleMessage(kaldi::LogMessageEnvelope const&, char const*)
kaldi::MessageLogger::~MessageLogger()
fst::ReadFstKaldi(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >)
main
__libc_start_main
_start
[info]: CLG not stochastic.
make-h-transducer --disambig-syms-out=exp/tri1/graph/
disambig_tid.int --transition-scale=1.0 data/lang/tmp/ilabels_3_1 exp/tri1/tree exp/tri1/final.mdl
ERROR (make-h-transducer[5.1.0~1-eba4]:Input():kaldi-io.cc:742) Error opening input stream data/lang/tmp/ilabels_3_1
[ Stack-Trace: ]
kaldi::MessageLogger::HandleMessage(kaldi::LogMessageEnvelope const&, char const*)
kaldi::MessageLogger::~MessageLogger()
kaldi::Input::Input(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool*)
main
__libc_start_main
_start
Command exited with non-zero status 1
Command being timed: "./run_tri_graph.sh"
User time (seconds): 31809.43
System time (seconds): 199.38
Percent of CPU this job got: 105%
Elapsed (wall clock) time (h:mm:ss or m:ss): 8:24:43
Average shared text size (kbytes): 0
Average unshared data size (kbytes): 0
Average stack size (kbytes): 0
Average total size (kbytes): 0
Maximum resident set size (kbytes): 159112304
Average resident set size (kbytes): 0
Major (requiring I/O) page faults: 2978
Minor (reclaiming a frame) page faults: 34686328
Voluntary context switches: 1874307
Involuntary context switches: 46521
Swaps: 0
File system inputs: 860768
File system outputs: 2286432
Socket messages sent: 0
Socket messages received: 0
Signals delivered: 0
Page size (bytes): 4096
Sir, we are running this on a virtual instance with 24 cpus and memory 156 GB. Increasing the memory beyond this is not an option for us. I realise an option is to connect the machines using slurm/sge but i want to know if there is any other work around?
Thank you sir.