Any tips on speeding up output from HDF5?
For large projects, outputting a VCF or Hapmap from the HDF5 is quite slow. Does anyone know of ways to speed up this process? Below are examples of how I am creating VCFs and Hapmaps from HDF5 files. Any tips would be greatly appreciated!
/usr/local/bin/tassel-5-standalone/
run_pipeline.pl -Xms80g -Xmx190g -fork1 -h5 HDF5/$Study\_productioHapMap_noKO.h5 -filterAlign -filterAlignMinFreq $MAF -filterAlignRemMinor -export ./hapmap/$Study.vcf -exportType VCF -runfork1 > ./logs/VCFFromHDF5.log
/usr/local/bin/tassel-5-standalone/
run_pipeline.pl -Xms80g -Xmx190g -h5 HDF5/$Study\_productioHapMap_noKO.h5 -filterAlign -filterAlignMinFreq $MAF -filterAlignRemMinor -export hapmap/$Study.hmp.txt -exportType Hapmap > ./logs/HapmapFromHDF5.log