Hi all,
I am trying to impute using STITCH from bam files generated from simulated data. When I run the command:
STITCH.R --chr="rep_1" --bamlist=bamlist_sorted_1x_wRGSM.txt --posfile=bam_list_160_1x_pos.txt --outputdir=./ --K=40 --nGen=100 --nCores=8
I get:
[2019-11-29 17:43:02] Running STITCH(chr = rep_1, nGen = 100, posfile = bam_list_160_1x_pos.txt, K = 40, S = 1, outputdir = ./, nStarts = , tempdir = NA, bamlist = bamlist_sorted_1x_wRGSM.txt, cramlist = , sampleNames_file = , reference = , genfile = , method = diploid, output_format = bgvcf, B_bit_prob = 16, outputInputInVCFFormat = FALSE, downsampleToCov = 50, downsampleFraction = 1, readAware = TRUE, chrStart = NA, chrEnd = NA, regionStart = NA, regionEnd = NA, buffer = NA, maxDifferenceBetweenReads = 1000, maxEmissionMatrixDifference = 1e+10, alphaMatThreshold = 1e-04, emissionThreshold = 1e-04, iSizeUpperLimit = 600, bqFilter = 17, niterations = 40, shuffleHaplotypeIterations = c(4, 8, 12, 16), splitReadIterations = 25, nCores = 8, expRate = 0.5, maxRate = 100, minRate = 0.1, Jmax = 1000, regenerateInput = TRUE, originalRegionName = NA, keepInterimFiles = FALSE, keepTempDir = FALSE, outputHaplotypeProbabilities = FALSE, switchModelIteration = NA, generateInputOnly = FALSE, restartIterations = NA, refillIterations = c(6, 10, 14, 18), downsampleSamples = 1, downsampleSamplesKeepList = NA, subsetSNPsfile = NA, useSoftClippedBases = FALSE, outputBlockSize = 1000, outputSNPBlockSize = 10000, inputBundleBlockSize = NA, genetic_map_file = , reference_haplotype_file = , reference_legend_file = , reference_sample_file = , reference_populations = NA, reference_phred = 20, reference_iterations = 40, reference_shuffleHaplotypeIterations = c(4, 8, 12, 16), output_filename = NULL, initial_min_hapProb = 0.2, initial_max_hapProb = 0.8, regenerateInputWithDefaultValues = FALSE, plotHapSumDuringIterations = FALSE, plot_shuffle_haplotype_attempts = FALSE, plotAfterImputation = TRUE, save_sampleReadsInfo = FALSE, gridWindowSize = NA, shuffle_bin_nSNPs = NULL, shuffle_bin_radius = 5000, keepSampleReadsInRAM = FALSE, useTempdirWhileWriting = FALSE, output_haplotype_dosages = FALSE)
[2019-11-29 17:43:02] Program start
[2019-11-29 17:43:02] Get and validate pos and gen
[2019-11-29 17:43:03] Done get and validate pos and gen
[2019-11-29 17:43:03] Get BAM sample names
[2019-11-29 17:43:03] Done getting BAM sample names
[2019-11-29 17:43:03] Generate inputs
[2019-11-29 17:43:04] WARNING - sample sample_28_sorted_1x has no informative reads. It is being given random reads. Consider removing from analysis
[2019-11-29 17:43:04] WARNING - sample sample_46_sorted_1x has no informative reads. It is being given random reads. Consider removing from analysis
[2019-11-29 17:43:04] WARNING - sample sample_100_sorted_1x has no informative reads. It is being given random reads. Consider removing from analysis
The last line repeats for every sample in the bam list. The name of the simulated chromosome is "rep_1" in the bams and the position file. Any ideas about the problem?
Thanks,
Aryn