In this testing set, I'm just doing one normal and one tumor. If possible, I would like to scale this up, though, to one tumor and say 40 normals, as per GATK's
recommendation for a panel of normals.It looks like it could be a problem with the BAM index.
Here's my command and the start of the error
python /abs/path/to/Platypus_0.8.1/Platypus.py callVariants -o indels_matched.vcf --bamFiles=/abs/path/to/tumordna.bam,/abs/path/to/normaldna.bam --logFileName=platypus_trouble.log --refFile=/abs/path/to/hg38.fa --genIndels=1 --genSNPs=0
2017-08-03 08:18:16,496 - INFO - Beginning variant calling
2017-08-03 08:18:16,496 - INFO - Output will go to normaldna.vcf
[W::hts_idx_load2] The index file is older than the data file: /abs/path/to/tumordna.bai
[W::hts_idx_load2] The index file is older than the data file: /abs/path/to/tumordna.bai
2017-08-03 08:18:17,157 - INFO - Processing region 1:0-100000. (Only printing this message every 10 regions of size 100000)
However, I'm not convinced; running samtools idxstats churns out perfectly normal statistics on both despite their same warning;
samtools idxstats tumordna.bam
Warning: The index file is older than the data file: tumordna.bai
1 248956422 11392213 7879
2 242193529 8628419 5913
3 198295559 9399751 6165
4 190214555 4886955 3337
5 181538259 5489974 3893
6 170805979 6026849 4186
7 159345973 5836764 4164
8 145138636 4151537 3004
9 138394717 4802964 3449
10 133797422 4713127 3375
11 135086622 6878823 4761
12 133275309 5923137 4352
13 114364328 2254480 1557
14 107043718 4217733 2737
15 101991189 3937463 2725
16 90338345 3639656 2746
17 83257441 5696854 4281
18 80373285 1800628 1247
19 58617616 7705012 6037
20 64444167 2690292 2064
21 46709983 1539867 1191
22 50818468 1503402 1285
X 156040895 2423614 1809
Y 57227415 49893 182
MT 16569 76847 458
* 0 0 7620