Dear all,
I am running the sorting step in TASSEL (SortGenotypeFilePlugin). It finished in about 3 minutes for a 4 GB VCF file, but the log reports an ordering error, as shown below.
Memory Settings: -Xms512m -Xmx400g
Tassel Pipeline Arguments: -debug -SortGenotypeFilePlugin -inputFile GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9.vcf.recode.vcf -outputFile GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9_sorted_debug.vcf -fileType VCF
[main] INFO net.maizegenetics.tassel.TasselLogging - Tassel Version: 5.2.38 Date: July 13, 2017
[main] INFO net.maizegenetics.tassel.TasselLogging - Max Available Memory Reported by JVM: 364089 MB
[main] INFO net.maizegenetics.tassel.TasselLogging - Java Version: 1.8.0_131
[main] INFO net.maizegenetics.tassel.TasselLogging - OS: Linux
[main] INFO net.maizegenetics.tassel.TasselLogging - Number of Processors: 13
[main] INFO net.maizegenetics.pipeline.TasselPipeline - Tassel Pipeline Arguments: [-fork1, -SortGenotypeFilePlugin, -inputFile, GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9.vcf.recode.vcf, -outputFile, GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9_sorted_debug.vcf, -fileType, VCF, -runfork1]
net.maizegenetics.analysis.data.SortGenotypeFilePlugin
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin - Starting net.maizegenetics.analysis.data.SortGenotypeFilePlugin: time: Nov 5, 2017 12:41:48
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin -
SortGenotypeFilePlugin Parameters
inputFile: GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9.vcf.recode.vcf
outputFile: GS_Fullsib_delete15badseq_indel_biallelic_GQ6_DP2_MAF0.01_Maxmiss0.9_sorted_debug.vcf
fileType: VCF
[pool-1-thread-1] ERROR net.maizegenetics.dna.map.PositionListBuilder - validateOrdering: Position Chr:MA_10 Pos:24775 Name:SMA_10_24775 Variants:G/C MAF:NaN Ref:G and Position Chr:MA_5 Pos:86572 Name:SMA_5_86572 Variants:C/T MAF:NaN Ref:C out of order.
BuilderFromVCF data timing 35.1043s
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin - Finished net.maizegenetics.analysis.data.SortGenotypeFilePlugin: time: Nov 5, 2017 12:44:35
[pool-1-thread-1] INFO net.maizegenetics.pipeline.TasselPipeline - net.maizegenetics.analysis.data.SortGenotypeFilePlugin: time: Nov 5, 2017 12:44:35: progress: 100%
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin - net.maizegenetics.analysis.data.SortGenotypeFilePlugin Citation: Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. (2007) TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633-2635.
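
To narrow this down, one thing that could be checked is where the record order first breaks in a VCF file. Below is a minimal Python sketch of such a check. The function name `first_out_of_order` and the `chrom_key` parameter are made up for illustration, and the default comparison treats chromosome names as plain strings, which is only an assumption and may not match the ordering TASSEL applies internally. The sample records reuse the two positions from the error message in a made-up arrangement; they do not show how the real file is laid out.

```python
import io

def first_out_of_order(lines, chrom_key=str):
    """Return (previous, current) for the first adjacent pair of VCF records
    that is out of order, or None if every record is in order.

    chrom_key maps a chromosome name to a sortable value. The default (str)
    compares names as plain strings; this is an assumption, not necessarily
    the rule TASSEL uses.
    """
    prev_key = None
    prev_record = None
    for line in lines:
        # Skip blank lines and VCF header lines.
        if not line.strip() or line.startswith("#"):
            continue
        chrom, pos = line.split("\t", 2)[:2]
        record = (chrom, int(pos))
        key = (chrom_key(chrom), int(pos))
        if prev_key is not None and key < prev_key:
            return prev_record, record
        prev_key, prev_record = key, record
    return None

# Made-up sample: the two positions from the error message, in an arbitrary order.
sample = io.StringIO(
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\n"
    "MA_5\t86572\t.\tC\tT\n"
    "MA_10\t24775\t.\tG\tC\n"
)
result = first_out_of_order(sample)
print(result)
```

For a real file, the same function can be called on an open file handle, for example `first_out_of_order(open("sorted.vcf"))`, where `sorted.vcf` is a placeholder name. Passing a different `chrom_key` makes it possible to test another ordering assumption, such as sorting by the numeric suffix of the scaffold name.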
Does this error mean the output file is not actually sorted? Any suggestions?
Cheers,
Chen