1) I am trying to use -ref flag in DiscoverySNPCallerV2 Plugin in Tassel version: 5.2.23 but its showing this error
ERROR net.maizegenetics.plugindef.AbstractPlugin - null key in entry: null=TaxaDist{taxaWithRead=0, totalDepth=0, maxTaxa=13, []}
/TASSEL/tassel5/lib/commons-codec-1.10.jar:/TASSEL/tassel5/lib/ahocorasick-0.2.4.jar:/TASSEL/tassel5/lib/batik-awt-util.jar:/TASSEL/tassel5/lib/batik-css.jar:/TASSEL/tassel5/lib/batik-dom.jar:/TASSEL/tassel5/lib/batik-ext.jar:/TASSEL/tassel5/lib/batik-gui-util.jar:/TASSEL/tassel5/lib/batik-gvt.jar:/TASSEL/tassel5/lib/batik-parser.jar:/TASSEL/tassel5/lib/batik-svg-dom.jar:/TASSEL/tassel5/lib/batik-svggen.jar:/TASSEL/tassel5/lib/batik-util.jar:/TASSEL/tassel5/lib/batik-xml.jar:/TASSEL/tassel5/lib/biojava-alignment-4.0.0.jar:/TASSEL/tassel5/lib/biojava-core-4.0.0.jar:/TASSEL/tassel5/lib/biojava-phylo-4.0.0.jar:/TASSEL/tassel5/lib/cisd-jhdf5-batteries_included_lin_win_mac.jar:/TASSEL/tassel5/lib/colt.jar:/TASSEL/tassel5/lib/commons-math3-3.4.1.jar:/TASSEL/tassel5/lib/ejml-0.23.jar:/TASSEL/tassel5/lib/forester.jar:/TASSEL/tassel5/lib/geronimo-spec-activation-1.0.2-rc4.jar:/TASSEL/tassel5/lib/guava-14.0.1.jar:/TASSEL/tassel5/lib/htsjdk-1.138.jar:/TASSEL/tassel5/lib/itextpdf-5.1.0.jar:/TASSEL/tassel5/lib/javax.json-1.0.4.jar:/TASSEL/tassel5/lib/jcommon-1.0.6.jar:/TASSEL/tassel5/lib/jfreechart-1.0.3.jar:/TASSEL/tassel5/lib/json-simple-1.1.1.jar:/TASSEL/tassel5/lib/junit-4.10.jar:/TASSEL/tassel5/lib/log4j-1.2.13.jar:/TASSEL/tassel5/lib/mail-1.4.jar:/TASSEL/tassel5/lib/poi-3.0.1-FINAL-20070705.jar:/TASSEL/tassel5/lib/postgresql-9.4-1201.jdbc41.jar:/TASSEL/tassel5/lib/slf4j-api-1.7.10.jar:/TASSEL/tassel5/lib/slf4j-simple-1.7.10.jar:/TASSEL/tassel5/lib/snappy-java-1.1.1.6.jar:/TASSEL/tassel5/lib/sqlite-jdbc-3.8.5-pre1.jar:/TASSEL/tassel5/lib/trove-3.0.3.jar:/TASSEL/tassel5/lib/xercesImpl.jar:/TASSEL/tassel5/lib/xml.jar:/TASSEL/tassel5/lib/xmlParserAPIs.jar:/TASSEL/tassel5/sTASSEL.jar
Memory Settings: -Xms512m -Xmx175g
Tassel Pipeline Arguments: -fork1 -DiscoverySNPCallerPluginV2 -db database/T23.db -mnMAF 0.01 -mnLCov 0.05 -ref /home/Genome.fa -deleteOldData true -endPlugin -runfork1
[main] INFO net.maizegenetics.tassel.TasselLogging - Tassel Version: 5.2.23 Date: March 31, 2016
[main] INFO net.maizegenetics.tassel.TasselLogging - Max Available Memory Reported by JVM: 159289 MB
[main] INFO net.maizegenetics.tassel.TasselLogging - Java Version: 1.8.0_45
[main] INFO net.maizegenetics.tassel.TasselLogging - OS: Linux
[main] INFO net.maizegenetics.tassel.TasselLogging - Number of Processors: 16
[main] INFO net.maizegenetics.pipeline.TasselPipeline - Tassel Pipeline Arguments: [-fork1, -DiscoverySNPCallerPluginV2, -db, database/T23.db, -mnMAF, 0.01, -mnLCov, 0.05, -ref, /home/Genome.fa, -deleteOldData, true, -endPlugin, -runfork1]
net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin - Starting net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2: time: Apr 7, 2016 13:44:6
Genome FASTA character conversion: ACGTNacgtn to ACGTNacgtn
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - MinMAF:0.0100000
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - includeRare:false includeGaps:false
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin -
DiscoverySNPCallerPluginV2 Parameters
db: database/T23.db
mnMAF: 0.01
mnLCov: 0.05
ref: /home/Genome.fa
sC: null
eC: null
inclRare: false
inclGaps: false
callBiSNPsWGap: false
gapAlignRatio: 1.0
maxTagsCutSite: 64
deleteOldData: true
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - Finding SNPs in database/T23.db.
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - StartChr:null EndChr:null
size of all tags in tag table=928159
size of all tissues in tissue table=0
size of all tags in mappingApproach table=2
size of all taxa in taxa table=13
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - deleteOldData is TRUE: Clearing previous Discovery and SNPQuality data
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - Start processing chromosome 1A
[pool-1-thread-1] INFO net.maizegenetics.analysis.gbs.v2.DiscoverySNPCallerPluginV2 - Calling getCutPosForStrand FORWARD strands...
size of all positions in cutPosition table=596215
positionTagTaxaMap size, forwardStrand, true size:10912 totalTagsAdded:15475
[pool-1-thread-1] ERROR net.maizegenetics.plugindef.AbstractPlugin - null key in entry: null=TaxaDist{taxaWithRead=0, totalDepth=0, maxTaxa=13, []}
[pool-1-thread-1] INFO net.maizegenetics.plugindef.AbstractPlugin -
Usage:
DiscoverySNPCallerPluginV2 <options>
-db <Input GBS Database> : Input Database file if using SQLite (required)
-mnMAF <Min Minor Allele Freq> : Minimum minor allele frequency (Default: 0.01)
-mnLCov <Min Locus Coverage> : Minimum locus coverage (proportion of Taxa with a genotype) (Default: 0.1)
-ref <Reference Genome File> : Path to reference genome in fasta format. Ensures that a tag from the reference genome is always included when the tags at a locus are aligned against each other to call SNPs. The reference allele for each site is then provided in the output HapMap files, under the taxon name "REFERENCE_GENOME" (first taxon). DEFAULT: Don't use reference genome.
-sC <Start Chromosome> : Start Chromosome
-eC <End Chromosome> : End Chromosome
-inclRare <true | false> : Include the rare alleles at site (3 or 4th states) (Default: false)
-inclGaps <true | false> : Include sites where major or minor allele is a GAP (Default: false)
-callBiSNPsWGap <true | false> : Include sites where the third allele is a GAP (mutually exclusive with inclGaps) (Default: false)
-gapAlignRatio <Gap Alignment Threshold> : Gap alignment threshold ratio of indel contrasts to non indel contrasts: IC/(IC + NC). Any loci with a tag alignment value above this threshold will be excluded from the pool. (Default: 1.0)
-maxTagsCutSite <Max Number of Cut Sites> : Maximum number of tags per cut site (Default: 64)
-deleteOldData <true | false> : Delete existing SNP data from tables (Default: false)
2) Also, recently there was a post regarding the issue of SNP positions being occasionally wrong in version 22. I have run a couple of big projects using version 22 so I am concerned about the quality of data. Would you recommend re-running the pipelines again with the version 23 or is there any adjustment that I can use to correct the positions?
Your help would be truly appreciated.