What version of Stacks are you running? Make sure you are using the latest.
I just double-checked the --write_single_snp flag and it is working
correctly. When you enable that flag, the software prunes the other
sites out of memory, so it should not be possible to export the other
sites in any file output as they are no longer in the data structures to
be output.
julian
GM wrote:
>
> I also found that despite using the --write_single_snps option, I still
> get multiple snps from the same locus in the structure output
>
> eg: 24_10 24_20 24_21 24_40 24_61 24_65 180_24 180_29 180_49 180_77 180_87
>
> my command line is this:-e hindIII --write_single_snp -t 30
> --merge_sites -r 0.7 -p 7 -m 5 -a 5 -f bonferroni_gen --p_value_cutoff
> 0.1 --lnl_lim -10 --fasta_strict --structure --phylip --verbose
>
> Not sure if I'm doing something wrong.
>
> Thanks!
>
>
> On Tuesday, September 22, 2015 at 3:18:48 PM UTC-4, GM wrote:
>
> Hi,
>
> I'd like to use the polymorphism data from my populations to perform
> principal component analysis as per the Eigensoft smartPCA module.
> The software requires files of a certain format, and so, I was
> trying to find out the right kind of output files of the Stacks
> populations module to start with. smartPCA requires the following
> files as input
> (
http://genepath.med.harvard.edu/~reich/InputFileFormats.htm
>
> 1) genotype file (.geno) in EIGENSTRAT format
>
> The genotype file contains 1 line per SNP. Each line contains 1
> character per individual:
>
> 0 means zero copies of reference allele.
>
> 1 means one copy of reference allele.
>
> 2 means two copies of reference allele.
>
> 9 means missing data.
>
>
> I'm thinking of using the STRUCTURE format file produced by
> Populations to make this file. Is there an easy way to do this?
> Also,*/just wanted to confirm that 0/1/2/3/4 stand for
> noChange/a/c/g/t/* in the STRUCTURE file. I also found that despite