Coding biallelic SNP data


Saad Arif

Oct 22, 2013, 10:37:26 AM
to structure...@googlegroups.com
Hello

I'm using STRUCTURE for the first time and have a question about coding my SNP data. I have a biallelic SNP at each marker for all individuals. I understand that alphabetic SNP data need to be coded numerically (e.g. A=1, T=2, C=3, G=4). However, do I need to enter each individual in two rows, with one allele on the first line and the other on the second? I do not have any phase information. Any input would be appreciated. Thanks in advance.

best,
Saad

Saad Arif

Oct 22, 2013, 10:39:17 AM
to structure...@googlegroups.com

Also, the individuals are diploid, so I have two nucleotides at each marker.

Vikram Chhatre

Oct 22, 2013, 10:39:15 AM
to structure-software
Hello Saad,

You can put both alleles on the same line (ONEROWPERIND=1) or on two consecutive lines (ONEROWPERIND=0). In the latter case, the individual ID should be identical on both lines.
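To make the two layouts concrete, here is a minimal Python sketch that prints the same genotypes both ways. The function name `format_rows`, the individual IDs, and the genotypes are made up for illustration, and the sketch assumes the data file holds only a label column followed by the genotype columns (no population or other optional columns).

```python
def format_rows(individuals, one_row_per_ind, sep=" "):
    """Return data-file lines for diploid individuals.

    individuals: dict mapping an ID to a list of (allele1, allele2)
    tuples, one tuple per locus, already coded as integers.
    """
    lines = []
    for ind_id, genotypes in individuals.items():
        if one_row_per_ind:
            # ONEROWPERIND=1: the two alleles of each locus sit side by side.
            fields = [str(allele) for pair in genotypes for allele in pair]
            lines.append(sep.join([ind_id] + fields))
        else:
            # ONEROWPERIND=0: two consecutive lines carrying the same ID,
            # each holding one allele per locus.
            for copy in (0, 1):
                fields = [str(pair[copy]) for pair in genotypes]
                lines.append(sep.join([ind_id] + fields))
    return lines


# Hypothetical data: three biallelic SNPs, coded A=1, T=2, C=3, G=4.
example = {
    "ind1": [(1, 1), (3, 4), (1, 4)],
    "ind2": [(1, 2), (4, 4), (1, 1)],
}

print("\n".join(format_rows(example, one_row_per_ind=True)))
print("\n".join(format_rows(example, one_row_per_ind=False)))
```

Without phase information, which allele goes on which of the two lines is arbitrary in this sketch; it simply keeps the order given in the input.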

V



--
You received this message because you are subscribed to the Google Groups "structure-software" group.
To unsubscribe from this group and stop receiving emails from it, send an email to structure-softw...@googlegroups.com.
To post to this group, send email to structure...@googlegroups.com.
Visit this group at http://groups.google.com/group/structure-software.
For more options, visit https://groups.google.com/groups/opt_out.

Saad Arif

Oct 22, 2013, 10:43:58 AM
to structure...@googlegroups.com
Dear Vikram

Thanks for your response. When both alleles are on the same line for each individual, do I code a marker genotype of AA as 11, and AT as 12?

Thanks in advance.

Saad



Vikram Chhatre

Oct 22, 2013, 11:14:09 AM
to structure-software
Treat each allele as a distinct entry, i.e. write 1 2 instead of 12. A space or a tab is fine as the separator, but be consistent.
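As a rough sketch of that coding step in Python: the A=1, T=2, C=3, G=4 mapping is the one from the original question, while the helper names `encode_genotype` and `encode_individual` and the sample ID are hypothetical. Missing genotypes are not handled here; any character outside A/T/C/G raises an error.

```python
# Mapping taken from the question in this thread.
ALLELE_CODES = {"A": 1, "T": 2, "C": 3, "G": 4}


def encode_genotype(genotype, sep=" "):
    """Turn a two-letter genotype such as 'AT' into separated codes ('1 2')."""
    if len(genotype) != 2:
        raise ValueError(f"expected two nucleotides, got {genotype!r}")
    # Each allele becomes its own entry; the codes are never glued together.
    return sep.join(str(ALLELE_CODES[base]) for base in genotype.upper())


def encode_individual(ind_id, genotypes, sep=" "):
    """Build one single-row line: the ID followed by both alleles per locus."""
    return sep.join([ind_id] + [encode_genotype(g, sep) for g in genotypes])


print(encode_genotype("AA"))
print(encode_genotype("AT"))
print(encode_individual("ind1", ["AA", "AT", "CG"]))
```

Passing the same `sep` everywhere keeps the separator consistent across the whole line.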

V

