Hi Dr. Broman and the R/QTL community,
I have a large concern regarding segregation distortion. I have an F2 population derived from a cross between parents of the same species. I genotyped the population using GBS/ddRADseq and both STACKS and TASSEL. The parents are inbred allotetraploid, but previous studies have demonstrated this species "behaves as a diploid" with disomic inheritance and a population has been genetically mapped using STACKS/JoinMap and ~1,000 markers.
When I use the bonferonni correction to assess for segregation distortion (see code below) I find that my markers are 88% distorted in the tassel genotyping data (4,004/4,512 markers; pval=1.108156e-05) and 97% distorted in the stacks genotyping data (3,014/3,079 markers, p.val=1.623904e-05).
Has anyone encountered this before? All I can find are discussions on removing the distorted loci (which I can't do as it's most of my data), explanations regarding increased distortion with interspecific crosses (this is the same species), different cytoplasmic environments (possibly?), preferential meiotic selection (possibly?), and infertile genotypes (I didn't notice any decrease in fertility).
Any suggestions on how to proceed would be greatly appreciated!!!
### Bonferroni correction ###
gt <- geno.table(mydata)
bon.pval.cutoff <- 0.05/totmar(mydata)
gt[gt$P.value < bon.pval.cutoff, ]
todrop <- rownames(gt[gt$P.value < bon.pval.cutoff, ])
nrow(as.data.frame(todrop))
### study genotype frequencies in individuals ###
gfreq <- apply(g, 1, function(a) table(factor(a, levels=1:3)))
gfreq <- t(t(gfreq) / colSums(gfreq))
par(mfrow=c(1,3), las=1)
for(i in 1:3)
plot(gfreq[i,], ylab="Genotype frequency", main=c("AA", "AB", "BB")[i],
ylim=c(0,1))
#export as "Genotype frequencies by individual