I don’t think that qtl2 provides this in a manner that’s exposed to the user. For association mapping, qtl2 imputes the founder SNPs onto the DO diplotypes and then fits the mapping model at each SNP. But the full sequence of each DO genome is never computed. Also, there is considerable uncertainty, on the order of kilobases, about where crossovers occur. So if you are trying to impute full sequences, you’ll have to accept some heuristic compromise in crossover regions. You might be able to look in the scan1snps. R code and see how the SNP probs are imputed. Look for this code:
# snpinfo -> add index
snpinfo <- index_snps(map, snpinfo)
# genoprob -> snpprob
snp_pr <- genoprob_to_snpprob(genoprobs, snpinfo)
You may have to access these functions using qtl2:::index_snps(). But then you’ll have SNP positions and you could insert them into the reference genome. It’s not perfect since it excludes indels and SVs, but it might work. You’d probably be using BioStrings from the Bioconductor suite as well.
I don’t know of a package that performs this task, but I’ve never needed to do it before, so I haven’t looked. I think that you’ll need to use the founder assemblies, which may not contain accurate structural variants, to do what you want to do. And you’ll need enough disk space . The Ensembl GRCm39 genome is ~ 770MB zipped. So you’d multiply that by the number of DO mice that you have.
Also, I’d strongly suggest working with the current reference genome, which is GRCm39. mm10 is over a decade old.
Dan
--
You received this message because you are subscribed to the Google Groups "R/qtl2 discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
rqtl2-disc+...@googlegroups.com.
To view this discussion visit
https://groups.google.com/d/msgid/rqtl2-disc/637b2198-c029-4eb7-a531-9edbb3150f07n%40googlegroups.com.