Hi Kris,
Thanks! I was wondering if that might be the issue. Now, GQ said that adapters are not removed nor are duplicates removed, so that's a bit odd. Though I might follow up and ask if that's in cases where data is not demultiplexed on site. Especially since I noted the reads don't seem to start with the same sequence. The FASTQ I showed was from an F2, though, just in case that makes a difference.
I am trying the run now with the
--disable_rad_check command. It certainly takes longer...
I haven't run the fastqc on it, no. I'll make a note to do that.
I'm also checking back with the contact I mentioned at GQ to see if the low quality issue for low diversity sites that he mentioned was more general or largely restricted to cut sites. The latter, from what I've read, might be fixable.
Fingers crossed this all works out and my thesis doesn't end up being "Here Is How I Failed At RADseq." I don't even want to imagine defending that...
Thank you!