Hello,
I'm currently working on two assemblies for plant species with very large genomes (~2 Gbp). Both species are within the same genus, diploid, and are expected to have 12 chromosomes based on previous studies. For both species, I have an 'extra' 13th scaffold that is an order of magnitude smaller than the other pseudomolecule scaffolds, and significantly larger than the debris scaffolds after it. It doesn't have strong hic contacts with any of the pseudomolecules outside of telomeric regions, where it has contact with every pseudomolecule's telomeric region. We originally assumed it was mostly repeat sequence, but after a busco analysis we realized it was rather rich in busco hits. The species with the better contact map has 22% of single copy, complete buscos aligning to the 13th scaffold, whereas the species with the poorer quality map has ~30% of single copy buscos.
Have you seen/heard of anything like this before? We're currently at a loss for how to deal with it, and I'm hoping you have some insights.
Below is a screenshot from the species with a stronger contact map, hopefully demonstrating what I'm talking about.