Follow-up to the follow-up: this was NOT a bug, after all, just a
conceptually very tricky thing to spot.
The issue, is that you've not recapitated, and so there are (rare)
places on the genome where the whole genome has not coalesced. Since
you're looking at genotypes, mostly this just shows up as having a bit
less diversity than you would otherwise (since any mutations
segregating in the initial population -- i.e., that would have
occurred on the missing ancestral history -- are missing). However,
this node at this position was in the unlikely situation of being all
by itself - it inherited from a node in the first generation that
no-one else did. When you simplified, that node's ancestor got
removed, and so it was a node with no parent at that position. And,
that happens to be how tskit represents missing data.
So: the solution is to recapitate first!
-- peter