I believe this is due to the fact that ELO estimates are based on new ID vs current ID; and there is not a linear relationship between corruption of knowledge and drop in ELO.
Overly simplified example: Say ID 100 is totally healthy with ELO 5000. But a bug comes along that starts to corrupt its knowledge. From ID 100 to ID 101, bug erases understanding of how knight moves. From ID 101 to ID 102, it erases knowledge of how bishop moves. At 103, bug is fixed, and over next 10 genations, the knowledge of both bishop and knight is restored.
Obviously, ID 101 is going to be much worse than ID 100: ID 100 knows how to use knight, ID101 does not. Let's say this drops ELO from 5000 to 4900.
Also, ID 102 is going to be worse than ID 101 (ID 102 can't use bishop or knight; ID 101 only can't use knight). This might also be ELO loss of -100, dropping from 4900 to 4800.
So at ID 102, we've dropped -200 ELO. However, were we to compete ID 100 directly with ID 102 (a healthy network vs one that can use neither bishop nor knight), we might see a much larger ELO drop of -800. Compared to ID 100, the actual performance of 102 might measure at 4200, not the published 4800.
Moving forward, the bug is fixed, and as both knight and bishop information is relearned, ID 103 gains ELO of +50 over 102 (to 4850); and 104 gets +50 over 103 (4900), and so forth. By ID 112, the ELO has climbed up to 5800.
And yet, ID 100 had "100 IDs-worth" of knowledge of bishop and knight. ID 112 only has "10 IDs-worth" of bishop and knight. So even though 112 has published ELO of 5800, it still is much worse than ID 100 (ELO 5000)