So, how critical is it that the whitening is done with the in-domain data?
| Whitening | DER (%), supervised calibration | DER (%), oracle # speakers |
|---|---|---|
| IND whiten | 11.04 | 8.62 |
| OOD whiten | 14.61 | 11.56 |
| No whiten | 12.57 | 10.33 |
Thanks. In that case you still do length normalization, right?
Also, how do the i-vectors from the `sid` and `diarization` dirs differ from the ones used for neural nets?
Does it make sense to perform the whitening with the test data?
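
To make the operations under discussion concrete, here is a minimal NumPy sketch of estimating a whitening transform on one set of vectors, applying it to another, and then length-normalizing. It is an illustration under assumptions, not the toolkit's implementation: the function names are hypothetical, the whitening is a plain eigendecomposition of the sample covariance, and length normalization here scales each vector to unit norm (a toolkit may use a different target norm).

```python
import numpy as np

def estimate_whitening(X):
    """Estimate the mean and a whitening matrix W from rows of X.

    After centering, (X - mu) @ W.T has (sample) identity covariance.
    """
    mu = X.mean(axis=0)
    cov = np.cov(X - mu, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    eigvals = np.maximum(eigvals, 1e-10)  # floor tiny eigenvalues for stability
    # Columns of eigvecs are eigenvectors; scale each by 1/sqrt(eigenvalue).
    W = (eigvecs / np.sqrt(eigvals)).T
    return mu, W

def whiten(X, mu, W):
    """Apply a previously estimated centering and whitening transform."""
    return (X - mu) @ W.T

def length_normalize(Y):
    """Scale every row to unit Euclidean norm (assumed target norm)."""
    norms = np.linalg.norm(Y, axis=1, keepdims=True)
    return Y / np.maximum(norms, 1e-12)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Random vectors standing in for i-vectors; not real data.
    whitening_set = rng.normal(size=(2000, 5)) * np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    test_set = rng.normal(size=(100, 5)) * np.array([1.0, 2.0, 3.0, 4.0, 5.0])

    # Estimate on one set (in-domain, out-of-domain, or the test data itself),
    # then apply the same transform to the vectors being clustered.
    mu, W = estimate_whitening(whitening_set)
    processed = length_normalize(whiten(test_set, mu, W))
    print(processed.shape)
```

The IND/OOD distinction only changes which data is passed to `estimate_whitening`; the transform applied afterwards is the same. Whitening with the test data would correspond to calling `estimate_whitening(test_set)`, which needs no labels.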