Hi Atray,
Thanks for the clarifications; they were already very helpful.
cbc_gbc mapping: I just realized that the problem I described (at most 1 distinct guide per cell) only occurs in the dc_0hr, k562_tf_7, k562_tf_13 and k562_ccycle files, i.e. the dict files without the _lenient/_strict suffixes. In the files with a suffix, I see up to 10 guides per cell in the inverse mapping. Perhaps there is a difference in how they are generated or saved? I am very much looking forward to the related notebook.
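To make the check concrete, here is a minimal sketch of what I mean by the inverse mapping. It assumes each dict maps a guide barcode to a collection of cell barcodes; that layout, the toy data and the function names are made up for illustration and may not match the real files.

```python
from collections import Counter, defaultdict


def invert_mapping(guide_to_cells):
    """Invert {guide: cells} into {cell: set of distinct guides}."""
    cell_to_guides = defaultdict(set)
    for guide, cells in guide_to_cells.items():
        for cell in cells:
            cell_to_guides[cell].add(guide)
    return dict(cell_to_guides)


def guides_per_cell_histogram(guide_to_cells):
    """Count how many cells carry 1, 2, 3, ... distinct guides."""
    cell_to_guides = invert_mapping(guide_to_cells)
    return Counter(len(guides) for guides in cell_to_guides.values())


# Toy data (hypothetical): cell_2 carries three distinct guides.
toy = {
    "guide_A": ["cell_1", "cell_2"],
    "guide_B": ["cell_2", "cell_3"],
    "guide_C": ["cell_2"],
}

hist = guides_per_cell_histogram(toy)
print(dict(hist))
print("max distinct guides per cell:", max(hist))
```

On the files without a suffix, the largest key of this histogram is 1 for me; on the _lenient/_strict files it goes up to 10.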
batches: Thanks, that clears things up for me. (On a side note, not requiring an answer: that rate is ~0.5% for k562_tf_13. Can the within-batch collision rate be expected to be the same?)
All the best,
Jan