Thank you Pavel for the reply.
1. We have special interest in all the certificates that were issued (and logged), although some of them were never used publicly. If you can provide access to another dataset, it would be great (mainly if the other dataset is not contained in the logged certs).
2. Mainly the certs themselves, although we also have plans analyzing the chains.
3. No. We mainly need to make sure that I have all of them. If we get another indications, the index does matter.
4. Partial dataset will help, mainly if we know that it covers some or parts of the logs (even from one or more sources).
5. We start with Chrome, but plan to continue to the others as well (the parts that do not overlap).
6. We want to preprocess them and then explore the use/misuse/abuse of certificates. Not a particular query that I can mention.
Thanks again!