As we start gathering resources for `Track 2: Better Training Data`, we are wondering whether there are any regulations regarding the dataset that we'll submit.
For example, are there limitations on using datasets licensed under `CC BY` rather than `CC0` (or other licenses) to generate our 10K training examples?
We also wanted to ask under what license the generated training examples would be made available later, if at all.
Would we also be required to submit the code, if any, that we used to generate such questions?