Dear Paricipants,
We notice there are some ambiguities about track I evaluation. Here, we make some clarification.
1. There are two models to test the jailbreak prompts you submit. One is Gemma-2B-it, the other is not released.
2. There is a third model for evaluation/judging, which is not released.
3. The maximum number of injected token is 100, which is measured by the tokenizer of Gemma-2B-it. If a prompt exceeds this limit, it will receive a zero score, and this will not affect the evaluation for other prompts.
4. We have cleared-up the leaderboard and restored the submission chances for all teams.
Please feel free to let us know any further questions. Thanks again for your participation.
Best,
Organizers