Hi all,
Today we once more updated the web page of the Retrieval-Augmented Debating task:
https://touche.webis.de/clef25/touche25-web/retrieval-augmented-debating.html
As part of this, we added 100 (simulated) example debates, one for each of the 100 claims we released earlier.
Each debate consists of 5 user messages and 5 system messages (“responses”). We manually judged each response for its debate quality, adopting Grice’s four maxims (Quantity, Quality, Relation, and Manner) as quality criteria, with a binary judgment for each. A response can thus score between 0 points (no maxim fulfilled) and 4 points (all maxims fulfilled).
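To illustrate the scoring, here is a tiny Python sketch of one judged response; the field names are our own illustration, not the official label format:

    # Illustrative only: the field names are assumptions, not the official label format.
    response_judgment = {
        "quantity": 1,  # maxim of quantity fulfilled
        "quality": 1,   # maxim of quality fulfilled
        "relation": 0,  # maxim of relation not fulfilled
        "manner": 1,    # maxim of manner fulfilled
    }

    # The debate-quality score of a response is the number of fulfilled maxims (0-4).
    score = sum(response_judgment.values())
    print(score)  # 3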
When you develop a debate system (sub-task 1), keep these criteria in mind.
When you develop an evaluation system (sub-task 2), you can use our labels as a training set. Note that your evaluation system can address a single criterion or up to all four criteria/maxims. We will score the evaluation systems for each criterion independently.
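Since the criteria are scored independently, one simple starting point for sub-task 2 is to train one classifier per maxim and predict each binary label on its own. The sketch below assumes you have the response texts and their four binary labels available in some form; the toy data, field names, and the scikit-learn model are illustrative choices, not part of the task setup:

    # Rough per-criterion baseline sketch; the actual label file format and any
    # preprocessing are up to you and not specified here.
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression

    MAXIMS = ["quantity", "quality", "relation", "manner"]

    # Toy stand-in for the released judgments (purely illustrative).
    responses = [
        "Renewable energy reduces emissions, as several studies show.",
        "You are simply wrong about everything.",
        "Nuclear power is one option; however, waste storage remains unsolved.",
        "Bananas are yellow.",
    ]
    labels = {
        "quantity": [1, 0, 1, 0],
        "quality":  [1, 0, 1, 1],
        "relation": [1, 1, 1, 0],
        "manner":   [1, 0, 1, 1],
    }

    # One independent classifier per maxim.
    vectorizer = TfidfVectorizer()
    features = vectorizer.fit_transform(responses)
    classifiers = {}
    for maxim in MAXIMS:
        clf = LogisticRegression()
        clf.fit(features, labels[maxim])
        classifiers[maxim] = clf

    # Predict the four binary judgments for a new response.
    new_features = vectorizer.transform(
        ["Solar panels have become much cheaper over the last decade."]
    )
    predictions = {m: int(classifiers[m].predict(new_features)[0]) for m in MAXIMS}
    print(predictions)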
We are nearly done with setting up the submission system. In case you can’t wait, you can already take a look at basic systems (without generation) that we prepared in Python [1] and JavaScript [2] and that can serve as a starting point for developing your system. Note that they might change slightly in the next few days.
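Conceptually, a debate system is a small web service that receives the conversation so far and returns the next system response. The sketch below shows such a stub in Python with Flask; the endpoint name and the request/response format are assumptions for illustration only, so please follow the baselines in [1] and [2] for the actual interface expected by the submission system:

    # Minimal debate-system stub (no retrieval, no generation). Endpoint and payload
    # format are assumptions; see the baselines [1, 2] for the real interface.
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.post("/respond")
    def respond():
        payload = request.get_json(force=True)
        # Assumed format: {"messages": [{"role": "user"|"system", "content": "..."}]}
        messages = payload.get("messages", [])
        last_user_message = next(
            (m["content"] for m in reversed(messages) if m.get("role") == "user"), ""
        )
        # Echo-style placeholder instead of a generated counter-argument.
        reply = f"I see your point about '{last_user_message}', but have you considered the opposite view?"
        return jsonify({"content": reply})

    if __name__ == "__main__":
        app.run(port=8080)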
That’s it for now. We are looking forward to seeing your approaches, and please ask questions if you have any,
Johannes