LEGAL2026 and CALD-pseudo 2026 at COLING2026 - Call for papers

9 views

Skip to first unread message

Xuan-Son Vu

unread,

Jan 21, 2026, 6:37:52 PM (17 hours ago) Jan 21

to Machine Learning News

Dear all,

*Apologies for cross-postings*

You are invited to submit original and unpublished research papers for the 𝐋𝐄𝐆𝐀𝐋2026 & 𝐂𝐀𝐋𝐃-𝐩𝐬𝐞𝐮𝐝𝐨 2026 𝐣𝐨𝐢𝐧𝐭 𝐰𝐨𝐫𝐤𝐬𝐡𝐨𝐩 at LREC2026.

Our main focus is on the legal aspects of language data as well as the theoretical, methodological, and technical aspects of pseudonymization, anonymization, and de-identification.

The workshop is co-located with LREC 2026 and will take place in Palma de Mallorca on the 12th of May, 2026.

Head over to our page at https://liti.info/CALD-Pseudo-2026 for more details on the topics of interest and submission guidelines.

Authors are invited to submit original and unpublished research papers in the following categories:
- Long papers (up to 8 pages) for substantial contributions
- Short papers (up to 4 pages) for: Small, focused contributions or ongoing or preliminary work

Extended abstracts for non-technical submissions only, such as conceptual, theoretical, legal, ethical, policy-oriented, or position papers. Extended abstract submissions are expected to be developed into regular papers by the camera-ready submission deadline.

Topics of interest:
- Detection and classification of personal information (PI): Automatic identification of PI in text, speech, and multimodal data; context-dependent and indirect indicators of identity.
- Replacement and transformation of PI: Context-sensitive pseudonymization and anonymization methods; substitution, masking, obfuscation; maintaining coherence across discourse and modalities.
- Utility and bias after de-identification: Effects of de-identification on downstream task performance, linguistic research validity, readability, and bias amplification or reduction.
- Approaches to evaluation and adversarial testing: Metrics and frameworks for assessing de-identification quality; adversarial re-identification attempts; robustness and failure-mode analysis.
- Dataset creation for de-identification research: Methodological, ethical, and annotation-related considerations in building corpora for training or evaluating de-identification systems.
- Low-resource scenarios: Techniques for de-identification in settings with limited data, scarce annotations, or underrepresented languages; transfer and multilingual approaches.
- Speech-specific challenges: Removing speaker identity cues in audio; voice anonymization; cross-modal leakage between text, transcripts, and acoustic features.
- Cross-disciplinary applications and challenges: Integrating de-identification techniques into real-world workflows in areas such as linguistics, social sciences, digital humanities, healthcare, and other private- or public-sector data environments.

Important dates:

- February 20, 2026: Deadline for submission
- March 11, 2026 (tentative): Notification of acceptance
- March 30, 2026: Submission of final version of accepted papers (strict)
- May 12, 2026: Workshop day

Submission link: https://softconf.com/lrec2026/LEGAL2026/

=========================================
ORGANIZING COMMITTEE of CALD-pseudo 2026:

- Maria Irena Szawerna, University of Gothenburg, Sweden
- Simon Dobnik, University of Gothenburg, Sweden
- Therese Lindström Tiedemann, University of Helsinki, Finland
- Pierre Lison, Norwegian Computing Center & University of Oslo, Norway
- Ildikó Pilán, Norwegian Computing Center, Norway
- Ricardo Muñoz Sánchez, University of Gothenburg, Sweden
- Lisa Södergård, University of Helsinki, Finland
- Elena Volodina, University of Gothenburg, Sweden
- Xuan-Son (Sonny) Vu, Lund University, Sweden
=========================================

---

Sonny Vu (PhD.)

Asst. Prof. at RSS Lab, Lund University (LTH), CTO at WASP Media &

Language and AI for Vietnam, Founder and CEO at DeepTensor AB

Reply all

Reply to author

Forward

0 new messages