LEGAL2026 and CALD-pseudo 2026 at LREC 2026 - Call for papers

Xuan-Son Vu

Jan 21, 2026, 6:37:52 PM
to Machine Learning News
Dear all,

*Apologies for cross-postings*

You are invited to submit original and unpublished research papers for the LEGAL2026 & CALD-pseudo 2026 joint workshop at LREC 2026.

⚖️ Our main focus is on the legal aspects of language data as well as the theoretical, methodological, and technical aspects of pseudonymization, anonymization, and de-identification.
📅 The workshop is co-located with LREC 2026 and will take place in Palma de Mallorca on the 12th of May, 2026.
➡️ Head over to our page at https://liti.info/CALD-Pseudo-2026 for more details on the topics of interest and submission guidelines.

Authors are invited to submit original and unpublished research papers in the following categories:
- Long papers (up to 8 pages) for substantial contributions
- Short papers (up to 4 pages) for small, focused contributions or for ongoing or preliminary work

Extended abstracts are accepted for non-technical submissions only, such as conceptual, theoretical, legal, ethical, policy-oriented, or position papers. Extended abstract submissions are expected to be developed into regular papers by the camera-ready submission deadline.

Topics of interest:
- Detection and classification of personal information (PI): Automatic identification of PI in text, speech, and multimodal data; context-dependent and indirect indicators of identity.
- Replacement and transformation of PI: Context-sensitive pseudonymization and anonymization methods; substitution, masking, obfuscation; maintaining coherence across discourse and modalities.
- Utility and bias after de-identification: Effects of de-identification on downstream task performance, linguistic research validity, readability, and bias amplification or reduction.
- Approaches to evaluation and adversarial testing: Metrics and frameworks for assessing de-identification quality; adversarial re-identification attempts; robustness and failure-mode analysis.
- Dataset creation for de-identification research: Methodological, ethical, and annotation-related considerations in building corpora for training or evaluating de-identification systems.
- Low-resource scenarios: Techniques for de-identification in settings with limited data, scarce annotations, or underrepresented languages; transfer and multilingual approaches.
- Speech-specific challenges: Removing speaker identity cues in audio; voice anonymization; cross-modal leakage between text, transcripts, and acoustic features.
- Cross-disciplinary applications and challenges: Integrating de-identification techniques into real-world workflows in areas such as linguistics, social sciences, digital humanities, healthcare, and other private- or public-sector data environments.


Important dates:

- February 20, 2026: Deadline for submission
- March 11, 2026 (tentative): Notification of acceptance
- March 30, 2026: Submission of final version of accepted papers (strict)
- May 12, 2026: Workshop day

Submission link: https://softconf.com/lrec2026/LEGAL2026/ 

=========================================
ORGANIZING COMMITTEE of CALD-pseudo 2026:

- Maria Irena Szawerna, University of Gothenburg, Sweden
- Simon Dobnik, University of Gothenburg, Sweden
- Therese Lindström Tiedemann, University of Helsinki, Finland
- Pierre Lison, Norwegian Computing Center & University of Oslo, Norway
- Ildikó Pilán, Norwegian Computing Center, Norway
- Ricardo Muñoz Sánchez, University of Gothenburg, Sweden
- Lisa Södergård, University of Helsinki, Finland
- Elena Volodina, University of Gothenburg, Sweden
- Xuan-Son (Sonny) Vu, Lund University, Sweden
=========================================

---

Sonny Vu (PhD.)

Asst. Prof. at RSS Lab, Lund University (LTH), CTO at WASP Media & Language and AI for Vietnam, Founder and CEO at DeepTensor AB
