Topics:
- Synthesis:
- Techniques for partially editing content/background/emotion/prosody/object/etc. of audio/speech/music/singing or multimodal audio-visual media.
- Methods to ensure acoustic and perceptual consistency after editing
- Datasets, benchmarks, and toolkits for partial audio/speech editing
- Unified models for zero-shot TTS (continuation) and speech editing (infilling)
- Partial audio/speech editing in more complex scenarios, such as long-form and/or multi-speaker conversations, noisy backgrounds, multilingual editing, etc.
- Fairness, biases, harms, risks and socio-ethical failures of partial editing.
- Defense:
- Detection, localization, and diarization of partially edited audio
- Proactive protection against partial edits, such as watermarking
- Adaptation and generalization methods for identifying edits
- Human vs. machine performance in detecting partially edited audio
- Explainability, interpretability and transparency techniques for defense against partial edits in speech
- Ethics of data collection, annotation, and use of data for speech editing.
- Fairness and biases in defending against audio/speech/music/singing editing.
- Joint defense against partial editing with other downstream tasks, such as ASV and ASR
- Other novel topics related to audio/speech/music/singing editing
Important Dates (AoE) - The schedule is the same as for the regular sessions at IEEE SLT 2026:
- Paper submissions open: March 4, 2026 (We welcome your submissions!)
- Paper submissions due: June 17, 2026
- Paper revision due: June 24, 2026
- Rebuttal Period: July 29 – August 4, 2026
- Acceptance Notification: September 1, 2026
- Camera-ready Deadline: September 16, 2026
- Workshop Date: one day within December 13-16, 2026
We look forward to your contributions and to seeing you at IEEE SLT 2026.
For any questions, please contact the organizers at: parti...@googlegroups.com
Best,
Lin Zhang@JHU, David Harwath@UT Austin, Xin Wang@NII, You Zhang@UR, Bowen Shi@Meta, Nicholas Evans@EURECOM, Sanjeev Khudanpur@JHU