Hello folks,
We are forwarding a message from the organizers of the Multimodal Intelligence: Next Token Prediction & Beyond workshop @ ICLR 2026. We believe that it may be of interest for the general discrete diffusion research community.
====================
The workshop "Multimodal Intelligence: Next Token Prediction & Beyond" will take place at ICLR 2026, in April 2026 in Rio de Janeiro, Brazil. The workshop focuses on advancing multimodal foundation models beyond classic next-token prediction, toward unified modeling across images, language, audio, video, and embodied environments. Submissions are welcome ranging from full papers to early ideas and work-in-progress. Topics include autoregressive multimodal models, predictive encoders, discrete diffusion approaches, and hybrid paradigms, with emphasis on representation quality, scaling behavior, data efficiency, and cross-paradigm insights.
Submissions are open on OpenReview: https://openreview.net/group?id=ICLR.cc/2026/Workshop/MM_Intelligence
Website & CFP: https://mmintelligence.github.io/
You can also cc me (m.m.der...@uva.nl) if people have questions or want to help by doing a review.
The following is my Linkedin account: https://www.linkedin.com/in/mmderakhshani/.
====================
Yours truly,
Subham, Justin, Zhihan
Website, Twitter, Discord, YouTube