Open Internship position -- Multi-source Online Topic Modeling (Sophia Antipolis, France)

26 views
Skip to first unread message

Serena Villata

unread,
Feb 9, 2026, 9:55:11 AM (11 days ago) Feb 9
to aixia
Master 2 -- Internship Proposal: Multi-source Online Topic Modeling

Duration. 6 months

Profile. Master 2 or equivalent in Statistics, Data Science, or Artificial Intelligence.

Contacts. Federica Granese (federica...@inria.fr) and Serena Villata (serena....@inria.fr) -- Inria Team MARIANNE; Charles Bouveyron (charles....@inria.fr) -- Inria Team MAASAI.

Location. Centre Inria d’Université Côte d’Azur, 2004, route des Lucioles BP 93 06902 Sophia Antipolis Cedex.

Gratification. Standard gratification (~600 euros/month)

Description. Online topic models are unsupervised algorithms to identify latent topics in textual data streams that continuously evolve over time. Although these methods naturally align with real-world scenarios, they have received considerably less attention from the community compared to their offline counterparts, due to specific additional challenges. In [1, 2] we propose SB-SETM, an innovative model extending the Embedded Topic Model (ETM) [2] to process data streams by merging models formed on successive partial document batches. SB-SETM (i) leverages a truncated stick-breaking construction for the topic–per-document distribution, enabling the model to automatically infer from the data the appropriate number of active topics at each timestep; and (ii) introduces a merging strategy for topic embeddings based on a continuous formulation of optimal transport adapted to the high dimensionality of the latent topic space. 
The goal of the internship is to extend SB-SETM to a multi-source online setting, where, at each time step, document batches originate from different information sources. Experiments will be conducted on disinformation datasets to analyze how sensitive topics are framed and evolve across heterogeneous information sources over time.

Key words. Topic Modeling, Natural Language Processing, Disinformation Detection.

[1] A. B. Dieng, F. J. Ruiz, and D. M. Blei. Topic modeling in embedding spaces. Transactions of the Association for Computational Linguistics, 8:439–453, 2020.
[2] F. Granese, B. Navet, S. Villata, and C. Bouveyron. Merging embedded topics with optimal transport for online topic modeling on data streams. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pages 290–307. Springer, 2025.
[3] F. Granese, S. Villata, and C. Bouveyron. Stick-breaking embedded topic model with continuous optimal transport for online analysis of document streams. International Conference on Artificial Intelligence and Statistics, 2026.

Serena Villata

unread,
Feb 9, 2026, 9:57:16 AM (11 days ago) Feb 9
to ai...@aixia.it

Serena Villata

unread,
Feb 9, 2026, 9:59:35 AM (11 days ago) Feb 9
to aixia

Shekufeh Shafee

unread,
Feb 11, 2026, 8:19:07 AM (9 days ago) Feb 11
to Serena Villata, ai...@aixia.it
Hello 
I am very interested in this open intern position and I do qualify, I believe. ..
Please tell me How should I apply and/or share my resume to be considered for this role?!

Thank you so much in advance . 
Looking forward to hearing from you soon...
Best regards , 
--
--
Hai ricevuto questo messaggio in quanto sei iscritto al gruppo Google
"AIxIA mailing list".
Per mandare un messaggio a questo gruppo, invia una email a
ai...@aixia.it
Solo gli iscritti possono inviare messaggi.
Per annullare l'iscrizione a questo gruppo, invia un'email a aixia+un...@aixia.it
Per iscriversi visita
http://groups.google.com/a/aixia.it/group/aixia
Puoi iscriverti con il tuo account aixia.it o gmail.com
 
You received this message because you have subscribed the Google group "AIxIA mailing list".
To send a message to this group, send an email to ai...@aixia.it
Only the subscribers can send to this group.
To cancel the subscription to this group, send an email to aixia+un...@aixia.it
To subscribe, visit
http://groups.google.com/a/aixia.it/group/aixia
You can subscribe with your aixia.it or gmail,com account
To unsubscribe from this group and stop receiving emails from it, send an email to aixia+un...@aixia.it.


--

Sh.Shafeie

"Everyone knows everything and everyone is not born yet." Bozorgmehr - Wikipedia

Reply all
Reply to author
Forward
0 new messages