* Language and vision (captioning, NLP-based search, VQA, visual dialogs)
* Video spatio-temporal understanding
* Object and category recognition/detection
* Semantic representation and recognition
* Representation and deep learning
* Weakly-supervised, semi-supervised or unsupervised learning
The candidate will actively participate in research and publishing, work with graduate students, and have opportunities to teach. For further information, please contact Leonid Sigal (lsi...@disneyresearch.com). The start date for the position is negotiable.