---------- Forwarded message ----------
From:
Soraya Panahi <sor...@gmail.com>
Date: Mon, Jan 14, 2013 at 7:56 AM
Subject: Wednesday seminar at Sharif
To:
Cc: Mehrdad Mirshams Shahshahani <
mshahs...@gmail.com>
Hello everybody,
The speaker for this week is Neda Sabagh Pour. She will give a lecture about "
End-to-End Scene Text Recognition". You can find the abstract of this lecture in the following.
The seminar will be held at 3 p.m. at the Machine Intelligence and Vision Lab. (
MIVLab is marked in the attached map. The lab is located in the underground floor, next to the Sharif newspaper office.)
Bests,
Soraya Panahi
abstract:
In this talk we will focus on the
problem of word detection and recognition in natural images. The problem
is significantly more challenging than reading text in scanned
documents, and has only recently gained attention from the computer
vision community. Sub-components of the problem, such as text detection
and cropped image word recognition, have
been studied. However, what is unclear is how these
recent approaches contribute to solving the end-to-end problem of word
recognition.
We fill this gap by constructing and evaluating two systems. The first, representing the de
facto state-of-the-art, is a two stage pipeline consisting of text
detection followed by a leading OCR engine. The second is a system
rooted
in generic object recognition. We show that the latter
approach achieves superior performance. While scene text recognition
has generally been treated with highly domain-specific methods, our
results demonstrate the suitability of applying generic
computer vision methods. Adopting this approach opens the door for real world scene text recognition to benefit from the
rapid advances that have been taking place in object recognition.