|
|
About
I am a researcher with the IUPR
group at the German Research
Center for Artificial Intelligence (DFKI).
My fields of interest are
pattern recognition for visual information and multimedia
retrieval. In the past, my work was focused on document image
processing, text recognition, and camera-based document capture.
Now that I work on my PhD, my interest has shifted towards video
annotation and learning methods to improve recognition in video.
I hold a diploma degree in
computer science from the Technical
University Kaiserslautern. Currently, I am working on my PhD.
Contact Information
Adrian Ulges DFKI
GmbH Trippstaedter Str. 122 D-67663 Kaiserslautern Germany
phone: ++49 (631)
20575-419 fax: ++49 (631) 20575-402 e-mail: adrian.ulges at
dfki.de
Curriculum Vitae
resume
in pdf format
Research
|
Concept
Learning from Web Video
Web-based
portals like YouTube offer an information source from which
video tagging engines can automatically learn without manual
intervention. We have developed a system that autonomously
learns to tag videos with semantic concepts by analyzing
YouTube videos.
[project
website] [demo]
[press]
[@heise]
|

|
|
Detecting Pornographic
and Illegal Images and Videos
As the load of images and
videos on the internet and in users' personal collections
grows, the blocking of pornographic (or even illegal) image
content becomes a serious issue. In the European research
project FIVES, we support police forces with searching
confiscated material with pattern recognition methods.
[project
website]
|

|
|
Learning
from Weakly Labeled Content
Visual
Learning from the Web is difficult as tag information at
flickr or YouTube is coarse and unreliable. We have developed
a statistical framework that achives robustness with respect
to weak labels by separating truly relevant content from
non-relevant one. This is shown in the picture for the concept
“basketball”, where relevant content (left) is separated
from non-relevant one (right).
[publication]
|

|
|
Motion
Segmentation and Object Recognition
Another
interest of mine is the interaction between motion
segmentation and object recogntion. Motion segmentation can
give an segmentation of moving objects from their background.
We have developed a simple recognition approach that uses
motion segmentation to discard background clutter and to
improve the generalization capabilities of object recognition.
[contact]
[publications to come...]
|

|
|
OCR
for Camera-based Document Capture
Imagine
you capture a document by pointing your camera at it (or a
camera does this automatically for you). While this simplifies
capture, information extraction is more difficult due to
issues like low resolution and distorsion. For this purpose,
we have developed an optical character recognition (OCR)
system for noisy, warped, and low-resolution text. The system
was integrated with a user interface such that users can
“google” in their camera-captured document snapshots.
[contact]
|

|
|
Document
Image Dewarping
Document
images delivered by digital cameras come with geometric
distorsion that makes OCR difficult. This problem is difficult
for curled book surfaces. To make information extraction from
such images possible, we have developed novel approaches of
dewarping text based on a depth model of the paper surface.
[demo]
[publication]
[publication]
|

|
Publications
list
of publications
Teaching
2009:
lab course (Praktikum): "ACM Multimedia Grand Challenge",
Master's level
2009:
diploma thesis “Topic Models for Video” (Jörn Wanke)
2009:
diploma thesis “Statistical Classification of Image Content
for Information Filtering”
(Christian
Jansohn)
2008:
diploma thesis "Shape
Matching for Automatic Text Reading in Natural Scenes"
(Marius Renn)
2008:
internship “Style
Modelling for Tagging Personal Photo Collections”
(Manni Duan)
2008:
bachelor's thesis “Statistical Detection of Non-scene Text
for Content-based Video Retrieval”
(Markus
Koch)
2008:
lab course (Praktikum): “Video Segmentation and Recognition”,
Master's level
2007:
lecture “Pattern Recognition and Statistical Learning”
(tutorials+lectures)
2007:
lecture “Image and Video Processing” (tutorials+lectures)
2007:
lab course (Praktikum): “Screen OCR using Hidden Markov
Models”, Master's level
2007:
internship: “camera-based HCI” (Andres Koetsier
Jasper Laagland)
2007:
master's thesis “Dewarping Documents using a Stereo Vision
System” (Soner Ozgun Pelvan)
2006:
lecture “Pattern Recognition and Statistical Learning”
(tutorials)
2006:
lecture “Image and Video Processing” (tutorials)
Peer Reviewing
IEEE
TPAMI, CVPR07, MVA07, VISAPP 08, ELCVIA CVIA, ECCV08, CVPR08,
DAS08, CVPR09,MVA09
Conference Committee Member
|