Adrian Ulges


About

I am a researcher with the IUPR group at the German Research Center for Artificial Intelligence (DFKI).

My fields of interest are pattern recognition for visual information and multimedia retrieval. In the past, my work was focused on document image processing, text recognition, and camera-based document capture. Now that I work on my PhD, my interest has shifted towards video annotation and learning methods to improve recognition in video.

I hold a diploma degree in computer science from the Technical University Kaiserslautern. Currently, I am working on my PhD.

Contact Information

Adrian Ulges
DFKI GmbH
Trippstaedter Str. 122
D-67663 Kaiserslautern
Germany

phone: ++49 (631) 20575-419
fax: ++49 (631) 20575-402
e-mail: adrian.ulges at dfki.de

Curriculum Vitae

resume in pdf format

Research

Concept Learning from Web Video

Web-based portals like YouTube offer an information source from which video tagging engines can automatically learn without manual intervention. We have developed a system that autonomously learns to tag videos with semantic concepts by analyzing YouTube videos.

[project website] [demo] [press] [@heise]



Detecting Pornographic and Illegal Images and Videos

As the load of images and videos on the internet and in users' personal collections grows, the blocking of pornographic (or even illegal) image content becomes a serious issue. In the European research project FIVES, we support police forces with searching confiscated material with pattern recognition methods.

[project website]



Learning from Weakly Labeled Content

Visual Learning from the Web is difficult as tag information at flickr or YouTube is coarse and unreliable. We have developed a statistical framework that achives robustness with respect to weak labels by separating truly relevant content from non-relevant one. This is shown in the picture for the concept “basketball”, where relevant content (left) is separated from non-relevant one (right).

[publication]


Motion Segmentation and Object Recognition

Another interest of mine is the interaction between motion segmentation and object recogntion. Motion segmentation can give an segmentation of moving objects from their background. We have developed a simple recognition approach that uses motion segmentation to discard background clutter and to improve the generalization capabilities of object recognition.

[contact] [publications to come...]


OCR for Camera-based Document Capture

Imagine you capture a document by pointing your camera at it (or a camera does this automatically for you). While this simplifies capture, information extraction is more difficult due to issues like low resolution and distorsion. For this purpose, we have developed an optical character recognition (OCR) system for noisy, warped, and low-resolution text. The system was integrated with a user interface such that users can “google” in their camera-captured document snapshots.

[contact]



Document Image Dewarping

Document images delivered by digital cameras come with geometric distorsion that makes OCR difficult. This problem is difficult for curled book surfaces. To make information extraction from such images possible, we have developed novel approaches of dewarping text based on a depth model of the paper surface.

[demo] [publication] [publication]




Publications

list of publications

Teaching

  • 2009: lab course (Praktikum): "ACM Multimedia Grand Challenge", Master's level

  • 2009: diploma thesis “Topic Models for Video” (Jörn Wanke)

  • 2009: diploma thesis “Statistical Classification of Image Content for Information Filtering

(Christian Jansohn)

  • 2008: diploma thesis "Shape Matching for Automatic Text Reading in Natural Scenes" (Marius Renn)

  • 2008: internship “Style Modelling for Tagging Personal Photo Collections(Manni Duan)

  • 2008: bachelor's thesis “Statistical Detection of Non-scene Text for Content-based Video Retrieval

(Markus Koch)

  • 2008: lab course (Praktikum): “Video Segmentation and Recognition”, Master's level

  • 2007: lecture “Pattern Recognition and Statistical Learning” (tutorials+lectures)

  • 2007: lecture “Image and Video Processing” (tutorials+lectures)

  • 2007: lab course (Praktikum): “Screen OCR using Hidden Markov Models”, Master's level

  • 2007: internship: “camera-based HCI” (Andres Koetsier Jasper Laagland)

  • 2007: master's thesis “Dewarping Documents using a Stereo Vision System” (Soner Ozgun Pelvan)

  • 2006: lecture “Pattern Recognition and Statistical Learning” (tutorials)

  • 2006: lecture “Image and Video Processing” (tutorials)


Peer Reviewing

  • IEEE TPAMI, CVPR07, MVA07, VISAPP 08, ELCVIA CVIA, ECCV08, CVPR08, DAS08, CVPR09,MVA09


Conference Committee Member

  • program committee: ICME Workshop Internet Multimedia Search and Mining (ICMEW 2009)





Last update: 03/18/2009