LAMP - Media Group - Research - Enhancement of Text in Video

About

People

Research

Publications

Seminars

Presentations

Courses

Mosaicing

Overview

Browsing is a fundamental tool for multimedia systems. A good browsing tool should retain novel information and discard redundant information. During camera rotation, there is significant overlap of scene information between successive frames.Then a scene synopsis can be constructed by stitching or mosaicing'' individual frames. The resulting mosaic is an efficient representation of a sequence of frames because the redundant information in the frame sequence produced by the camera rotation has been discarded.

We have developed a Java application for generating mosaics. The user interface displays the successive frames at the top of the window, and the generated mosaic at the bottom. The two main operations are (1) the frame-to-frame correspondence detection, which must detect matching features or correlated groups of pixels, and (2) the image transform operation, which computes what the frames would look like in the frame of reference of a still camera with a very large field of view.

So far, our frame-to-frame correspondence has been defined by mouse clicks on matching point features on the successive frames. This approach allowed us to focus on the code for the second operation, the image transform, which is completed and produces satisfactory mosaics. The image transform operation is approximated by an affine transformation that combines a translation, a rotation, and a scaling. The 4 unknown parameters that describe the transformation are found by solving an over-determined system, in which each equation describes the information generated by a single frame-to-frame feature matching.

Focusing back to the first operation, we are now taking advantage of the fact that video sequences in the MPEG format already contain significant correspondence information. Indeed MPEG encoders generate the matchings between blocks of 16x16 pixels for most pairs of successive frames when they calculate the so-called motion vectors. We have developed tools that allow us to visualize the fields of motion vectors over frames, and images produced by these tools show that the fields of motion vectors are reliable and consistent with the observed camera motions. Our goal for the coming months is to complete a Java mosaicing application that takes an MPEG video sequence as an input and produces a summary sheet of this sequence that includes single key-frames for still shots and mosaiced key-frames for shots with significant camera motion.