PRELIMINARY PROGRAM

 

2001 Symposium on

Document Image Understanding

Technology

 

 

April 23-25, 2001

 

Sheraton Columbia Hotel

Columbia, Maryland

 

Post-symposium information will be available on the SDIUT Home Page

http://lamp.cfar.umd.edu/SDIUT

If you have any questions, please contact the Symposium Coordinators at

Phone: (301) 405-6444

Fax: (301) 314-9658

E-mail: sdiut01@cfar.umd.edu

SDIUT01, UMIACS, University of Maryland,

College Park, MD 20742



 

 

Sunday, April 22, 2001

 

6:00–8:00 PM                Welcome Registration

 

 

Monday, April 23, 2001

 

7:30 AM                Registration/Breakfast

 

8:30 AM                Welcome/Introduction

 

9:00–10:15                Session 1 - Preprocessing

                                Chairs: Richard Schwartz and Gary Kuhn

 

Line by Line Script Identification

C. Cumbee -- DoD

 

Gaussian Model-Based Image Binarization for Text Extraction

T. Drayer -- DoD

 

Low Resolution Expansion of Gray Scale Text Image Using

Gibbs-Markov Random Field Model

P. Thouin, Y. Du and C.-I. Chang -- DoD, UMBC

 

Bilevel Image Degradations: Effects and Estimation

E. Barney Smith -- Boise State University

 

10:15–10:45                Morning Break

 

10:45–12:00                Session 2 - Multimedia

                                Chair: Ken Cantwell

 

A Framework for Reliable Text Based Indexing of Video

R. Kasturi, S. Antani, D.J. Crandall and V.Y. Mariano -- PSU

 

Digital Camera for Document Acquisition

P. Fisher -- ARL

 

Recognition of Text in 3D Scenes

G. Myers, R.C. Bolles, Q.-T. Luong, and J.A. Herson -- SRI

 

MediaBrowse: A Workbench for Multimedia Information Fusion

J. Liang and G. Marchisio -- Insightful Corp.

 

 

12:00–1:15                Lunch

 

1:15–2:15                Keynote Speaker

                                                Document Appliances in Practices

                                                Kurt Piersol -- Ricoh Silicon Valley


2:30–3:30                 Session 3 - Projects and Applications

                                Chairs: Larry Spitz and Melissa Holland

 

The CMU-Seagate Historical New York Times Project

R. Thibadeau, C. DeWan, J. Young and D. Marous -- CMU

 

Overview of the DjVu Document Compression Technology

Y. LeCun, L. Bottou, P. Haffner, J. Triggs, B. Riemers,

L. Vincent -- AT&T Labs, LizardTech, Inc.

 

Performance Metrics for the Electronic Conversion of Large Archival Document Collections

T. Nartker -- UNLV-ISRI

 

 

3:30–4:00                 Afternoon Break

 

4:00–5:00                 Session 4 - Performance Evaluation

                                Chair: Dan Lopresti

 

OCR Accuracy of Three Systems on English and Russian Documents of Highly Varying Quality

K. Summers -- Highland Technologies

 

Truthing, Testing, Evaluation Issues in Complex Systems

S. Setlur, V. Govindaraju, S. Srihari, A. Lawson -- CEDAR, SUNY Buffalo,

US Postal Service

 

What System Developers Need to Select OCR for Authentic Tasks: Evaluating End-to-End Systems

M. Holland, C. Schlesiger, L. Hernandez -- ARL

 

 

Tuesday, April 24, 2001

 

7:30 AM                Registration/Breakfast

 

9:00–10:15                 Session 5 - Foreign Language Information Retrieval

                                Chairs: Kazeem Taghva and John Kovarik

 

Document Image Retrieval Techniques for Chinese

Y.-H. Tseng and D. Oard -- Fu Jen Catholic Univ., UMCP

 

Advances in Arabic Text Recognition

J. Trenkle, A. Gillies, E. Erlandson, S. Schlosser,

and S. Cavin -- NovoDynamics, Inc.

 

Experiments in Trilingual Cross-Language Information Retrieval

G. Marchisio, J. Liang -- Insightful Corp.

 

 

10:15–10:45                 Morning Break

 


10:45–12:00                 Session 6 - Page Analysis and Classification

                                Chairs: Thomas Breuel and Nigel Dewdney

 

Binary Document Image Using Similarity of Multiple Texture Features

D. Doermann and J. Liang -- UMCP

 

Style-Directed Document Segmentation

L. Spitz -- Document Recognition Technologies

 

Evaluating Document Analysis Results via Graph Probing

D. Lopresti and G. Wilfong -- Bell Labs., Lucent Technologies

 

Applications of the Turbo Recognition Approach to Layout Analysis

T. Tokuyasu -- UC Berkeley

 

12:00–1:15                 Lunch

 

1:15–2:15                 Keynote Speaker

                                The Making of America Project

                                                Maria Bonn -- University of Michigan

 

2:15–3:30                 Session 7 - Indexing and Retrieval

                                Chair: Doug Oard

 

A Conceptual Model and Image Similarity

N. Dewdney -- DoD

 

Recognize, Categorize, and Retrieve

K. Taghva, T. Nartker and J. Borsack -- UNLV-ISRI

 

Large-Scale Duplicate Document Detection in Operation

M. Turner, Y. Katsnelson, J. Smith -- Highland Technologies and DoD

 

Shape Extraction from Digital Document Images

G. Becker and P. Brock -- Magnify Research, Inc., GW Univ.

 

3:30–4:00                 Afternoon Break

 

4:00–5:00                 Demos & Abstracts

 

                                               

Integrating OCR and Machine Translation on Non-Traditional Languages

C. Schlesiger, M. Holland, L. Hernandez -- ARL

 

Creating a Digital Library from Newspaper Archives

S. Mantzaris, B. Gatos, and N. Gouraros -- Lambrakis Press SA

 

Standard Metadata for Multimedia Content

W. Chang -- NIST

 

OCR Accuracy of Three Systems on English and Russian Documents of Highly Varying Quality

K. Summers -- Highland Technologies

 


DjVu Document Compression Technology

Y. LeCun, L. Bottou, P. Haffner, J. Triggs, B. Riemers,

L. Vincent -- AT&T Labs, LizardTech, Inc.

 

A Recognition Method of the Machine-Printed Monetary Amounts

M. Koga, R. Minc, H. Sako and H. Fujisawa -- Hitachi, Inc.

 

VIPER: Tools and Techniques for Video Performance Evaluation Applied to Scene and Document Images

D. Doermann and D. Mihalcik -- UMCP

 

 

 

5:00–8:00                 Demos/Posters and Buffet Reception

 

Wednesday, April 25, 2001

 

8:00 AM                Registration/Breakfast

 

9:00–9:30                 Research Directions and Opportunities in Document Analysis

                                Steve Dennis, Department of Defense

                                                                               

9:30–10:30                 Session 8 - Text Recognition and Page Analysis

                                Chairs: David Doermann and Tom Drayer

 

OCR of Low-Resolution Text Images from Diverse Sources

P. Natarajan, R. Schwartz, J. Makhoul -- BBN Tech., Verizon

 

Document Image Analysis Research at Xerox PARC

T. Breuel and K. Popat -- Xerox PARC

 

Advanced Labeling Techniques for Scanned Document Images

D.X. Le and G. Thoma -- National Library of Medicine

 

 

10:30–11:00                 Morning Break

 

 

11:00–12:00                 Session 9 - Panel Discussions

                                Chair: David Doermann