Every morning there will two lectures while in the evening either student presentation  (Monday) or practice  in laboratories (Tuesday, Thursday, Friday) will take place. The last day (Friday), the participants will have the chance to attend the presentation of several case studies.

 


Keywords of the Lectures

 
 

System architectures

 -          end-to-end document analysis systems

 -          Common issues & implications

 -          performance evaluation

 

Pre-processing

 -          Binarization

 -          Noise removal

 -          Enhancement

 -          Show through cancellation

 -          Geometric correction (page curl removal, Dewarping, Slant&Skew correction)

 -          Skeletonization

 

Layout analysis

 -          Segmentation and region classification (physical layout)

 -          Logical layout analysis. Reading order.

 

Recognition

 -          Machine printed text recognition

 -          Handwriting recognition

 -          Graphics recognition

 

Evaluation

 -          Datasets

 -          Metrics

 -          Examples of performance evaluation methodologies and protocols

 

 Indexing

 -          Inverted files

 -          Hashing

 -          Latent semantic indexing (LSI)

 -          information spotting

 

Document classification

 -          Document classification

 -          Document retrieval