Every morning there will two lectures while in the evening either student presentation (Monday) or practice in laboratories (Tuesday, Thursday, Friday) will take place. The last day (Friday), the participants will have the chance to attend the presentation of several case studies.
Keywords of the Lectures
System architectures
- end-to-end document analysis systems
- Common issues & implications
- performance evaluation
Pre-processing
- Binarization
- Noise removal
- Enhancement
- Show through cancellation
- Geometric correction (page curl removal, Dewarping, Slant&Skew correction)
- Skeletonization
Layout analysis
- Segmentation and region classification (physical layout)
- Logical layout analysis. Reading order.
Recognition
- Machine printed text recognition
- Handwriting recognition
- Graphics recognition
Evaluation
- Datasets
- Metrics
- Examples of performance evaluation methodologies and protocols
Indexing
- Inverted files
- Hashing
- Latent semantic indexing (LSI)
- information spotting
Document classification
- Document classification
- Document retrieval