Mao S, Kanungo T
SPIE Conference on Document Recognition and Retrieval. 2000 Jan:303-314.
Document page segmentation is a crucial preprocessing step in Optical Character Recognition (OCR) systems. While numerous segmentation algorithms have been proposed, there is relatively little literature on the comparative evaluation, empirical or theoretical, of these algorithms. We use a five-step methodology to quantitatively compare the performance of page segmentation algorithms.