Skip to main content

Recognising text in real scenes

Paul Clark, Majid Mirmehdi, Recognising text in real scenes. International Journal on Document Analysis and Recognition, 4 (4). ISSN 1433-2833, pp. 243–257. August 2002. No electronic version available. External information


We present two different approaches to the location and recovery of text in images of real scenes. The techniques we describe are invariant to the scale and 3D orientation of the text, and allow recovery of text in cluttered scenes. The first approach uses page edges and other rectangular boundaries around text to locate a surface containing text, and to recover a fronto-parallel view. This is performed using line detection, perceptual grouping, and comparison of potential text regions using a confidence measure. The second approach uses low-level texture measures with a neural network classifier to locate regions of text in an image. Then we recover a fronto-parallel view of each located paragraph of text by separating the individual lines of text and determining the vanishing points of the text plane. We illustrate our results using a number of images.

Bibtex entry.

Contact details

Publication Admin