Fast Perspective Recovery of Text in Natural Scenes

Carlos Merino, Majid Mirmehdi, Jose Sigut, Jose Luis Gonzalez-Mora, Fast Perspective Recovery of Text in Natural Scenes. Image and Vision Computing, 31(10), pp. 714–724. October 2013.

Abstract

Cheap, ubiquitous, high-resolution digital cameras have created opportunities that demand camera-based text understanding, such as wearable computing and assistive technology. Perspective distortion is one of the main challenges for text recognition in camera-captured images, since the camera often does not have a fronto-parallel view of the text. We present a method for perspective recovery of text in natural scenes, where text can appear as isolated words, short sentences or small paragraphs (as found on posters, billboards, shop and street signs, etc.). It relies on the geometry of the characters themselves to estimate a rectifying homography for every line of text, over a large range of viewing orientations. The horizontal perspective foreshortening is corrected by fitting two lines to the top and bottom of the text, while the vertical perspective foreshortening and shearing are estimated by performing a linear regression on the shear variation of the individual characters within the text line. The proposed method is efficient and fast. We present comparative results showing improved recognition accuracy against the current state-of-the-art.
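
As an illustration of the two fitting steps named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes ordinary least squares for both the top/bottom line fits and the shear regression; the paper may use a different or more robust fitting procedure. The function names (fit_line, intersect) and all sample coordinates and shear values are hypothetical.

```python
def fit_line(points):
    """Ordinary least-squares fit of y = m*x + c to a list of (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    m = sxy / sxx  # assumes the x values are not all identical
    return m, mean_y - m * mean_x


def intersect(line_a, line_b):
    """Intersection of two (m, c) lines, or None if they are parallel."""
    (m1, c1), (m2, c2) = line_a, line_b
    if abs(m1 - m2) < 1e-12:
        return None  # parallel lines: no horizontal foreshortening
    x = (c2 - c1) / (m1 - m2)
    return x, m1 * x + c1


# Hypothetical top and bottom extrema of four characters (image coordinates).
tops = [(0, 10.0), (10, 11.0), (20, 12.0), (30, 13.0)]
bottoms = [(0, 30.0), (10, 29.0), (20, 28.0), (30, 27.0)]

top_line = fit_line(tops)
bottom_line = fit_line(bottoms)

# The two fitted lines converge at the horizontal vanishing point of the text.
vanishing_point = intersect(top_line, bottom_line)

# Hypothetical per-character shear estimates as (x position, shear) pairs.
# A linear regression models how shear varies along the text line.
shears = [(0, 0.10), (10, 0.12), (20, 0.14), (30, 0.16)]
shear_slope, shear_offset = fit_line(shears)

print(top_line, bottom_line, vanishing_point, shear_slope, shear_offset)
```

In this sketch, the vanishing point obtained from the two fitted lines would feed the horizontal part of the rectifying homography, and the regression coefficients would feed the vertical foreshortening and shear terms. Constructing the homography itself is left out, since the abstract does not specify its parameterization.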
