Article Open Access Logo

SKEW DETECTION USING THE HIERARCHICAL HOUGH TRANSFORM FOR VIETNAMESE GENERIC DOCUMENTS

Pham Pham Tuyet Trinh 1
Nguyen Cong Vu 1
Volume & Issue: Vol. 3 No. 1 (2000) | Page No.: 5-18 | DOI: 10.32508/stdj.v3i1.3521
Published: 2000-01-31

Online metrics


Statistics from the website

  • Abstract Views: 1340
  • Galley Views: 622

Statistics from Dimensions

Copyright The Author(s) 2023. This article is published with open access by Vietnam National University, Ho Chi Minh city, Vietnam. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0) which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. 

Abstract

Most existing page segmentation algorithms do not handle document images with skew or utilize time consuming skew detection techniques. This paper present an application of the hierarchical Hough transform, a computationally efficient skew detection algorithm to connected components. It is capable of detecting the skew angle in many types of images, including scientific articles, postal labels, handwritten texts, forms, drawings and bar codes. The algorithm is robust even when black-margins introduced by photocopying are present in the image and when the document is scanned at a low resolution of 50 dpi. The algorithm consists of two steps. In the first step, we quickly extract the centroids of connected components using a graph data structure. Then, the hierachical Hough transform, at two different angular resolutions, is applied to the selected centroids. The skew angle corresponds to the location of the highest peak in the Hough space.

Comments