Abstract:
Text extraction is an important initial step in digitizing historical documents. In this paper, we present a text extraction method for historical Tibetan document images based on block projections. Text extraction is treated as a text area detection and localization problem. The images are divided into equal-sized blocks, and the blocks are filtered using the categories of connected components and the corner point density. By analyzing the projections of the filtered blocks, the approximate text areas can be located and the text regions extracted. Experiments on a dataset of historical Tibetan documents demonstrate the effectiveness of the proposed method. © 2017, Tianjin University of Technology and Springer-Verlag GmbH Germany, part of Springer Nature.
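The block-projection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract filters blocks by connected-component categories and corner point density, whereas the sketch below substitutes a plain ink-density threshold as a stand-in. The names `block_mask`, `span`, and `locate_text_area`, and the parameters `block` and `min_density`, are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Tuple

Image = List[List[int]]  # binary image: 1 = ink pixel, 0 = background


def block_mask(img: Image, block: int, min_density: float) -> List[List[bool]]:
    """Split img into block x block tiles and keep tiles with enough ink.

    ASSUMPTION: ink density is a stand-in for the abstract's filter, which
    uses connected-component categories and corner point density.
    """
    rows, cols = len(img), len(img[0])
    mask = []
    for top in range(0, rows, block):
        mask_row = []
        for left in range(0, cols, block):
            tile = [img[r][left:left + block]
                    for r in range(top, min(top + block, rows))]
            area = sum(len(t) for t in tile)
            ink = sum(sum(t) for t in tile)
            mask_row.append(ink / area >= min_density)
        mask.append(mask_row)
    return mask


def span(profile: List[int]) -> Optional[Tuple[int, int]]:
    """Return (first, last) indices with a non-zero projection, or None."""
    hits = [i for i, v in enumerate(profile) if v > 0]
    return (hits[0], hits[-1]) if hits else None


def locate_text_area(img: Image, block: int = 4,
                     min_density: float = 0.2) -> Optional[Tuple[int, int, int, int]]:
    """Locate one approximate text area as (top, bottom, left, right).

    Bounds are in pixels and half-open: rows top..bottom-1, cols left..right-1.
    """
    mask = block_mask(img, block, min_density)
    # Project the kept blocks onto the vertical and horizontal axes.
    row_proj = [sum(r) for r in mask]
    col_proj = [sum(c) for c in zip(*mask)]
    rs, cs = span(row_proj), span(col_proj)
    if rs is None or cs is None:
        return None
    rows, cols = len(img), len(img[0])
    return (rs[0] * block, min((rs[1] + 1) * block, rows),
            cs[0] * block, min((cs[1] + 1) * block, cols))
```

For a page with several separate text areas, the same projections could be split at runs of zeros instead of taking a single span; the sketch returns only one bounding region to stay short.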
Source:
Optoelectronics Letters
ISSN: 1673-1905
Year: 2017
Issue: 6
Volume: 13
Page: 457-461
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 6
ESI Highly Cited Papers on the List: 0
30 Days PV: 10