Text extraction method for historical Tibetan document images based on block projections - Details

Author：

Duan, Li-juan (Duan, Li-juan.) (Scholars：段立娟) | Zhang, Xi-qun (Zhang, Xi-qun.) | Ma, Long-long (Ma, Long-long.) | Wu, Jian (Wu, Jian.)

Indexed by：

EI Scopus

Abstract：

Text　extraction　is　an　important　initial　step　in　digitizing　the　historical　documents.　In　this　paper,　we　present　a　text　extraction　method　for　historical　Tibetan　document　images　based　on　block　projections.　The　task　of　text　extraction　is　considered　as　text　area　detection　and　location　problem.　The　images　are　divided　equally　into　blocks　and　the　blocks　are　filtered　by　the　information　of　the　categories　of　connected　components　and　corner　point　density.　By　analyzing　the　filtered　blocks’　projections,　the　approximate　text　areas　can　be　located,　and　the　text　regions　are　extracted.　Experiments　on　the　dataset　of　historical　Tibetan　documents　demonstrate　the　effectiveness　of　the　proposed　method.　©　2017,　Tianjin　University　of　Technology　and　Springer-Verlag　GmbH　Germany,　part　of　Springer　Nature.

Keyword：

Information filtering Extraction Image processing

Author Community：

[ 1 ] [Duan, Li-juan]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Duan, Li-juan]Beijing Key Laboratory on Integration and Analysis of Large-scale Stream Data, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Zhang, Xi-qun]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 4 ] [Zhang, Xi-qun]Beijing Key Laboratory of Trusted Computing, Beijing University of Technology, Beijing; 100124, China
[ 5 ] [Ma, Long-long]Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China
[ 6 ] [Wu, Jian]Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China

Reprint Author's Address：

段立娟
[duan, li-juan]beijing key laboratory on integration and analysis of large-scale stream data, beijing university of technology, beijing; 100124, china;;[duan, li-juan]faculty of information technology, beijing university of technology, beijing; 100124, china

Email：

ljduan@bjut.edu.cn

Show more details

Related Keywords：

Adaptive feature extraction of four-class motor imagery EEG based on best basis of wavelet packet and CSP
2011，
Few-Shot Waste Detection Based on Dual Attention and Dynamic Hard Sample Triplet Loss
2024，14th Asian Control Conference, ASCC 2024
Structural asymmetric convolution for wireframe parsing
2024，Engineering Applications of Artificial Intelligence
Reciprocating compressor fault diagnosis using an optimized convolutional deep belief network
2020，Journal of Vibration and Control

Source ：

Optoelectronics Letters

ISSN： 1673-1905

Year： 2017

Issue： 6

Volume： 13

Page： 457-461

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 6

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 10

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to