Sazhumyan Grigor ; Aznauryan Lusine
Nowadays lots of handwritten and printed ancient documents need to be digitized for automated processing and analysis. In this paper, an approach to background-text-non-text separation procedure based on differences of presented in a document image objects sizes which can be obtained by binarization and segmentation algorithms, is proposed. After binarization by proper method it is segmented and the distribution of segments sizes is obtained. It is assumed that the three types of objects presented in an image have significantly different sizes; therefore the problem of separation comes to discrimination of the set of segments into three groups. The thresholds for separation of these groups can be found by minimizing the intrasample variation which used in discriminant analysis. Some examples of images from Matenadaran collection are considered and the separated parts of the image are illustrated and interpreted.
oai:noad.sci.am:135813
dasat@ipia.sci.am ; grigorsazhumyan@gmail.com ; lusine.aznauryan8@gmail.com
Institute for Informatics and Automation Problems ; Russian-Armenian (Slavonic) University
11th International Conference on Computer Science and Information Technologies CSIT 2017
Mar 3, 2021
Jul 17, 2020
22
https://noad.sci.am/publication/149340
Edition name | Date |
---|---|
David, Asatryan, Novel Approach to Background-Text-Non-Text Separationin Ancient Degraded Document Images | Mar 3, 2021 |
Asatryan David Sazhumyan Grigor Sakanyan Bagrat
Asatryan David Hovsepyan Samvel
Asatryan David Kurkchiyan Vardan Sazhumyan Grigor
Asatryan David
Asatryan David