Object

Title: Novel Approach to Background-Text-Non-Text Separationin Ancient Degraded Document Images

Co-author(s) :

Sazhumyan Grigor ; Aznauryan Lusine

Abstract:

Nowadays lots of handwritten and printed ancient documents need to be digitized for automated processing and analysis. In this paper, an approach to background-text-non-text separation procedure based on differences of presented in a document image objects sizes which can be obtained by binarization and segmentation algorithms, is proposed. After binarization by proper method it is segmented and the distribution of segments sizes is obtained. It is assumed that the three types of objects presented in an image have significantly different sizes; therefore the problem of separation comes to discrimination of the set of segments into three groups. The thresholds for separation of these groups can be found by minimizing the intrasample variation which used in discriminant analysis. Some examples of images from Matenadaran collection are considered and the separated parts of the image are illustrated and interpreted.

Identifier:

oai:noad.sci.am:135813

Language:

English

URL:


Additional Information:

dasat@ipia.sci.am ; grigorsazhumyan@gmail.com ; lusine.aznauryan8@gmail.com

Affiliation:

Institute for Informatics and Automation Problems ; Russian-Armenian (Slavonic) University

Country:

Armenia

Year:

2017

Time period:

September25-29

Conference title:

11th International Conference on Computer Science and Information Technologies CSIT 2017

Place:

Yerevan

Participation type:

oral

Object collections:

Last modified:

Mar 3, 2021

In our library since:

Jul 17, 2020

Number of object content hits:

22

All available object's versions:

https://noad.sci.am/publication/149340

Show description in RDF format:

RDF

Show description in OAI-PMH format:

OAI-PMH

This page uses 'cookies'. More information