Manuscript document digitalization and recognition: a first approach

cic.isFulltexttruees
cic.isPeerReviewedtruees
cic.lugarDesarrolloServicio de Difusión de la Creación Intelectual es
cic.versioninfo:eu-repo/semantics/submittedVersiones
dc.date.accessioned2016-08-18T20:18:15Z
dc.date.available2016-08-18T20:18:15Z
dc.identifier.urihttps://digital.cic.gba.gob.ar/handle/11746/3826
dc.titleManuscript document digitalization and recognition: a first approachen
dc.typeArtículoes
dcterms.abstractThe handwritten manuscript recognizing process belongs to a set of initiatives which lean to the preservation of cultural patrimony gathered in libraries and archives, where there exist a great wealth in documents and even handwritten cards that accompany incunabula books. This work is the starting point of a research and development project oriented to digitalization and recognition of manuscript materials. The paper presented here discuss different algorithms used in the first stage dedicated to image noise-cleaning in order to improve it before the character recognition process begins. In order to make the handwritten-text recognition and image digitalization process efficient, it must be preceded by a preprocessing stage of the image to be treated, which includes thresholding, noise cleaning, thinning, base-line alignment and image segmentation, among others. Each of these steps will allow us to reduce the injurious variability when recognizing manuscripts (noise, random gray levels, slanted characters, ink level in different zones), and so increasing the probability of obtaining a suitable text recognition. In this paper, two image thinning methods are considered, and implemented. Finally, an evaluation is carried out obtaining many conclusions related to efficiency, speed and requirements, as well as ideas for future implementations.en
dcterms.creator.authorDe Giusti, Marisa Raqueles
dcterms.creator.authorVila, María Martaes
dcterms.creator.authorVillarreal, Gonzalo Lujánes
dcterms.extent6 p.es
dcterms.identifier.other1666-6038es
dcterms.identifier.urlRegistro completoes
dcterms.isPartOf.issuevol. 5, no. 3es
dcterms.isPartOf.seriesJournal of Computer Science & Technologyes
dcterms.issued2005-10-01
dcterms.languageIngléses
dcterms.licenseAttribution 4.0 International (BY 4.0)es
dcterms.subjectdigitalizaciónes
dcterms.subjectImage processing softwareen
dcterms.subjectconservación patrimonialen
dcterms.subject.materiaCiencias de la Computación e Informaciónes

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
vila - manuscript.pdf-PDFA.pdf
Tamaño:
540.25 KB
Formato:
Adobe Portable Document Format
Descripción:
Documento completo