Handwriting word recognition using windowed Bernoulli HMMs

[EN] Hidden Markov Models (HMMs) are now widely used for off-line handwriting recognition in many lan- guages. As in speech recognition, they are usually built from shared, embedded HMMs at symbol level, where state-conditional probability density functions in each HMM are modeled with Gaussian mixt...

Descripción completa

Detalles Bibliográficos
Autores: Giménez Pastor, Adrián, Juan, Alfons|||0000-0002-9984-4072, Alkhoury, Ihab, Andrés Ferrer, Jesús
Tipo de recurso: artículo
Fecha de publicación:2014
País:España
Institución:Universitat Politècnica de València (UPV)
Repositorio:RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia
Idioma:inglés
OAI Identifier:oai:riunet.upv.es:10251/37326
Acceso en línea:https://riunet.upv.es/handle/10251/37326
Access Level:acceso abierto
Palabra clave:HTR
Bernoulli HMMs
Latin
Arabig
Sliding window
ESTADISTICA E INVESTIGACION OPERATIVA
LENGUAJES Y SISTEMAS INFORMATICOS
Descripción
Sumario:[EN] Hidden Markov Models (HMMs) are now widely used for off-line handwriting recognition in many lan- guages. As in speech recognition, they are usually built from shared, embedded HMMs at symbol level, where state-conditional probability density functions in each HMM are modeled with Gaussian mixtures. In contrast to speech recognition, however, it is unclear which kind of features should be used and, indeed, very different features sets are in use today. Among them, we have recently proposed to directly use columns of raw, binary image pixels, which are directly fed into embedded Bernoulli (mixture) HMMs, that is, embedded HMMs in which the emission probabilities are modeled with Bernoulli mix- tures. The idea is to by-pass feature extraction and to ensure that no discriminative information is filtered out during feature extraction, which in some sense is integrated into the recognition model. In this work, column bit vectors are extended by means of a sliding window of adequate width to better capture image context at each horizontal position of the word image. Using these windowed Bernoulli mixture HMMs, good results are reported on the well-known IAM and RIMES databases of Latin script, and in particular, state-of-the-art results are provided on the IfN/ENIT database of Arabic handwritten words.