Discriminative Bernoulli HMMs for isolated handwritten word recognition
[EN] Bernoulli HMMs (BHMMs) have been successfully applied to handwritten text recognition (HTR) tasks such as continuous and isolated handwritten word recognition. BHMMs belong to the generative model family and, hence, are usually trained by (joint) maximum likelihood estimation (MLE) by means of the Baum-We...
| Authors: | Giménez Pastor, Adrián; Andrés Ferrer, Jesús; Juan, Alfons |
|---|---|
| Format: | article |
| Publication date: | 2014 |
| Country: | Spain |
| Resources: | Universitat Politècnica de València (UPV) |
| Repository: | RiuNet. Repositorio Institucional de la Universitat Politècnica de València |
| Language: | English |
| OAI Identifier: | oai:riunet.upv.es:10251/50978 |
| Online access: | https://riunet.upv.es/handle/10251/50978 |
| Access Level: | open access |
| Keywords: | HTR; Bernoulli HMM; Log-linear HMM; MMI; RIMES; ESTADISTICA E INVESTIGACION OPERATIVA; LENGUAJES Y SISTEMAS INFORMATICOS |
| id |
ES_f0cc8ea7506cb0ef9bf0b5ad28fa0286 |
|---|---|
| oai_identifier_str |
oai:riunet.upv.es:10251/50978 |
| network_acronym_str |
ES |
| network_name_str |
Spain |
| spelling |
Work supported by the EC (FEDER/FSE) and the Spanish MEC/MICINN under the MIPRCV "Consolider Ingenio 2010" program (CSD2007-00018), iTrans2 (TIN2009-14511) and MITTRAL (TIN2009-14633-C03-01) projects. Also supported by the IST Programme of the European Community, under the PASCAL2 Network of Excellence, IST-2007-216886, and by the Spanish MITyC under the erudito.com (TSI-020110-2009-439). |
| dc.title.none.fl_str_mv |
Discriminative Bernoulli HMMs for isolated handwritten word recognition |
| dc.creator.none.fl_str_mv |
Giménez Pastor, Adrián; Andrés Ferrer, Jesús; Juan, Alfons |
| dc.contributor.none.fl_str_mv |
Departamento de Sistemas Informáticos y Computación; Escuela Técnica Superior de Ingeniería Informática; Ministerio de Educación y Ciencia; Ministerio de Ciencia e Innovación; Ministerio de Industria, Turismo y Comercio; European Commission; Repositorio Institucional de la Universitat Politècnica de València Riunet |
| dc.subject.none.fl_str_mv |
HTR; Bernoulli HMM; Log-linear HMM; MMI; RIMES; ESTADISTICA E INVESTIGACION OPERATIVA; LENGUAJES Y SISTEMAS INFORMATICOS |
| description |
[EN] Bernoulli HMMs (BHMMs) have been successfully applied to handwritten text recognition (HTR) tasks such as continuous and isolated handwritten word recognition. BHMMs belong to the generative model family and, hence, are usually trained by (joint) maximum likelihood estimation (MLE) by means of the Baum-Welch algorithm. Despite the good properties of the MLE criterion, there are better training criteria such as maximum mutual information (MMI). MMI is the most widespread criterion for training discriminative models such as log-linear (or maximum entropy) models. Inspired by a BHMM classifier, this work proposes a log-linear HMM (LLHMM) for binary data. The proposed model is proved to be equivalent to the BHMM classifier and, in this way, a discriminative training framework for BHMM classifiers is defined. The behavior of the proposed discriminative training framework is studied in depth on a well-known isolated word recognition task, the RIMES database. (C) 2013 Elsevier B.V. All rights reserved. |
| publishDate |
2014 |
| dc.date.none.fl_str_mv |
2014; 2014-01-01 |
| dc.type.none.fl_str_mv |
journal article; http://purl.org/coar/resource_type/c_6501; VoR; http://purl.org/coar/version/c_970fb48d4fbd8a85 |
| dc.type.openaire.fl_str_mv |
info:eu-repo/semantics/article |
| format |
article |
| dc.identifier.none.fl_str_mv |
https://riunet.upv.es/handle/10251/50978 |
| dc.language.none.fl_str_mv |
English; eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
European Commission https://doi.org/10.13039/501100000780 FP7 216886 Pattern Analysis, Statistical Modelling and Computational Learning 2; Ministerio de Ciencia e Innovación http://dx.doi.org/10.13039/501100004837 TIN2009-14633-C03-01 Multimodal Interaction For Text Transcription With Adaptive Learning; Ministerio de Educación y Ciencia https://doi.org/10.13039/501100008743 CSD2007-00018 Multimodal Interaction in Pattern Recognition and Computer Vision; Ministerio de Ciencia e Innovación http://dx.doi.org/10.13039/501100004837 TIN2009-14511 Traduccion De Textos Y Transcripcion De Voz Interactivas; Ministerio de Industria, Turismo y Comercio MITURCO TSI-020110-2009-0439 ERUDITO.COM |
| dc.rights.none.fl_str_mv |
open access; http://purl.org/coar/access_right/c_abf2; All rights reserved; http://rightsstatements.org/vocab/InC/1.0/ |
| dc.rights.openaire.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Elsevier |
| dc.source.none.fl_str_mv |
reponame: RiuNet. Repositorio Institucional de la Universitat Politècnica de València; instname: Universitat Politècnica de València (UPV) |
| instname_str |
Universitat Politècnica de València (UPV) |
| reponame_str |
RiuNet. Repositorio Institucional de la Universitat Politècnica de València |