A Hybrid Learning for Named Entity Recognition Systems

This paper presents a hybrid method using machine learning approach for Named Entity Recognition (NER). A system built based on this method is able to achieve reasonable performance with minimal training data and gazetteers. The hybrid machine learning approach differs from previous machine learning...

Descripción completa

Detalles Bibliográficos
Autor: Chiong, Raymond
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2008
País:Brasil
Institución:Universidade Federal de Lavras (UFLA)
Repositorio:INFOCOMP: Jornal de Ciência da Computação
Idioma:inglés
OAI Identifier:oai:infocomp.dcc.ufla.br:article/243
Acceso en línea:https://infocomp.dcc.ufla.br/index.php/infocomp/article/view/243
Access Level:acceso abierto
Palabra clave:Machine learning
named entity recognition
tagging
Descripción
Sumario:This paper presents a hybrid method using machine learning approach for Named Entity Recognition (NER). A system built based on this method is able to achieve reasonable performance with minimal training data and gazetteers. The hybrid machine learning approach differs from previous machine learning-based systems in that it uses Maximum Entropy Model (MEM) and Hidden Markov Model (HMM) successively. We report on the performance of our proposed NER system using British National Corpus (BNC). In the recognition process, we first use MEM to identify the named entities in the corpus by imposing some temporary tagging as references. The MEM walkthrough can be regarded as a training process for HMM, as we then use HMM for the final tagging. We show that with enough training data and appropriate error correction mechanism, this approach can achieve higher precision and recall than using a single statistical model. We conclude with our experimental results that indicate the flexibility of our system in different domains.