Collective disambiguation in entity linking based on topic coherence in semantic graphs

Entity Linking (EL) consists of determinating the entities that best represent the mentions in a document. Mentions can be very ambiguous and can refer to different entities in different contexts. In this paper, we present ABACO, a semantic annotation system for Entity Linking (EL) which addresses n...

Descripción completa

Detalles Bibliográficos
Autores: Rama Maneiro, Efrén, Vidal Aguiar, Juan Carlos, Lama Penín, Manuel
Tipo de recurso: artículo
Fecha de publicación:2020
País:España
Institución:Universidad de Santiago de Compostela (USC)
Repositorio:Minerva. Repositorio Institucional de la Universidad de Santiago de Compostela
Idioma:inglés
OAI Identifier:oai:minerva.usc.gal:10347/39200
Acceso en línea:https://hdl.handle.net/10347/39200
Access Level:acceso abierto
Palabra clave:Entity linking
Semantic annotation
Topic coherence
Named entity disambiguation
120304 Inteligencia artificial
Descripción
Sumario:Entity Linking (EL) consists of determinating the entities that best represent the mentions in a document. Mentions can be very ambiguous and can refer to different entities in different contexts. In this paper, we present ABACO, a semantic annotation system for Entity Linking (EL) which addresses name ambiguity assuming that the entity that annotates a mention should be coherent with the main topics of the document. ABACO extracts a sub-graph from a knowledge base which interconnects all the candidate entities to annotate each mention in the document. Candidate entities are scored according to their degree of centrality in the knowledge graph and their textual similarity with the topics of the document, and worst candidates are pruned from the sub-graph. The approach has been validated with 13 datasets and compared with other 11 annotation systems using the GERBIL platform. Results show that ABACO outperforms the other systems for medium/large documents.