A common semantic space for monolingual and cross-lingual meta-embeddings

This master’s thesis presents a new technique for creating monolingual and cross-lingual meta-embeddings. Our method integrates multiple word embeddings created from complementary techniques, textual sources, knowledge bases and languages. Existing word vectors are projected to a common semantic spa...

Descripción completa

Detalles Bibliográficos
Autor: García Ferrero, Iker
Tipo de recurso: tesis de maestría
Fecha de publicación:2019
País:España
Institución:Universidad del País Vasco
Repositorio:Addi. Archivo Digital para la Docencia y la Investigación
OAI Identifier:oai:addi.ehu.eus:10810/36183
Acceso en línea:http://hdl.handle.net/10810/36183
Access Level:acceso abierto
Palabra clave:language analysis and processing
monolingual meta-embeddings
cross-lingual meta-embeddings
Descripción
Sumario:This master’s thesis presents a new technique for creating monolingual and cross-lingual meta-embeddings. Our method integrates multiple word embeddings created from complementary techniques, textual sources, knowledge bases and languages. Existing word vectors are projected to a common semantic space using linear transformations and averaging. With our method the resulting meta-embeddings maintain the dimensionality of the original embeddings without losing information while dealing with the out-of-vocabulary (OOV) problem. Furthermore, empirical evaluation demonstrates the effectiveness of our technique with respect to previous work on various intrinsic and extrinsic multilingual evaluations.