PANACEA Environment Bilingual Glossary EL-EN (Greek-English)

This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech sequences are fi...

Descripción completa

Detalles Bibliográficos
Autor: Dublin City University. School of Computing
Tipo de recurso: conjunto de datos
Fecha de publicación:2012
País:España
Institución:Consorci de Serveis Universitaris de Catalunya (CSUC)
Repositorio:CORA.Repositori de Dades de Recerca
OAI Identifier:oai:dnet:cora.rdr____::d09a5e7b208576abe8a3f378dc405607
Acceso en línea:https://doi.org/10.34810/DATA332
Access Level:acceso abierto
Palabra clave:Arts and Humanities
Social Sciences
Language resources
Lexical conceptual resource
Bilingual lexicon
Descripción
Sumario:This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech sequences are filtered out. Then, feature scores from the phrase table are combined in a log-linear model to score each entry. The user specifies how large the output glossary should be (relative to the input) and the bottom ranking entries are discarded to produce the desired size glossary.