Latin Vallex : A Treebank-based Semantic Valency Lexicon for Latin

ABSTRACT : Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources b...

Descripción completa

Detalles Bibliográficos
Autores: González Saavedra, Berta, Passarotti, Marco, Onambele, Christophe
Tipo de recurso: capítulo de libro
Fecha de publicación:2016
País:España
Institución:Universidad Complutense de Madrid (UCM)
Repositorio:Docta Complutense
Idioma:inglés
OAI Identifier:oai:docta.ucm.es:20.500.14352/116332
Acceso en línea:https://hdl.handle.net/20.500.14352/116332
Access Level:acceso abierto
Palabra clave:81'322
811.124'02
Valency
Latin
Lexicography
Filología latina
Lingüística
5701.04 Lingüística Informatizada
Descripción
Sumario:ABSTRACT : Despite a centuries-long tradition in lexicography, Latin lacks state-of-the-art computational lexical resources. This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence. However, projects for creating new language resources for Latin have been launched over the last decade to fill this gap. In this paper, we present Latin Vallex, a valency lexicon for Latin built in mutual connection with the semantic and pragmatic annotation of two Latin treebanks featuring texts of different eras. On the one hand, such a connection between the empirical evidence provided by the treebanks and the lexicon allows to enhance each frame entry in the lexicon with its frequency in real data. On the other hand, each valency-capable word in the treebanks is linked to a frame entry in the lexicon.