Mark-up and Annotation in the Corpus of Historical English Law Reports (CHELAR): Potential for Historical Genre Analysis

Adding annotation and mark-up to linguistic corpora has become a standard practice in corpus building over the past few decades as a way to facilitate data extraction and at the same time guarantee that new corpora are compatible with existing and future tools. The purpose of this article is twofold...

Descripción completa

Detalles Bibliográficos
Autores: Rodríguez Puente, Paula|||0000-0002-7177-5984, Blanco García, Cristina, Tamaredo, Iván
Tipo de recurso: artículo
Fecha de publicación:2019
País:España
Institución:Universidad de Oviedo (UNIOVI)
Repositorio:RUO. Repositorio Institucional de la Universidad de Oviedo
Idioma:inglés
OAI Identifier:oai:digibuo.uniovi.es:10651/53983
Acceso en línea:http://hdl.handle.net/10651/53983
https://dx.doi.org/10.28914/Atlantis-2019-41.2.03
Access Level:acceso abierto
Palabra clave:Corpus annotation
Corpus mark-up
Law reports
TEI-XML
Legal English
Descripción
Sumario:Adding annotation and mark-up to linguistic corpora has become a standard practice in corpus building over the past few decades as a way to facilitate data extraction and at the same time guarantee that new corpora are compatible with existing and future tools. The purpose of this article is twofold. First, we provide an overview of the main forms of annotation and mark-up available to the research community and how they have been applied to the Corpus of Historical English Law Reports 1535-1999 (CHELAR), a specialized corpus consisting of law reports or records of judicial decisions. Second, we give an account of preliminary research based on the annotated versions of CHELAR, which so far has been primarily aimed at identifying the distinctive linguistic characteristics of law reports, as well as at investigating how the language of law reports has evolved over a time span of almost five centuries. Our article illustrates the multiple advantages of applying a simple annotation schema to