ClInt: A bilingual Spanish-Catalan spoken corpus of clinical interviews

Bibliographic Details
Authors: Vila Rigat, Marta; González Fuente, Santiago; Martí Antonin, M. Antònia; Llisterri, Joaquim; Machuca Ayuso, María Jesús
Resource type: article
Status: Published version
Publication date: 2010
Country: Spain
Institution: Various* (Consorci de Biblioteques Universitàries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repository: Recercat. Dipòsit de la Recerca de Catalunya
OAI Identifier: oai:recercat.cat:2445/48252
Online access: https://hdl.handle.net/2445/48252
Access level: open access
Keywords: Tipologia (Lingüística)
Corpus (Lingüística)
Tractament del llenguatge natural (Informàtica)
Lingüística computacional
Català
Castellà (Llengua)
Typology (Linguistics)
Corpora (Linguistics)
Natural language processing (Computer science)
Computational linguistics
Catalan language
Spanish language
Description
Summary: In this paper we present ClInt (Clinical Interview), a bilingual Spanish-Catalan spoken corpus that contains 15 hours of clinical interviews. It consists of audio files aligned with multiple-level transcriptions comprising orthographic, phonetic and morphological information, as well as linguistic and extralinguistic encoding. This is a previously non-existent resource for these languages and it offers a wide-ranging exploitation potential in a broad variety of disciplines such as Linguistics, Natural Language Processing and related fields.