IULA Penn Treebank

This treebank consists of a number of Spanish and English sentences that has been manually annotated with syntactical information. The sentences have been choosed from the Penn TreeBank corpus, a resource containing texts from Wall Street Journal and originally compiled by the University of Pennsylv...

ver descrição completa

Detalhes bibliográficos
Autor: Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
Tipo de documento: conjunto de datos
Data de publicação:2012
País:España
Recursos:Consorci de Serveis Universitaris de Catalunya (CSUC)
Repositório:CORA.Repositori de Dades de Recerca
OAI Identifier:oai:dnet:cora.rdr____::893b76432ef48ef2860071d380ff589c
Acesso em linha:https://doi.org/10.34810/DATA265
Access Level:Acceso aberto
Palavra-chave:Arts and Humanities
Social Sciences
Language resources
Parallel corpus
Descrição
Resumo:This treebank consists of a number of Spanish and English sentences that has been manually annotated with syntactical information. The sentences have been choosed from the Penn TreeBank corpus, a resource containing texts from Wall Street Journal and originally compiled by the University of Pennsylvania./nIt contains 805 sentences that have been human translated to Spanish. The original English and the translated Spanish sentences share the same identification number. Sentences in both languages have been processed using the DELPH-IN environment (http://www.delph-in.net/).