Noun phase selection in automatic indexing

Objective: this study aims to synthetize and classify the noun phrases selection criteria present in methods for automatic indexing by noun phrases of texts written in Portuguese.Methods: The research methodology has an exploratory nature and bibliographic character, and has the content analysis as...

Descripción completa

Detalles Bibliográficos
Autores: Nascimento, Gustavo Diniz do, Correa, Renato Fernandes
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2019
País:Brasil
Institución:Universidade Federal de Santa Catarina (UFSC)
Repositorio:Encontros Bibli
Idioma:portugués
OAI Identifier:oai:periodicos.ufsc.br:article/57927
Acceso en línea:https://periodicos.ufsc.br/index.php/eb/article/view/1518-2924.2019.e57927
Access Level:acceso abierto
Palabra clave:Indexação automática
Sintagmas nominais
Seleção de sintagmas nominais
Língua portuguesa
Recuperação da informação
Automatic indexing
Noun phrases
Noun phrase selection
Portuguese language
Information retrieval
Descripción
Sumario:Objective: this study aims to synthetize and classify the noun phrases selection criteria present in methods for automatic indexing by noun phrases of texts written in Portuguese.Methods: The research methodology has an exploratory nature and bibliographic character, and has the content analysis as procedural method. The bases of the noun phrases selection methodologies are criteria as absolute frequency of occurrence, normalized frequency of occurrence, inverse document frequency, non-occurrence in list of stopwords, and the grammatical structure and level of noun phrases.Conclusions: As for the criteria scope, predominates in quantity those based on the noun phrases characteristics (grammatical structure, level, lexical content), in adoption predominates those based on the document content and the corpus content.Results: The main contribution of this work is the panoramic overview of the noun phrases selection criteria for texts written in the Portuguese idiom.