Early Dropout Predictors in Social Sciences and Management Degree students

Student dropout is a major concern in studies investigating retentionstrategies in higher education. This study identifies which variables areimportant to predict student dropout, using academic data from 3583first-year students on the Business Administration (BA) degree at theUniversity of Barcelon...

ver descrição completa

Detalhes bibliográficos
Autores: Ortiz-Lozano, José Mª, Aparicio Chueca, Ma. del Pilar (María del Pilar), Triadó i Ivern, Xavier Ma., Arroyo-Barrigüetea, Jose Luis
Formato: artículo
Estado:Versión aceptada para publicación
Fecha de publicación:2024
País:España
Recursos:Universidad de Barcelona
Repositorio:Dipòsit Digital de la UB
OAI Identifier:oai:diposit.ub.edu:2445/215840
Acesso em linha:https://hdl.handle.net/2445/215840
Access Level:acceso abierto
Palavra-chave:Abandó dels estudis (Educació superior)
Rendiment acadèmic
College dropouts
Academic achievement
Descrição
Resumo:Student dropout is a major concern in studies investigating retentionstrategies in higher education. This study identifies which variables areimportant to predict student dropout, using academic data from 3583first-year students on the Business Administration (BA) degree at theUniversity of Barcelona (Spain). The results indicate that two variables,the percentage of subjects failed and not attended in the first semester,demonstrate significant predictive power. This has been corroboratedwith an additional sample of 10,784 students from three-degreeprograms (Law, BA, and Economics) at the Complutense University ofMadrid (Spain), to assess the robustness of the results. Three differentalgorithms have also been utilized: neural networks, random forest, andlogit. In the specific case of neural networks, the NeuralSensmethodology has been employed, which is based on the use ofsensitivities, allowing for its interpretation. The outcomes are highlyconsistent in all cases: both a simple model (logit) and moresophisticated ones (neural networks and random forest) exhibit highaccuracy (correctly predicted values) and sensitivity (correctly predicteddropouts). In test set average values of 77% and 69% have beenrespectively achieved. In this regard, a noteworthy point is that onlyacademic data from the university itself was used to develop themodels. This ensures that there’s no dependence on other personal ororganizational variables, which can often be difficult to access.