Large scale analysis of gender bias and sexism in song lyrics

We employ Natural Language Processing techniques to analyse 377,808 English song lyrics from the “Two Million Song Database” corpus, focusing on the expression of sexism across five decades (1960–2010) and the measurement of gender biases. Using a sexism classifier, we identify sexist lyrics at a la...

Descripción completa

Detalles Bibliográficos
Autores: Betti, Lorenzo, Abrate, Carlo, Kaltenbrunner, Andreas
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2023
País:España
Institución:Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repositorio:Recercat. Dipósit de la Recerca de Catalunya
OAI Identifier:oai:recercat.cat:10230/57420
Acceso en línea:http://hdl.handle.net/10230/57420
http://dx.doi.org/10.1140/epjds/s13688-023-00384-8
Access Level:acceso abierto
Palabra clave:Song lyrics
Gender
Natural language processing
Word embeddings
Language bias
Sexism
Descripción
Sumario:We employ Natural Language Processing techniques to analyse 377,808 English song lyrics from the “Two Million Song Database” corpus, focusing on the expression of sexism across five decades (1960–2010) and the measurement of gender biases. Using a sexism classifier, we identify sexist lyrics at a larger scale than previous studies using small samples of manually annotated popular songs. Furthermore, we reveal gender biases by measuring associations in word embeddings learned on song lyrics. We find sexist content to increase across time, especially from male artists and for popular songs appearing in Billboard charts. Songs are also shown to contain different language biases depending on the gender of the performer, with male solo artist songs containing more and stronger biases. This is the first large scale analysis of this type, giving insights into language usage in such an influential part of popular culture.