Automated metadata annotation: What is and is not possible with machine learning

Wu, Mingfang; Brandhorst, Hans; Marinescu, Maria Cristina; More López, Joaquim; Hlava, Margorie; Busch, Joseph

Automated metadata annotation: What is and is not possible with machine learning

Automated metadata annotation is only as good as training dataset, or rules that are available for the domain. It's important to learn what type of data content a pre-trained machine learning algorithm has been trained on to understand its limitations and potential biases. Consider what type of...

Descripción completa

Detalles Bibliográficos
Autores:	Wu, Mingfang, Brandhorst, Hans, Marinescu, Maria Cristina, More López, Joaquim, Hlava, Margorie, Busch, Joseph
Tipo de recurso:	artículo
Fecha de publicación:	2022
País:	España
Institución:	Universitat Politècnica de Catalunya (UPC)
Repositorio:	UPCommons. Portal del coneixement obert de la UPC
Idioma:	inglés
OAI Identifier:	oai:upcommons.upc.edu:2117/374724
Acceso en línea:	https://hdl.handle.net/2117/374724 https://dx.doi.org/10.1162/dint_a_00162
Access Level:	acceso abierto
Palabra clave:	Metadata Machine learning Metadades Aprenentatge automàtic Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic

Descripción
Sumario:	Automated metadata annotation is only as good as training dataset, or rules that are available for the domain. It's important to learn what type of data content a pre-trained machine learning algorithm has been trained on to understand its limitations and potential biases. Consider what type of content is readily available to train an algorithm—what's popular and what's available. However, scholarly and historical content is often not available in consumable, homogenized, and interoperable formats at the large volume that is required for machine learning. There are exceptions such as science and medicine, where large, well documented collections are available. This paper presents the current state of automated metadata annotation in cultural heritage and research data, discusses challenges identified from use cases, and proposes solutions.

Automated metadata annotation: What is and is not possible with machine learning

Similares en LA Referencia