InstanceRank: Bringing order to datasets

In this paper we present InstanceRank, a ranking algorithm that reflects the relevance of the instances within a dataset. InstanceRank applies a similar solution to that used by PageRank, the web pages ranking algorithm in the Google search engine. We also present ISR, an instance selection techniqu...

Descripción completa

Detalles Bibliográficos
Autores: García Vallejo, Carlos Antonio, Troyano Jiménez, José Antonio, Ortega Rodríguez, Francisco Javier
Tipo de recurso: artículo
Estado:Versión enviada para evaluación y publicación
Fecha de publicación:2010
País:España
Institución:Universidad de Sevilla (US)
Repositorio:idUS. Depósito de Investigación de la Universidad de Sevilla
OAI Identifier:oai:idus.us.es:11441/100070
Acceso en línea:https://hdl.handle.net/11441/100070
https://doi.org/10.1016/j.patrec.2009.09.022
Access Level:acceso abierto
Palabra clave:Instance-based learning
Instance reduction
Nearest neighbor
PageRank Classification
Descripción
Sumario:In this paper we present InstanceRank, a ranking algorithm that reflects the relevance of the instances within a dataset. InstanceRank applies a similar solution to that used by PageRank, the web pages ranking algorithm in the Google search engine. We also present ISR, an instance selection technique that uses InstanceRank. This algorithm chooses the most representative instances from a learning database. Experiments show that ISR algorithm, with InstanceRank as ranking criteria, obtains similar results in accuracy to other instance reduction techniques, noticeably reducing the size of the instance set.