An effective measure for assessing the quality of biclusters

Biclustering is becoming a popular technique for the study of gene expression data. This is mainly due to the capability of biclustering to address the data using various dimensions simultaneously, as opposed to clustering, which can use only one dimension at the time. Different heuristics have been...

Full description

Bibliographic Details
Authors: Divina, Federico, Pontes Balanza, Beatriz, Giráldez, Raúl, Aguilar Ruiz, Jesús Salvador
Format: article
Status:Published version
Publication Date:2012
Country:España
Institution:Universidad de Sevilla (US)
Repository:idUS. Depósito de Investigación de la Universidad de Sevilla
OAI Identifier:oai:idus.us.es:11441/130171
Online Access:https://hdl.handle.net/11441/130171
https://doi.org/10.1016/j.compbiomed.2011.11.015
Access Level:Open access
Keyword:Biclustering
Gene Expression Data
Shifting and scaling patterns
Description
Summary:Biclustering is becoming a popular technique for the study of gene expression data. This is mainly due to the capability of biclustering to address the data using various dimensions simultaneously, as opposed to clustering, which can use only one dimension at the time. Different heuristics have been proposed in order to discover interesting biclusters in data. Such heuristics have one common characteristic: they are guided by a measure that determines the quality of biclusters. It follows that defining such a measure is probably the most important aspect. One of the popular quality measure is the mean squared residue (MSR). However, it has been proven that MSR fails at identifying some kind of patterns. This motivates us to introduce a novel measure, called virtual error (VE), that overcomes this limitation. Results obtained by using VE confirm that it can identify interesting patterns that could not be found by MSR.