Please use this identifier to cite or link to this item:
Title: Iterative bicluster-based least square framework for estimation of missing values in microarray gene expression data
Authors: Cheng, Kin-on
Law, Ngai-fong Bonnie
Siu, Wan-chi
Subjects: Missing value imputation
Iterative estimation
Gene expression analysis
Issue Date: Apr-2012
Publisher: Elsevier
Source: Pattern recognition, Apr. 2012, v. 45, no. 4, p. 1281-1289.
Abstract: DNA microarray experiment inevitably generates gene expression data with missing values. An important and necessary pre-processing step is thus to impute these missing values. Existing imputation methods exploit gene correlation among all experimental conditions for estimating the missing values. However, related genes coexpress in subsets of experimental conditions only. In this paper, we propose to use biclusters, which contain similar genes under subset of conditions for characterizing the gene similarity and then estimating the missing values. To further improve the accuracy in missing value estimation, an iterative framework is developed with a stopping criterion on minimizing uncertainty. Extensive experiments have been conducted on artificial datasets, real microarray datasets as well as one non-microarray dataset. Our proposed biclusters-based approach is able to reduce errors in missing value estimation.
Rights: Pattern Recognition ©2011 Elsevier Ltd. The journal web site is located at
Type: Journal/Magazine Article
DOI: 10.1016/j.patcog.2011.10.012
ISSN: 0031-3203
Appears in Collections:EIE Journal/Magazine Articles

Files in This Item:
File Description SizeFormat 
2_K_PR_Revised.pdfPre-published version364.15 kBAdobe PDFView/Open

All items in the PolyU Institutional Repository are protected by copyright, with all rights reserved, unless otherwise indicated. No item in the PolyU IR may be reproduced for commercial or resale purposes.