Missing Value Estimation of Microarray Data Using Similarity Measurement

Pati, Soumen Kumar; Das, Asit Kumar

doi:10.1007/978-3-642-35380-2_70

Missing Value Estimation of Microarray Data Using Similarity Measurement

Soumen Kumar Pati²⁰ &
Asit Kumar Das²¹

Conference paper

2888 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7677))

Abstract

DNA gene expression profiling plays an important role in a wide range of areas in biological science for handling cancer diseases. Data generated in microarray related experiments have many missing expression values which lose valuable information from the dataset. The proposed method first partitions the genes without missing values using clustering algorithm and then measures the similarity between a gene with missing values and the centroid of the clusters and finally, the missing values are estimated by the corresponding expression values of the centroid giving maximum similarity factor. The method explicitly depends on expression values to imputes missing values, completed the input dataset with low errors for data analysis and knowledge discovery. The method is compared with prominent approaches, such as zero-impute, row-average-impute and KNN-impute in terms of “Normalized Root Mean Square Error” to claim its novelty.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

DeRisi, J., et al.: Use of a cDNA microarray to analyse gene expression patterns in human cancer. Nat. Genet. 14(4), 457–460 (1996)
Article Google Scholar
Luo, J., Yang, T., Wang, Y.: Missing Value Estimation for Microarray Data Based On Fuzzy C-means Clustering. In: Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region (2005)
Google Scholar
Butte, A.J., Ye, J.: Determining Significant Fold Differences in Gene Expression Analysis. In: Pac. Symp. Biocomput., vol. 6, pp. 6–17 (2001)
Google Scholar
Alizadeh, A.A., et al.: Distinct Types of Diffuse Large B-Cell Lymphoma Identified by Gene Expression Profiling. Nature 403, 503–511 (2000)
Article Google Scholar
Schafer, J.L., Graham, J.W.: Missing data: our view of the state of the art. Psychol. Methods 7, 144–177 (2002)
Article Google Scholar
Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., Altman, R.B.: Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525 (2001)
Article Google Scholar
Huynen, M., Snel, B., Lathe, W., Bork, P.: Genome Res. 10, 1204–1210 (2000)
Google Scholar
Zhang, S., Zhang, J., Zhu, X., Qin, Y., Zhang, C.: Missing Value Imputation Based on Data Clustering. Transactions on Computational Science (TCOS) 1, 128–138 (2008)
Google Scholar
Velarde Cristina, C., Escudero, R., Zaliz, R.R.: Boolean Networks: A Study on Microarray Data Discretization. In: ESTYLF 2008, Cuencas Mineras, Mieres, Langreo, pp. 17–19 (2008)
Google Scholar
Pati, S.K., Das, A.K.: Cluster Analysis of Microarray Data Based on Similarity Measurement. International Journal of Bioinformatics Research 3(2), 207–213 (2011) ISSN: 0975-3087
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science/Information Technology, St. Thomas‘ College of Engineering and Technology, 4, D.H. Road, Kolkata, 23, India
Soumen Kumar Pati
Department of Computer Science and Technology, Bengal Engineering and Science University, Shibpur, Howrah, 03, India
Asit Kumar Das

Authors

Soumen Kumar Pati
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical Engineering, Indian Institute of Technology, 110016, Delhi, India
Bijaya Ketan Panigrahi
Electronics and Communication Sciences Unit, Indian Statistical Institute, 700108, Kolkata, India
Swagatam Das
School of Electrical and Electronic Engineering, Nanyang Technological University, Block N4, 2b-39, Nanyang Avenue, 639798, Singapore, Singapore
Ponnuthurai Nagaratnam Suganthan
Department of Electronics and Telecom. Engineering, Institute of Technical Education & Research, Siksha ’O’ Anusandhan University, 751030, Bhubaneswar, Odisha, India
Pradipta Kumar Nanda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pati, S.K., Das, A.K. (2012). Missing Value Estimation of Microarray Data Using Similarity Measurement. In: Panigrahi, B.K., Das, S., Suganthan, P.N., Nanda, P.K. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2012. Lecture Notes in Computer Science, vol 7677. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35380-2_70

Download citation

DOI: https://doi.org/10.1007/978-3-642-35380-2_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35379-6
Online ISBN: 978-3-642-35380-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics