SVD Based Feature Selection and Sample Classification of Proteomic Data

D’Addabbo, Annarita; Papale, Massimo; Di Paolo, Salvatore; Magaldi, Simona; Colella, Roberto; d’Onofrio, Valentina; Di Palma, Annamaria; Ranieri, Elena; Gesualdo, Loreto; Ancona, Nicola

doi:10.1007/978-3-540-85567-5_69

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5179)

Included in the following conference series:

International Conference on Knowledge-Based and Intelligent Information and Engineering Systems

Abstract

Feature selection becomes a central task when 'signature' profiles specific to a pathological status must be extracted from high-dimensional gene expression or proteomic data. In this paper, we propose a feature selection method based on Singular Value Decomposition (SVD) and apply it to SELDI-TOF/MS proteomic data from a cohort of type 2 diabetic patients affected by glomerulosclerosis and membranous nephropathy. We select a profile of 24 proteins that appears to be an effective signature for the pathology at hand, allowing efficient discrimination between the considered subtypes of diabetes.
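
The abstract does not spell out the selection procedure, so the following is only a generic sketch of how SVD can be used to rank features, not the authors' method. It assumes a samples-by-features matrix, centers each feature, and scores a feature by its squared loadings on the leading right singular vectors, weighted by the squared singular values. The function names, the scoring rule, and the synthetic data are all illustrative assumptions.

```python
import numpy as np

def svd_feature_scores(X, n_components=2):
    """Score each column of X by its weight in the leading singular directions.

    Illustrative scoring rule (an assumption, not taken from the paper):
    sum over the top components of (singular value * loading) squared.
    """
    Xc = X - X.mean(axis=0)                      # center each feature
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    k = min(n_components, len(s))
    return ((s[:k, None] * Vt[:k]) ** 2).sum(axis=0)

def select_top_features(X, n_features, n_components=2):
    """Return indices of the n_features highest-scoring columns."""
    scores = svd_feature_scores(X, n_components)
    return np.argsort(scores)[::-1][:n_features]

# Synthetic stand-in for a spectra matrix: 40 samples, 50 features.
# Features 3 and 7 are shifted between two hypothetical classes.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 20)
X = rng.normal(size=(40, 50))
X[:, 3] += 5.0 * y
X[:, 7] -= 5.0 * y

top = select_top_features(X, n_features=2, n_components=1)
print(sorted(top.tolist()))
```

In a real study the ranking would have to be computed inside each cross-validation fold, on training samples only, to avoid selection bias when the chosen features are later used for classification.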

Author information

Authors and Affiliations

Istituto di Studi sui Sistemi Intelligenti per l’Automazione, CNR, Via Amendola 122/D-I, 70126, Bari, Italy
Annarita D’Addabbo, Roberto Colella & Nicola Ancona
Molecular Medicine Center, Sect. of Nephrology, Dept. of Biomedical Sciences and Bioagromed, Faculty of Medicine, University of Foggia,
Massimo Papale, Simona Magaldi, Annamaria Di Palma & Loreto Gesualdo
Division of Nephrology and Dialysis, Hospital "Dimiccoli", ASL BAT, Barletta,
Salvatore Di Paolo
Dept. of Surgical Sciences, Faculty of Medicine, University of Foggia, Italy
Valentina d’Onofrio
Dept. of Biomedical Sciences, Sect. of Clinical Pathology, Faculty of Medicine, University of Foggia, Italy
Elena Ranieri

Editor information

Ignac Lovrek, Robert J. Howlett, Lakhmi C. Jain

Cite this paper

D’Addabbo, A. et al. (2008). SVD Based Feature Selection and Sample Classification of Proteomic Data. In: Lovrek, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2008. Lecture Notes in Computer Science, vol 5179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85567-5_69

DOI: https://doi.org/10.1007/978-3-540-85567-5_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85566-8
Online ISBN: 978-3-540-85567-5
eBook Packages: Computer Science, Computer Science (R0)
