Feature Selection for Consistent Biclustering via Fractional 0–1 Programming

Busygin, Stanislav; Prokopyev, Oleg A.; Pardalos, Panos M.

doi:10.1007/s10878-005-1856-y

Feature Selection for Consistent Biclustering via Fractional 0–1 Programming

Published: August 2005

Volume 10, pages 7–21, (2005)
Cite this article

Journal of Combinatorial Optimization Aims and scope Submit manuscript

Stanislav Busygin¹,
Oleg A. Prokopyev¹ &
Panos M. Pardalos²

313 Accesses
39 Citations
Explore all metrics

Abstract

Biclustering consists in simultaneous partitioning of the set of samples and the set of their attributes (features) into subsets (classes). Samples and features classified together are supposed to have a high relevance to each other which can be observed by intensity of their expressions. We define the notion of consistency for biclustering using interrelation between centroids of sample and feature classes. We prove that consistent biclustering implies separability of the classes by convex cones. While previous works on biclustering concentrated on unsupervised learning and did not consider employing a training set, whose classification is given, we propose a model for supervised biclustering, whose consistency is achieved by feature selection. The developed model involves solution of a fractional 0–1 programming problem. Preliminary computational results on microarray data mining problems are reported.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification

A novel biclustering algorithm of binary microarray data: BiBinCons and BiBinAlter

Article Open access 30 November 2015

Biclustering Algorithms Based on Metaheuristics: A Review

References

A. Ben-Dor, L. Bruhn, I. Nachman, M. Schummer, and Z. Yakhini, “Tissue classification with gene expression profiles,” Journal of Computational Biology, vol. 7, pp. 559–584, 2000.
Article PubMed Google Scholar
A. Ben-Dor, N. Friedman, and Z. Yakhini, “Class discovery in gene expression data,” in Proc. Fifth Annual Inter. Conf. on Computational Molecular Biology (RECOMB), 2001.
S. Busygin, G. Jacobsen, and E. Krámer, “Double Conjugated Clustering Applied to Leukemia Microarray Data,” SDM 2002 Workshop on Clustering High Dimensional Data and its Applications, 2002.
Y. Cheng and G.M. Church, “Biclustering of Expression Data,” in: Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology, 2000, pp. 93–103.
I.S. Dhillon, “Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning,” in: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD), August 26–29, 2001, San Francisco, CA.
P. Hansen, M. Poggi de Aragão, and C.C. Ribeiro, “Hyperbolic 0–1 programming and query optimization in information retrieval,” Math. Program., vol. 52, pp. 256–263, 1991.
Google Scholar
S. Hashizume, M. Fukushima, N. Katoh, and T. Ibaraki, “Approximation algortihms for combinatorial fractional programming problems,” Mathematical Programming, vol. 37, pp. 255–267.
L.-L. Hsiao, F. Dangond, T. Yoshida, R. Hong, R.V. Jensen, J. Misra, W. Dillon, K.F. Lee, KE. Clark, P. Haverty, Z. Weng, G. Mutter, M.P. Frosch, M.E. MacDonald, E.L. Milford, C.P. Crum, R. Bueno, R.E. Pratt, M. Mahadevappa, J.A. Warrington, G. Stephanopoulos, G. Stephanopoulos, and S.R. Gullans, “A Compendium of Gene Expression in Normal Human Tissues,” Physiol. Genomics, vol. 7, pp. 97–104, 2001.
PubMed Google Scholar
T.R. Golub, D.K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J.P. Mesirov, H. Coller, M.L. Loh, J.R. Downing, M.A. Caligiuri, C.D. Bloomfield, and E.S. Lander, “Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring,” Science, vol. 286, pp. 531–537, 1999.
Article PubMed Google Scholar
Y. Kluger, R. Basri, J.T. Chang, and M. Gerstein, “Spectral biclustering of microarray data: Coclustering genes and conditions,” Genome Res, vol. 13, pp. 703–716, 2003.
Article PubMed Google Scholar
J.-C. Picard and M. Queyranne, “A network flow solution to some nonlinear 0–1 programming problems, with applications to graph theory,” Networks, vol. 12, pp. 141–159, 1982.
Google Scholar
O.A. Prokopyev, H.-X. Huang, and P.M. Pardalos, “On complexity of unconstrained hyperbolic 0–1 programming problems,” Oper. Res. Lett., vol. 33, pp. 312–318, 2005a.
Article Google Scholar
O.A. Prokopyev, C. Meneses, C.A.S. Oliveira, and P.M. Pardalos, “On Multiple-Ratio Hyperbolic 0–1 Programming Problems,” to appear in Pacific Journal of Optimization, 2005b.
S. Saipe, “Solving a (0,1) hyperbolic program by branch and bound,” Naval Res. Logist. Quarterly, vol. 22, pp. 497–515, 1975.
Google Scholar
M. Tawarmalani, S. Ahmed, and N. Sahinidis, “Global optimization of 0–1 Hyperbolic Programs,” J. Global Optim., vol. 24, pp. 385–416, 2002.
Article Google Scholar
J. Weston, S. Mukherjee, O. Chapelle, M. Pontil, T. Poggio, and V. Vapnik, Feature selection for SVMs. NIPS, 2001.
T.-H. Wu, “A note on a global approach for general 0–1 fractional programming,” European J. Oper. Res., vol. 101, pp. 220–223, 1997.
Article Google Scholar
E.P. Xing and R.M. Karp “CLIFF: Clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts,” Bioinformatics Discovery Note, vol. 1, pp. 1–9, 2001.
Google Scholar
CAMDA 2001 Conference. http://bioinformatics.duke.edu/camda/camda01/.
HuGE Index.org Website. http://www.hugeindex.org.
ILOG Inc. CPLEX 9.0 User’s Manual, 2004.

Download references

Author information

Authors and Affiliations

Department of Industrial and Systems Engineering, University of Florida, Gainesville, FL, 32611
Stanislav Busygin & Oleg A. Prokopyev
Department of Industrial and Systems Engineering, Biomedical Engineering Program, McKnight Brain Institute, University of Florida, Gainesville, FL, 32611
Panos M. Pardalos

Authors

Stanislav Busygin
View author publications
You can also search for this author in PubMed Google Scholar
Oleg A. Prokopyev
View author publications
You can also search for this author in PubMed Google Scholar
Panos M. Pardalos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Stanislav Busygin.

Additional information

This research work was partially supported by NSF, NIH and AirForce grants.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Busygin, S., Prokopyev, O.A. & Pardalos, P.M. Feature Selection for Consistent Biclustering via Fractional 0–1 Programming. J Comb Optim 10, 7–21 (2005). https://doi.org/10.1007/s10878-005-1856-y

Download citation

Issue Date: August 2005
DOI: https://doi.org/10.1007/s10878-005-1856-y

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Feature Selection for Consistent Biclustering via Fractional 0–1 Programming

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification

A novel biclustering algorithm of binary microarray data: BiBinCons and BiBinAlter

Biclustering Algorithms Based on Metaheuristics: A Review

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now