Randomness and Sparsity Induced Codebook Learning with Application to Cancer Image Classification

Li, Quannan; Yao, Cong; Wang, Liwei; Tu, Zhuowen

doi:10.1007/978-3-642-36620-8_18

Quannan Li^22,23,
Cong Yao^23,24,
Liwei Wang^23,25 &
…
Zhuowen Tu^22,23

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7766))

Included in the following conference series:

International MICCAI Workshop on Medical Computer Vision

1606 Accesses

Abstract

Codebook learning is one of the central research topics in computer vision and machine learning. In this paper, we propose a new codebook learning algorithm, Randomized Forest Sparse Coding (RFSC), by harvesting the following three concepts: (1) ensemble learning, (2) divide-and-conquer, and (3) sparse coding. Given a set of training data, a randomized tree can be used to perform data partition (divide-and-conquer); after a tree is built, a number of bases are learned from the data within each leaf node for a sparse representation (subspace learning via sparse coding); multiple trees with diversities are trained (ensemble), and the collection of bases of these trees constitute the codebook. These three concepts in our codebook learning algorithm have the same target but with different emphasis: subspace learning via sparse coding makes a compact representation, and reduces the information loss; the divide-and-conquer process efficiently obtains the local data clusters; an ensemble of diverse trees provides additional robustness. We have conducted classification experiments on cancer images as well as a variety of natural image datasets and the experiment results demonstrate the efficiency and effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Assouad, P.: Plongements lipschitziens dans rn. Bull. Soc. Math. France (4), 429–448 (1983)
MathSciNet Google Scholar
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
MathSciNet MATH Google Scholar
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Article MATH Google Scholar
Candes, E., Tao, T.: Near-optimal signal recovery from random projections: universal encoding strategies. IEEE Trans. Inform. Theory 52(2), 5406–5425 (2005)
Article Google Scholar
Caruana, R., Karampatziakis, N., Yessenalina, A.: An empirical evaluation of supervised learning in high dimensions. In: ICML, pp. 96–103 (2008)
Google Scholar
Caruana, R., Niculescu-Mizil, A.: An empirical comparison of supervised learning algorithms. In: ICML, pp. 161–168 (2006)
Google Scholar
Dasgupta, S., Freund, Y.: Random projection trees and low dimensional manifolds. In: STOC, pp. 537–546 (2008)
Google Scholar
Everingham, M., Zisserman, A., Williams, C.K.I., Van Gool, L., Allan, M., Bishop, C.M., Chapelle, O., Dalal, N., Deselaers, T., Dorkó, G., Duffner, S., Eichhorn, J., Farquhar, J.D.R., Fritz, M., Garcia, C., Griffiths, T., Jurie, F., Keysers, D., Koskela, M., Laaksonen, J., Larlus, D., Leibe, B., Meng, H., Ney, H., Schiele, B., Schmid, C., Seemann, E., Shawe-Taylor, J., Storkey, A.J., Szedmak, S., Triggs, B., Ulusoy, I., Viitaniemi, V., Zhang, J.: The 2005 PASCAL Visual Object Classes Challenge. In: Quiñonero-Candela, J., Dagan, I., Magnini, B., d’Alché-Buc, F. (eds.) MLCW 2005. LNCS (LNAI), vol. 3944, pp. 117–176. Springer, Heidelberg (2006)
Chapter Google Scholar
Ferrari, V., Jurie, F., Schmid, C.: Accurate Object Detection with Deformable Shape Models Learnt from Images. In: CVPR (2007)
Google Scholar
Freund, Y., Dasgupta, S., Kabra, M., Verma, N.: Learning the structure of manifolds using random projections. In: NIPS, vol. 20 (2007)
Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. of Comp. and Sys. Sci. 55(1) (1997)
Google Scholar
Friedma, J., Hastie, T., Hofling, H., Tibshirani, R.: Pathwise Coordinate Optimization. The Annals of Applied Stat. (2007)
Google Scholar
Gao, S., Tsang, I.W.H., Chia, L.T., Zhao, P.: Local features are not lonely - laplacian sparse coding for image classification. In: CVPR (2010)
Google Scholar
June, P.G., Ernst, D., Wehenkel, L.: Extremely Randomized Trees. In: Machine Learning, vol. 36 (2003)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
Google Scholar
Lee, H., Battle, A., Raina, R., Ng, A.Y.: Efficient sparse coding algorithms. In: NIPS (2007)
Google Scholar
Li, Y., Osher, S.: Coordinate descent optimization for ℓ¹ minimization with application to compressed sensing; a greedy algorithm. CAM Report (2009)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Mairal, J., Bach, F., Ponce, J.: Task-driven dictionary learning. IEEE Trans. on PAMI (to appear)
Google Scholar
Moosmann, F., Nowak, E., Jurie, F.: Randomized clustering forests for image classification. IEEE Trans. on PAMI 30(9), 1632–1646 (2008)
Article Google Scholar
Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)
Article Google Scholar
Opelt, A., Pinz, A., Fussenegger, M., Auer, P.: Generic Object Recognition with Boosting. IEEE Trans. on PAMI 28(3), 416–431 (2006)
Article Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1 (1986)
Google Scholar
Shotton, J., Johnson, M., Cipolla, R.: Semantic texton forests for image categorization and segmentation. In: CVPR (2008)
Google Scholar
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Royal. Statist. Soc B. 56(1), 267–288 (1996)
MathSciNet Google Scholar
Turk, M.: Eigenface for recognition. Journal of Cognitive Neuroscience (1991)
Google Scholar
Vedaldi, A., Fulkerson, B.: Vlfeat: an open and portable library of computer vision algorithms. In: ACM Multimedia, pp. 1469–1472 (2010)
Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: CVPR (2010)
Google Scholar
Wright, J., Yang, A., Ganesh, A., Sastry, S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. on PAMI 31(2) (2009)
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Lab of Neuro Imaging, University of California, Los Angeles, USA
Quannan Li & Zhuowen Tu
Microsoft Research Asia, China
Quannan Li, Cong Yao, Liwei Wang & Zhuowen Tu
Huazhong University of Science and Technology, China
Cong Yao
The Chinese University of Hong Kong, China
Liwei Wang

Authors

Quannan Li
View author publications
You can also search for this author in PubMed Google Scholar
Cong Yao
View author publications
You can also search for this author in PubMed Google Scholar
Liwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhuowen Tu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Vision Lab., ETH Zurich, Switzerland
Bjoern H. Menze
Computational Image Analysis and Radiology Lab Department of Radiology, Medical University of Vienna, Austria
Georg Langs
Siemens Corporate Research, 755 College Road East, 08540, Princeton, NJ, USA
Le Lu
GE Global Research, 1 Research Circle, 12309, Niskayuna, NY, USA
Albert Montillo
Lab of Neuro Imaging, Department of Neurology and Department of Computer Science, UCLA, USA
Zhuowen Tu
Microsoft Research, Cambridge, UK
Antonio Criminisi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, Q., Yao, C., Wang, L., Tu, Z. (2013). Randomness and Sparsity Induced Codebook Learning with Application to Cancer Image Classification. In: Menze, B.H., Langs, G., Lu, L., Montillo, A., Tu, Z., Criminisi, A. (eds) Medical Computer Vision. Recognition Techniques and Applications in Medical Imaging. MCV 2012. Lecture Notes in Computer Science, vol 7766. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36620-8_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-36620-8_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36619-2
Online ISBN: 978-3-642-36620-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics