Abstract
Hyperspectral imagery generally contains a very large amount of data due to hundreds of spectral bands. Band selection is often applied firstly to reduce computational cost and facilitate subsequent tasks such as land-cover classification and higher level image analysis. In this paper, we propose a new band selection algorithm using sparse nonnegative matrix factorization (sparse NMF). Though acting as a clustering method for band selection, sparse NMF need not consider the distance metric between different spectral bands, which is often the key step for most common clustering-based band selection methods. By imposing sparsity on the coefficient matrix, the bands’ clustering assignments can be easily indicated through the largest entry in each column of the matrix. Experimental results showed that sparse NMF provides considerable insight into the clustering-based band selection problem and the selected bands are good for land-cover classification.
Similar content being viewed by others
References
Archibald, R., Fann, G., 2007. Feature selection and classification of hyperspectral images, with support vector machines. IEEE Geosci. Remote Sens. Lett., 4(4):674–677. [doi:10.1109/LGRS.2007.905116]
Bazi, Y., Melgani, F., 2006. Toward an optimal SVM classification system for hyperspectral remote sensing images. IEEE Trans. Geosci. Remote Sens., 44(11):3374–3385. [doi:10.1109/TGRS.2006.880628]
Chang, C.C., Lin, C.J., 2001. LIBSVM: a Library for Support Vector Machines. Available from http://www.csie.ntu.edu.tw/~cjlin/libsvm/
Chang, C.I., Du, Q., Sun, T.L., Althouse, M.L.G., 1999. A joint band prioritization and band-decorrelation approach to band selection for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens., 37(6):2631–2641. [doi:10.1109/36.803411]
Cheng, Q., Varshney, P.K., Arora, M.K., 2006. Logistic regression for feature selection and soft classification of remote sensing data. IEEE Geosci. Remote Sens. Lett., 3(4):491–494. [doi:10.1109/LGRS.2006.877949]
Ding, C., He, X., Simon, H.D., 2005. On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering. Proc. SIAM Data Mining Conf., p.606–610.
Fisher, R.A., 1936. The use of multiple measurements in taxonomic problems. Ann. Hum. Genet., 7(2):179–188. [doi:10.1111/j.1469-1809.1936.tb02137.x]
Guyon, I., Elisseeff, A., 2003. An introduction to variable and feature selection. J. Mach. Learn. Res., 3(7–8):1157–1182. [doi:10.1162/153244303322753616]
Hoyer, P.O., 2004. Non-negative matrix factorization with sparseness constraints. J. Mach. Learn. Res., 5:1457–1469.
Hyvärinen, A., Oja, E., 2000. Independent component analysis: algorithms and applications. Neur. Networks, 13(4–5):411–430. [doi:10.1016/S0893-6080(00)00026-5]
Jolliffe, I., 2002. Principal Component Analysis (2nd Ed.). Springer, p.516.
Keshava, N., 2004. Distance metrics and band selection in hyperspectral processing with applications to material identification and spectral libraries. IEEE Trans. Geosci. Remote Sens., 42(7):1552–1565. [doi:10.1109/TGRS.2004.830549]
Kim, H., Park, H., 2007. Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis. Bioinformatics, 23(12):1495–1502. [doi:10.1093/bioinformatics/btm134]
Kim, J., Park, H., 2008. Sparse Nonnegative Matrix Factorization for Clustering. CSE Technical Report, GTCSE-08-01, Georgia Institute of Technology.
Lee, D.D., Seung, H.S., 1999. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788–791. [doi:10.1038/44565]
Lee, D.D., Seung, H.S., 2001. Algorithms for non-negative matrix factorization. Adv. Neur. Inform. Process. Syst., 13:556–562.
Martinez-Uso, A., Pla, F., Sotoca, J.M., Garcia-Sevilla, P., 2007. Clustering-based hyperspectral band selection using information measures. IEEE Trans. Geosci. Remote Sens., 45(12):4158–4171. [doi:10.1109/TGRS. 2007.904951]
Munoz-Mari, J., Bruzzone, L., Camps-Valls, G., 2007. A support vector domain description approach to supervised classification of remote sensing images. IEEE Trans. Geosci. Remote Sens., 45(8):2683–2692. [doi:10.1109/TGRS.2007.897425]
Neher, R., Srivastava, A., 2005. A Bayesian MRF framework for labeling terrain using hyperspectral imaging. IEEE Trans. Geosci. Remote Sens., 43(6):1363–1374. [doi:10.1109/TGRS.2005.846865]
Qian, Y., Yao, F., Jia, S., 2009. Band selection for hyperspectral imagery using affinity propagation. IET Comput. Vis., 3(4):213–222. [doi:10.1049/iet-cvi.2009.0034]
Shahnaz, F., Berry, M.W., Pauca, V.P., Plemmons, R.J., 2006. Document clustering using nonnegative matrix factorization. Inform. Process. Manag., 42(2):373–386. [doi:10.1016/j.ipm.2004.11.005]
Tibshirani, R., 1996. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B, 58(1):267–288.
Wang, S., Chang, C.I., 2007. Variable-number variable-band selection for feature characterization in hyperspectral signatures. IEEE Trans. Geosci. Remote Sens., 45(9):2979–2992. [doi:10.1109/TGRS.2007.901051]
Xu, W., Liu, X., Gong, Y., 2003. Document Clustering Based on Non-negative Matrix Factorization. Proc. 26th Annual Int. ACM SIGIR Conf. on Research and Development in Informaion Retrieval, p.267–273. [doi:10.1145/860435.860485]
Author information
Authors and Affiliations
Corresponding author
Additional information
Project (No. 60872071) supported by the National Natural Science Foundation of China
Rights and permissions
About this article
Cite this article
Li, Jm., Qian, Yt. Clustering-based hyperspectral band selection using sparse nonnegative matrix factorization. J. Zhejiang Univ. - Sci. C 12, 542–549 (2011). https://doi.org/10.1631/jzus.C1000304
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/jzus.C1000304