Abstract
Remote sensing has produced data repositories that grow much faster than they can be analyzed. One obstacle in working with remotely sensed data, as with other data, is its variable quality. Instrument failures can cause entire observation cycles to be missing, while cloud cover frequently produces missing or distorted values. We investigated several methods that deal with corruptions in the data automatically: robust measures, which avoid overfitting; filtering, which discards the corrupted instances; and polishing, in which the corrupted elements are replaced with more appropriate values. We applied these methods to a data set of vegetation indices and land cover types assembled from NASA’s Moderate Resolution Imaging Spectroradiometer (MODIS) data collection.
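To make the difference between the three approaches concrete, the following is a minimal sketch on a toy data set, not the paper's implementation. The abstract does not say how corrupted values are detected or how replacement values are chosen, so everything here is an assumption for illustration: corrupted values are taken to be already flagged as `None`, polishing is approximated by the median of the clean values in the same instance, and the names `filter_instances`, `polish_instances`, and `robust_center` are hypothetical.

```python
from statistics import median

# Hypothetical toy data: each row is one pixel's vegetation-index series.
# None marks a value assumed to be corrupted (for example by cloud cover).
# How corruption is detected is NOT specified in the abstract.
series = [
    [0.61, 0.64, None, 0.70],
    [0.20, 0.22, 0.21, 0.23],
    [0.45, None, 0.50, 0.52],
]


def filter_instances(rows):
    """Filtering: discard every instance that contains a corrupted value."""
    return [row for row in rows if all(v is not None for v in row)]


def polish_instances(rows):
    """Polishing (simplified stand-in): replace each corrupted value with
    the median of the clean values in the same instance."""
    polished = []
    for row in rows:
        clean = [v for v in row if v is not None]
        if not clean:
            # Nothing to estimate from; leave the instance unchanged.
            polished.append(list(row))
            continue
        fill = median(clean)
        polished.append([fill if v is None else v for v in row])
    return polished


def robust_center(values):
    """Robust measure: the median is far less sensitive to a few
    extreme values than the mean."""
    return median(values)


filtered = filter_instances(series)
polished = polish_instances(series)
```

In this sketch, filtering keeps only the one fully clean instance, while polishing keeps all three and fills in the two flagged values. That is the trade-off the abstract points to: filtering loses data, whereas polishing depends on how good the replacement values are.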
This work was supported by NASA NCC2-1239, NNA04CK88A and ONR N00014-03-1-0516.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Teng, C.M. (2005). Dealing with Data Corruption in Remote Sensing. In: Famili, A.F., Kok, J.N., Peña, J.M., Siebes, A., Feelders, A. (eds) Advances in Intelligent Data Analysis VI. IDA 2005. Lecture Notes in Computer Science, vol 3646. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11552253_41
DOI: https://doi.org/10.1007/11552253_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28795-7
Online ISBN: 978-3-540-31926-9
eBook Packages: Computer Science, Computer Science (R0)