ABSTRACT
The increase use of high dimensional, geographically distributed rich and massive meteorological data poses an increasing scientific challenge in efficient outlier mining. Properties in such meteorological data are observed to fluctuate in spatial synchrony. Capturing this spatial variation at different spatial scales requires a multi-resolution analysis. In this paper, we develop an algorithm for region outlier detection at different scales using the multi-resolution feature of wavelet analysis. Another challenge of meteorological data mining is that the data size is huge to accommodate different resolutions and number of samples varies with the spatial scales. This motivated us to design a load adaptive parallel algorithm for outlier detection which can maintain good scalability for all spatial scales. Our algorithm has been implemented on high-performance computing architecture and evaluated on real-world meteorological data.
- S. Barua and R. Alhajj. High performance computing for spatial outliers detection using parallel wavelet transform. Journal of Intell. Data Anal. (in press). Google ScholarDigital Library
- I. Daubechies. Ten lectures on wavelets. SIAM, Philadelphia, PA, USA, 1992. Google ScholarDigital Library
- Y. Kou, C.-T. Lu, and D. Chen. Spatial weighted outlier detection. In SDM, pages 613--617, USA, 2006.Google ScholarCross Ref
- H. Lee, J. Liu, A. Chan, and C. Chui. Parallel implementation of wavelet decomposition reconstruction algorithms. In Proc. SPIE Wavelet Application Conference, pages 248--259, Florida, 1994.Google ScholarCross Ref
- C.-T. Lu, D. Chen, and Y. Kou. Algorithms for spatial outlier detection. In Proc. of ICDM, pages 597--600, Washington, DC, USA, 2003. IEEE Computer Society. Google ScholarDigital Library
- C.-T. Lu, Y. Kou, J. Zhao, and L. Chen. Detecting and tracking regional outliers in meteorological data. Inf. Sci., 177(7):1609--1632, 2007. Google ScholarDigital Library
- S. Mallat. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11(7):674--693, 1989. Google ScholarDigital Library
- S. Shekhar, C.-T. Lu, and P. Zhang. Detecting graph-based spatial outliers: algorithms and applications (a summary of results). In Proc. of the seventh ACM SIGKDD, pages 371--376, 2001. Google ScholarDigital Library
- A. Uhl. A parallel wavelet image block-coding algorithm. In Proc. of HiPC, pages 61--66, India, 1995.Google Scholar
- D. Yu, G. Sheikholeslami, and A. Zhang. Findout: finding outliers in very large datasets. Knowl. Inf. Syst., 4(4):387--412, 2002. Google ScholarDigital Library
Index Terms
- A parallel multi-scale region outlier mining algorithm for meteorological data
Recommendations
High performance computing for spatial outliers detection using parallel wavelet transform
Wavelet analysis is a practical tool to study signal analysis and image processing. Traditional Fourier transform can also transfer the signal into frequency domain, but wavelet analysis is more attractive for its features of multi-resolution and ...
A parallel vector quantization algorithm for SIMD multiprocessor systems
DCC '95: Proceedings of the Conference on Data CompressionSummary form only given , as follows. This article proposes a parallel vector quantization (VQ) algorithm for an exhaustive search of codebooks on a single-instruction-multiple-data (SIMD) multiprocessor. The proposed parallel VQ algorithm can be ...
Parallel and Distributed Spatial Outlier Mining in Grid: Algorithm, Design and Application
There is an increasing interest in the field of parallel and distributed data mining in grid environment over the past decade. As an important branch of spatial data mining, spatial outlier mining can be used to find out some interesting and unexpected ...
Comments