Abstract
Classification of time series signals can be crucial for many practical applications. While the existing classifiers may accurately classify pure signals, the existence of noise can significantly disturb the classification accuracy of these classifiers. We propose a novel classification approach that uses multiple wavelets together with an ensemble of classifiers to return high classification accuracy even for noisy signals.
The proposed technique has two main steps. In Step 1, We convert raw signals into a useful dataset by applying multiple wavelet transforms, each from a different wavelet family or all from the same family with differing filter lengths. In Step 2, We apply the dataset processed in Step 1 to an ensemble of classifiers. We test on 500 noisy signals from five different classes. Our experimental results demonstrate the effectiveness of the proposed technique, on noisy signals, compared to the approaches that use either raw signals or a single wavelet transform.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The DWT in principle provides more information than the time series raw data points in classification because the DWT locates where the signal energies are concentrated in the frequency domain [3].
- 2.
In this new vector many data points are almost zero, a sparse representation.
- 3.
note: \(\sum ^J_{j=1} \frac{n}{2^j} = \frac{n}{2} + \frac{n}{4} + \dots + 2 + 1 = 2^J -1 = n-1\).
- 4.
If we decompose \(X_t\) to level \(I \le J\) then \( \mathbf V _I\) has \(n/2^{I}\) elements.
- 5.
Initially we applied the MDWT data to a single Decision tree classifier as a baseline and then to ensemble classifiers, which are outlined in Sect. 4.1.
- 6.
With these generated signals we attached class labels and such labels were used as the attribute for Class within the classification methods.
- 7.
Here Accuracy is defined as the number of correctly Classified Instances (with respect to our initial Class labels).
- 8.
Result tables not shown for this exercise.
- 9.
We maintain 500 noise enhanced signals at each step.
- 10.
For evaluation here, we use 80% data to form the training set and 20% of data for testing with 100 trees used for each classifier.
- 11.
The MDWT comprising of four wavelets from different families, we have labelled as “MDWT Diff”, similarly for the MDWT with four wavelets from the same family, as “MDWT Same”.
References
Clark, P., Niblett, T.: Induction in noisy domains. In: Progress in Machine Learning (from the Proceedings of the 2nd European Working Session on Learning), pp. 11–30. Sigma Press, Bled (1987)
Saez, J.A., Galar, M., Luengo, J., Herrera, F.: Tackling the problem of classification with noisy data using multiple classifier systems: analysis of the performance and robustness. Inf. Sci. 247(2013), 1–20 (2013). https://doi.org/10.1016/j.ins.2013.06.002
Fong, S.: Using hierarchical time series clustering algorithm and wavelet classifier for biometric voice classification. J. Biomed. Biotechnol. 2012, Article ID 215019 (2012)
Li, D., Bissyand’e, T.F., Klein, J., Traon, Y.L.: Time series classification with discrete wavelet transformed data: insights from an empirical study. In: The 28th International Conference on Software Engineering and Knowledge Engineering (2016)
Moazami-Goudarzi, M., Moradi, M.H., Abbasabadi, S.: High performance method for electrocardiogram compression using two dimensional multiwavelet transform. In: IEEE 7th Workshop on Multimedia Signal Processing (2006)
Graps, A.: An introduction to wavelets. IEEE Comput. Sci. Eng. 2(2), 50–61 (1995)
Percival, D.B., Walden, A.T.: Wavelet Methods for Time Series Analysis. Cambridge University Press, Cambridge (2000)
Nason, G.P.: Wavelet Methods in Statistics with R. Springer, New York (2008). https://doi.org/10.1007/978-0-387-75961-6
Adnan, M.N., Islam, M.Z.: A comprehensive method for attribute space extension for random forest. In: 17th International Conference on Computer and Information Technology, ICCIT 2014 (2014). https://doi.org/10.1109/ICCITechn.2014.7073129
R Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2013). http://www.R-project.org/
Constantine, W., Percival, D.: wmtsa: Wavelet Methods for Time Series Analysis. R package version 2.0-3 (2017). http://CRAN.R-project.org/package=wmtsa
Frank, E., Hall, M.A., Witten, I.H.: The WEKA Workbench. Online Appendix for “Data Mining: Practical Machine Learning Tools and Techniques”, 4th edn. Morgan Kaufmann (2016)
Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)
Adnan, M.N., Islam, M.Z.: Forest PA: constructing a decision forest by penalizing attributes used in previous trees. Expert Syst. Appl. 89, 389–403 (2017)
Islam, M.Z., Giggins, H.: Knowledge discovery through SysFor: a systematically developed forest of multiple decision trees. In: Proceedings of the Ninth Australasian Data Mining Conference (AusDM 2011) (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Grant, P., Islam, M.Z. (2019). A Novel Approach for Noisy Signal Classification Through the Use of Multiple Wavelets and Ensembles of Classifiers. In: Li, J., Wang, S., Qin, S., Li, X., Wang, S. (eds) Advanced Data Mining and Applications. ADMA 2019. Lecture Notes in Computer Science(), vol 11888. Springer, Cham. https://doi.org/10.1007/978-3-030-35231-8_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-35231-8_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35230-1
Online ISBN: 978-3-030-35231-8
eBook Packages: Computer ScienceComputer Science (R0)