Abstract
Data from sensor logs in raw form is generally continuous valued. This data from multiple sensors in continuous stream becomes voluminous. For knowledge discovery like extraction of context, from these datasets, standard machine learning algorithms or their variations are used as classifier. Most classification schemes require the input data to be discretized. The focus of this paper is to study merits of some popular discretization methods when applied on noisy sensor logs. Representative methods from supervised and unsupervised discretization, like binning, clustering and entropy minimization are evaluated with context extraction. Interestingly, unlike common perception, for discretization of sensor data, supervised algorithms do not have a clear edge over unsupervised.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
McGrath, M., Dishongh, T.: Wireless Sensor Networks for Healthcare Applications. Artech House, London (2010)
Figo, D., Diniz, P.C., Ferreira, D.R., Cardoso, J.M.: Preprocessing techniques for context recognition from accelerometer data. Personal and Ubiquitous Computing 14(7), 645–662 (2010)
Randell, C., Muller, H.: Context awareness by analysing accelerometer data. In: Proceedings of The Fourth International Symposium on Wearable Computers, pp. 175–176. IEEE (2000)
Roggen, D., et al.: Collecting complex activity data sets in highly rich networked sensor environments. In: Seventh International Conference on Networked Sensing Systems, Kassel, Germany (June 2010)
Kulkarni, R.V., Anna, F., Kumar, G.V.: Computational intelligence in wireless sensor networks: A survey. IEEE Communications Surveys & Tutorials 13(1), 68–96 (2011)
Mittal, S., Aggarwal, A., Maskara, S.L.: Application of Bayesian Belief Networks for context extraction from wireless sensors data. In: Proceedings of 14th International Conference on Advanced Communication Technology (ICACT), South Korea,, pp. 410–415 (2012)
Mittal, S., Gopal, K., Maskara, S.L.: A Versatile Lattice Based Model for Situation Recognition from Dynamic Ambient Sensors. International Journal on Smart Sensing and Intelligent Systems 6(1), 403–432 (2013)
Chmielewski, M.R., Grzymala-Busse, J.W.: Global discretization of continuous attributes as preprocessing for machine learning. International Journal of Approximate Reasoning 15(4), 319–331 (1996)
Gemmeke, J.F., Van Hamme, H., Cranen, B., Boves, L.: Compressive sensing for missing data imputation in noise robust speech recognition. IEEE Journal of Selected Topics in Signal Processing 4(2), 272–287 (2010)
Mittal, S., Aggarwal, A., Maskara, S.L.: Online Cleaning of Wireless Sensor Data Resulting in Improved Context Extraction. International Journal of Computer Applications 60(15), 24–32 (2012)
Claudio, B., et al.: A survey of context modelling and reasoning techniques. Pervasive Mobile Computing 6(2), 161–180 (2010)
Fatourechi, M., et al.: Comparison of evaluation metrics in classification applications with imbalanced datasets. In: Seventh IEEE International Conference on Machine Learning and Applications, pp. 777–782 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Mittal, S., Gopal, K., Maskara, S.L. (2014). Effect of Choice of Discretization Methods on Context Extraction from Sensor Data – An Empirical Evaluation. In: Natarajan, R. (eds) Distributed Computing and Internet Technology. ICDCIT 2014. Lecture Notes in Computer Science, vol 8337. Springer, Cham. https://doi.org/10.1007/978-3-319-04483-5_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-04483-5_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04482-8
Online ISBN: 978-3-319-04483-5
eBook Packages: Computer ScienceComputer Science (R0)