Abstract
In this study, we introduce a predictive model leveraging the scaled Dirichlet mixture model (SDMM). This data-driven approach offers enhanced accuracy in predictions, especially with a limited training dataset, surpassing traditional point estimation methods. Recent research has highlighted the flexibility of the Dirichlet distribution in modelling multivariate proportional data. Our research extends this by employing a scaled Dirichlet distribution, which incorporates additional parameters, to construct our predictive model. Furthermore, we address the challenge of data imbalance through a novel approach centred on data spread rate, effectively balancing the dataset to optimize model performance. Empirical evaluations demonstrate the model’s efficacy with both synthetic and real datasets, particularly in estimating occupancy in smart buildings.











Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
Data could be made available on reasonable request.
References
Tabatabaee Malazi H, Davari M (2018) Combining emerging patterns with random forest for complex activity recognition in smart homes. Appl Intell 48(2):315–330
Liouane Z, Lemlouma T, Roose P, Weis F, Messaoud H (2018) An improved extreme learning machine model for the prediction of human scenarios in smart homes. Appl Intell 48:2017–2030
Huang Y, Guan X, Chen H, Liang Y, Yuan S, Ohtsuki T (2019) Risk assessment of private information inference for motion sensor embedded iot devices. IEEE Transactions on Emerging Topics in Computational Intelligence. 4(3):265–275
Natani A, Sharma A, Perumal T (2021) Sequential neural networks for multi-resident activity recognition in ambient sensing smart homes. Appl Intell 51:6014–6028
Mansouri SA, Jordehi AR, Marzband M, Tostado-Véliz M, Jurado F, Aguado JA (2023) An iot-enabled hierarchical decentralized framework for multi-energy microgrids market management in the presence of smart prosumers using a deep learning-based forecaster. Appl Energy 333:120560
Wang X, Liu J, Moore SJ, Nugent CD, Xu Y (2023) A behavioural hierarchical analysis framework in a smart home: integrating hmm and probabilistic model checking. Inform Fusion
D’Oca S, Hong T, Langevin J (2018) The human dimensions of energy use in buildings: a review. Renew Sust Energ Rev 81:731–742
Yan Y, Luh PB, Pattipati KR (2020) Fault prognosis of key components in hvac air-handling systems at component and system levels. IEEE Trans Autom Sci Eng 17(4):2145–2153
Yang Y, Hu G, Spanos CJ (2020) Hvac energy cost optimization for a multizone building via a decentralized approach. IEEE Trans Autom Sci Eng 17(4):1950–1960
Erickson VL, Carreira-Perpiñán MÁ, Cerpa AE (2014) Occupancy modeling and prediction for building energy management. ACM Trans Sensor Netw (TOSN). 10(3):1–28
Brooks J, Goyal S, Subramany R, Lin Y, Middelkoop T, Arpan L, Carloni L, Barooah P (2014) An experimental investigation of occupancy-based energy-efficient control of commercial building indoor climate. In: 53rd IEEE Conference on decision and control, IEEE, pp 5680–5685
Brooks J, Kumar S, Goyal S, Subramany R, Barooah P (2015) Energy-efficient control of under-actuated hvac zones in commercial buildings. Energy and Buildings. 93:160–168
Nesa N, Banerjee I (2017) Iot-based sensor data fusion for occupancy sensing using dempster-shafer evidence theory for smart buildings. IEEE Internet Things J 4(5):1563–1570
Zimmermann L, Weigel R, Fischer G (2017) Fusion of nonintrusive environmental sensors for occupancy detection in smart homes. IEEE Internet Things J 5(4):2343–2352
Balakumar P, Vinopraba T, Chandrasekaran K (2023) Machine learning based demand response scheme for iot enabled pv integrated smart building. Sustain Cities Soc 89:104260
Diethe T, Twomey N, Flach PA (2016) Active transfer learning for activity recognition. In: ESANN
Hossain HS, Khan MAAH, Roy N (2017) Active learning enabled activity recognition. Pervasive Mob Comput 38:312–330
Ma Z, Leijon A, Tan Z-H, Gao S (2014) Predictive distribution of the dirichlet mixture model by local variational inference. J Signal Process Syst 74:359–374
Bishop CM, Nasrabadi NM (2006) Pattern Recognition and Machine Learning, Springer, vol 4
Fan W, Yang L, Bouguila N (2021) Unsupervised grouped axial data modeling via hierarchical bayesian nonparametric models with watson distributions. IEEE Trans Pattern Anal Mach Intell 44(12):9654–9668
Fan W, Bouguila N, Du J-X, Liu X (2018) Axially symmetric data clustering through dirichlet process mixture models of watson distributions. IEEE Trans Neural Netw Learn Syst 30(6):1683–1694
Fan W, Bouguila N, Ziou D (2012) Variational learning for finite dirichlet mixture models and applications. IEEE Trans Neural Netw Learn Syst 23(5):762–774
Monti GS, Mateu-Figueras G, Pawlowsky-Glahn V (2011) Notes on the scaled dirichlet distribution. Compositional data Anal pp 128–138
Nguyen H, Azam M, Bouguila N (2019) Data clustering using variational learning of finite scaled dirichlet mixture models. In: 2019 IEEE 28th International symposium on industrial electronics (ISIE), IEEE, pp 1391–1396
Oboh BS, Bouguila N (2017) Unsupervised learning of finite mixtures using scaled dirichlet distribution and its application to software modules categorization. In: 2017 IEEE International conference on industrial technology (ICIT), IEEE, pp 1085–1090
Zamzami N, Alsuroji R, Eromonsele O, Bouguila N (2020) Proportional data modeling via selection and estimation of a finite mixture of scaled dirichlet distributions. Comput Intell 36(2):459–485
Amayri M, Ploix S, Bouguila N, Wurtz F (2020) Database quality assessment for interactive learning: application to occupancy estimation. Energy and Buildings. 209:109578
Guo J, Amayri M, Fan W, Bouguila N (2022) A generalized inverted dirichlet predictive model for activity recognition using small training data. In: Advances and trends in artificial intelligence. Theory and practices in artificial intelligence: 35th international conference on industrial, engineering and other applications of applied intelligent systems, IEA/AIE 2022, Kitakyushu, Japan, Proceedings, Springer, pp 431–442. Accessed 19–22 July 2022
Joo I-Y, Choi D-H (2017) Optimal household appliance scheduling considering consumer’s electricity bill target. IEEE Trans Consum Electron 63(1):19–27
Rajasekhar B, Tushar W, Lork C, Zhou Y, Yuen C, Pindoriya NM, Wood KL (2020) A survey of computational intelligence techniques for air-conditioners energy management. IEEE Trans Emerg Top Comput Intell 4(4):555–570
Leonori S, Martino A, Mascioli FMF, Rizzi A (2019) Anfis microgrid energy management system synthesis by hyperplane clustering supported by neurofuzzy min-max classifier. IEEE Trans Emerg Top Comput Intell 3(3):193–204
Jiang J, Wang C, Roth T, Nguyen C, Kamongi P, Lee H, Liu Y (2021) Residential house occupancy detection: trust-based scheme using economic and privacy-aware sensors. IEEE Internet Things J 9(3):1938–1950
Zou H, Jiang H, Yang J, Xie L, Spanos C (2017) Non-intrusive occupancy sensing in commercial buildings. Energy and Buildings. 154:633–643
Petersen S, Pedersen TH, Nielsen KU, Knudsen MD (2016) Establishing an image-based ground truth for validation of sensor data-based room occupancy detection. Energy and Buildings. 130:787–793
Amayri M, Arora A, Ploix S, Bandhyopadyay S, Ngo Q-D, Badarla VR (2016) Estimating occupancy in heterogeneous sensor environment. Energy and Buildings. 129:46–58
Wang W, Zhang M, Zhang L (2018) Classification of data stream in sensor network with small samples. IEEE Internet Things J 6(4):6018–6025
Snell J, Swersky K, Zemel R (2017) Prototypical networks for few-shot learning. Adv Neural Inform Process Syst 30
Vinyals O, Blundell C, Lillicrap T, Wierstra D et al (2016) Matching networks for one shot learning. Adv Neural Inform Process Syst 29
Branco P, Torgo L, Ribeiro RP (2016) A survey of predictive modeling on imbalanced domains. ACM Comput Surv (CSUR) 49(2):1–50
Dickey JM (1968) Three multidimensional-integral identities with bayesian applications. Ann Math Stat pp 1615–1628
Guo J, Amayri M, Fan W, Bouguila N (2022) Beta-liouville and inverted beta-liouville based predictive models for occupancy detection using small training data. In: 2022 IEEE Symposium series on computational intelligence (SSCI), IEEE, pp 223–230
Wang J-D, Liu H-C (2011) An approach to evaluate the fitness of one class structure via dynamic centroids. Expert Syst Appl 38(11):13764–13772
Wang J-D, Liu H-C, Shi Y-C (2009) A novel approach for evaluating class structure ambiguity. In: 2009 International conference on machine learning and cybernetics, IEEE, vol 3, pp 1550–1555
Panaretos VM, Zemel Y (2019) Statistical aspects of wasserstein distances. Ann Rev Stat Appl 6:405–431
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
Manouchehri N, Dalhoumi O, Amayri M, Bouguila N (2020) Variational learning of a shifted scaled dirichlet model with component splitting approach. In: 2020 Third international conference on artificial intelligence for industries (AI4I), IEEE, pp 75–78
Guo J, Amayri M, Fan W, Bouguila N (2023) Liouville-based predictive models for occupancy estimation using small training data. IEEE Internet Things J
Guo J, Amayri M, Najar F, Fan W, Bouguila N (2023) Occupancy estimation in smart buildings using predictive modeling in imbalanced domains. J Ambient Intell Humaniz Comput 14(8):10917–10929
Acknowledgements
The completion of this research was made possible thanks to the Natural Sciences and Engineering Research Council of Canada (NSERC), a start-up grant awarded to Manar Amayri from Concordia University, the National Natural Science Foundation of China (62276106), the Guangdong Provincial Key Laboratory IRADS (2022B1212010006, R0400001-22) and the UIC Start-up Research Fund (UICR0700056-23).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Guo, J., Amayri, M., Fan, W. et al. A scaled dirichlet-based predictive model for occupancy estimation in smart buildings. Appl Intell 54, 6981–6996 (2024). https://doi.org/10.1007/s10489-024-05543-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05543-6