Reference Hub

This research has been cited in:

Chapter
HMLF_CDD_SSBM: A Hybrid Machine Learning Framework for Cardiovascular Disease Diagnosis Prediction Using the SMOTE Stacking MethodInternational Conference on Innovative Computing and Communications10.1007/978-981-99-3010-4_47
Conference
Data Obfuscation Technique in Cloud Security2021 2nd International Conference on Smart Electronics and Communication (ICOSEC)10.1109/ICOSEC51865.2021.9591915
Article
Development of a Statistical Model for Automated Ground Truth Generation in Low-Resource LanguagesSN Computer Science10.1007/s42979-024-02829-x
Chapter
EL-ID-BID: Ensemble Stacking-Based Intruder Detection in BoT-IoT DataInternational Conference on Innovative Computing and Communications10.1007/978-981-99-4071-4_62
Chapter
Visualizing Missing Data: COVID-2019Congress on Intelligent Systems10.1007/978-981-16-9416-5_41
Article
Providing Consistent State to Distributed Storage SystemComputers10.3390/computers10020023
Article
Developing an effective biclustering technique using an enhanced proximity measureNetwork Modeling Analysis in Health Informatics and Bioinformatics10.1007/s13721-019-0211-7
Chapter
AD-ResNet50: An Ensemble Deep Transfer Learning and SMOTE Model for Classification of Alzheimer’s DiseaseInternational Conference on Innovative Computing and Communications10.1007/978-981-99-4071-4_54
Article
Multiple imputation in big identifiable data for educational research: An example from the Brazilian education assessment systemEnsaio: Avaliação e Políticas Públicas em Educação10.1590/s0104-40362020002802346
Article
A new SEAIRD pandemic prediction model with clinical and epidemiological data analysis on COVID-19 outbreakApplied Intelligence10.1007/s10489-020-01938-3
Conference
An Ensemble Method for Heterogeneous Data Classification using Boosted k-NN with Active Learning2024 Fourth International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT)10.1109/ICAECT60202.2024.10468925
Article
RETRACTED: Detecting Faults within a Cloud Using Machine Learning TechniquesIOP Conference Series: Materials Science and Engineering10.1088/1757-899X/981/2/022029
Chapter
Weather Divergence of Season Through Regression AnalyticsIntelligent Sustainable Systems10.1007/978-981-16-2422-3_10
Article
A Secured and Effective Load Monitoring and Scheduling Migration VM in Cloud ComputingIOP Conference Series: Materials Science and Engineering10.1088/1757-899X/981/2/022069
Chapter
IPSO-SMOTE-AdaBoost: An Optimized Class Imbalance Strategy Using Boosting and PSO TechniquesInternational Conference on Innovative Computing and Communications10.1007/978-981-99-3010-4_46

Distributed Based Serial Regression Multiple Imputation for High Dimensional Multivariate Data in Multicore Environment of Cloud

Lavanya K., L.S.S. Reddy, B. Eswara Reddy

Source Title: International Journal of Ambient Computing and Intelligence (IJACI)10(2)

ISSN: 1941-6237|EISSN: 1941-6245|EISBN13: 9781522565079|DOI: 10.4018/IJACI.2019040105

Cite Article Cite Article

MLA

Lavanya K., et al. "Distributed Based Serial Regression Multiple Imputation for High Dimensional Multivariate Data in Multicore Environment of Cloud." IJACI vol.10, no.2 2019: pp.63-79. http://doi.org/10.4018/IJACI.2019040105

APA

Lavanya K., Reddy, L., & Reddy, B. E. (2019). Distributed Based Serial Regression Multiple Imputation for High Dimensional Multivariate Data in Multicore Environment of Cloud. International Journal of Ambient Computing and Intelligence (IJACI), 10(2), 63-79. http://doi.org/10.4018/IJACI.2019040105

Chicago

Lavanya K., L.S.S. Reddy, and B. Eswara Reddy. "Distributed Based Serial Regression Multiple Imputation for High Dimensional Multivariate Data in Multicore Environment of Cloud," International Journal of Ambient Computing and Intelligence (IJACI) 10, no.2: 63-79. http://doi.org/10.4018/IJACI.2019040105

Export Reference

Favorite Full-Issue Download

View Full Text HTML

View Full Text PDF

Abstract

Multiple imputations (MI) are predominantly applied in such processes that are involved in the transaction of huge chunks of missing data. Multivariate data that follow traditional statistical models undergoes great suffering for the inadequate availability of pertinent data. The field of distributed computing research faces the biggest hurdle in the form of insufficient high dimensional multivariate data. It mainly deals with the analysis of parallel input problems found in the cloud computing network in general and evaluation of high-performance computing in particular. In fact, it is a tough task to utilize parallel multiple input methods for accomplishing remarkable performance as well as allowing huge datasets achieves scale. In this regard, it is essential that a credible data system is developed and a decomposition strategy is used to partition workload in the entire process for minimum data dependence. Subsequently, a moderate synchronization and/or meager communication liability is followed for placing parallel impute methods for achieving scale as well as more processes. The present article proposes many novel applications for better efficiency. As the first step, this article suggests distributed-oriented serial regression multiple imputation for enhancing the efficiency of imputation task in high dimensional multivariate normal data. As the next step, the processes done in three diverse with parallel back ends viz. Multiple imputation that used the socket method to serve serial regression and the Fork Method to distribute work over workers, and also same work experiments in dynamic structure with a load balance mechanism. In the end, the set of distributed MI methods are used to experimentally analyze amplitude of imputation scores spanning across three probable scenarios in the range of 1:500. Further, the study makes an important observation that due to the efficiency of numerous imputation methods, the data is arranged proportionately in a missing range of 10% to 50%, low to high, while dealing with data between 1000 and 100,000 samples. The experiments are done in a cloud environment and demonstrate that it is possible to generate a decent speed by lessening the repetitive communication between processors.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email: *

Password: *

Forgot individual login password?

Create individual account

Distributed Based Serial Regression Multiple Imputation for High Dimensional Multivariate Data in Multicore Environment of Cloud

MLA

APA

Chicago

Export Reference

Abstract

Request Access