Abstract
Fusion effect of SVM in the Spark architecture for speech data mining in cluster structure is studied in this manuscript. Based on the information entropy of nodes, the data in clusters are fused to eliminate redundant data and improve the efficiency of information fusion. Information entropy is a statistical form based on the characteristics of information representation, which reflects the average amount of information in information. Based on the Spark platform SVM algorithm, the frequent items with the highest support after each sort are directly recursively obtained, and the transaction data set is allocated to each computing node. The structure of the item head table directly affects the efficiency of the algorithm, so optimizing the structure of the item head table can improve the efficiency of the algorithm in constructing FP-Tree, and then improve the efficiency of the whole algorithm. The proposed speech data mining algorithm can cluster, analyze, and comprehensively detection the saliency information, the detection accuracy is much higher than the state-of-the-art models. The experimental results compared with the latest research have reflected that fact that the proposed model has the better performance and robustness.
Similar content being viewed by others
References
Alassaf, N., Gutub, A., Parah, S. A., & Al Ghamdi, M. (2019). Enhancing speed of SIMON: A light-weight-cryptographic algorithm for IoT applications. Multimedia Tools and Applications, 78(23), 32633–32657.
Chauhan, D. S., Singh, A. K., Kumar, B., & Saini, J. P. (2019). Quantization based multiple medical information watermarking for secure e-health. Multimedia Tools and Applications, 78(4), 3911–3923.
Chen, Q., Zhang, G., Yang, X., Li, S., Li, Y., & Wang, H. H. (2018). Single image shadow detection and removal based on feature fusion and multiple dictionary learning. Multimedia Tools and Applications, 77(14), 18601–18624.
Cordero, J. A., Nebro, A. J., Barba-González, C., Durillo, J. J., García-Nieto, J., Navas-Delgado, I., et al. (2016). Dynamic multi-objective optimization with jmetal and spark: A case study. International workshop on machine learning, optimization, and big data (pp. 106–117). Cham: Springer.
Deng, W., Yao, R., Zhao, H., Yang, X., & Li, G. (2019). A novel intelligent diagnosis method using optimal LS-SVM with improved PSO algorithm. Soft Computing, 23(7), 2445–2462.
Gupta, A., Thakur, H. K., Shrivastava, R., Kumar, P., & Nag, S. (2017, November). A big data analysis framework using apache spark and deep learning. In 2017 IEEE international conference on data mining workshops (ICDMW) (pp. 9–16). IEEE.
Ibrahim, F., El-Gindy, S. A. E., El-Dolil, S. M., El-Fishawy, A. S., El-Rabaie, E. S. M., Dessouky, M. I., et al. (2019). A statistical framework for EEG channel selection and seizure prediction on mobile. International Journal of Speech Technology, 22(1), 191–203.
Kadyan, V., Mantri, A., Aggarwal, R. K., & Singh, A. (2019). A comparative study of deep neural network based Punjabi-ASR system. International Journal of Speech Technology, 22(1), 111–119.
Lang, S. M., Bernhardt, T. M., Bakker, J. M., Yoon, B., & Landman, U. (2018). The interaction of ethylene with free gold cluster cations: Infrared photodissociation spectroscopy combined with electronic and vibrational structure calculations. Journal of Physics: Condensed Matter, 30(50), 504001.
Maleki, N., Loni, M., Daneshtalab, M., Conti, M., & Fotouhi, H. (2019). SoFA: A spark-oriented fog architecture. In IECON 2019-45th annual conference of the IEEE industrial electronics society (Vol. 1, pp. 2792–2799). IEEE.
Ning, J., Yang, J., Jiang, S., Zhang, L., & Yang, M. H. (2016). Object tracking via dual linear structured SVM and explicit feature map. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4266–4274).
El Ouahabi, S., Atounti, M., & Bellouki, M. (2019). Toward an automatic speech recognition system for amazigh-tarifit language. International Journal of Speech Technology, 22(2), 421–432.
Ozcan, T., & Basturk, A. (2019). Transfer learning-based convolutional neural networks with heuristic optimization for hand gesture recognition. Neural Computing and Applications, 31(12), 8955–8970.
Ray, R. B., Kumar, M., & Rath, S. K. (2016). Fast computing of microarray data using resilient distributed dataset of apache spark. Recent advances in information and communication technology 2016 (pp. 171–182). Cham: Springer.
Sangaiah, A. K., Medhane, D. V., Han, T., Hossain, M. S., & Muhammad, G. (2019). Enforcing position-based confidentiality with machine learning paradigm through mobile edge computing in real-time industrial informatics. IEEE Transactions on Industrial Informatics, 15(7), 4189–4196.
Singh, T., Di Troia, F., Corrado, V. A., Austin, T. H., & Stamp, M. (2016). Support vector machines and malware detection. Journal of Computer Virology and Hacking Techniques, 12(4), 203–212.
Song, T., Pang, S., Hao, S., Rodríguez-Patón, A., & Zheng, P. (2019). A parallel image skeletonizing method using spiking neural P systems with weights. Neural Processing Letters, 50(2), 1485–1502.
Souza, M. A., Miyake, H., Borello-Lewin, T., da Rocha, C. A., & Frajuca, C. (2019). α-Cluster structure above double-shell closures and α-decay of 104Te. Physics Letters B, 793, 8–12.
Sreeyuktha, H. S., & Reddy, J. G. (2019). Partitioning in apache spark. Innovations in computer science and engineering (pp. 493–498). Singapore: Springer.
Suthakar, U., Magnoni, L., Smith, D. R., & Khan, A. (2016). Optimised lambda architecture for monitoring WLCG using spark and spark streaming. In 2016 IEEE nuclear science symposium, medical imaging conference and room-temperature semiconductor detector workshop (NSS/MIC/RTSD) (pp. 1–2). IEEE.
Śmieja, M., & Wiercioch, M. (2017). Constrained clustering with a complex cluster structure. Advances in Data Analysis and Classification, 11(3), 493–518.
Talan, P. P., Sharma, K. U., Nawade, P. P., & Talan, K. P. (2019). An overview of hadoop MapReduce, spark, and scalable graph processing architecture. Recent developments in machine learning and data analytics (pp. 35–42). Singapore: Springer.
Thakur, S., Singh, A. K., Ghrera, S. P., & Elhoseny, M. (2019). Multi-layer security of medical data through watermarking and chaotic encryption for tele-health applications. Multimedia Tools and Applications, 78(3), 3457–3470.
Vapnik, V., & Izmailov, R. (2017). Knowledge transfer in SVM and neural networks. Annals of Mathematics and Artificial Intelligence, 81(1–2), 3–19.
Wang, W., Lilyestrom, W. G., Hu, Z. Y., & Scherer, T. M. (2018). Cluster size and quinary structure determine the rheological effects of antibody self-association at high concentrations. The Journal of Physical Chemistry B, 122(7), 2138–2154.
Wottschel, V., Chard, D. T., Enzinger, C., Filippi, M., Frederiksen, J. L., Gasperini, C., et al. (2019). SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis. NeuroImage: Clinical, 24, 102011.
Wu, X., Zuo, W., Lin, L., Jia, W., & Zhang, D. (2018). F-SVM: Combination of feature transformation and SVM learning via convex relaxation. IEEE Transactions on Neural Networks and Learning Systems, 29(11), 5185–5199.
Xiong, X., Tang, R., & Yang, X. (2019). Finite-time synchronization of memristive neural networks with proportional delay. Neural Processing Letters, 50(2), 1139–1152.
Yu, Z., Zhu, X., Wong, H. S., You, J., Zhang, J., & Han, G. (2016). Distribution-based cluster structure selection. IEEE Transactions on Cybernetics, 47(11), 3554–3567.
Zhang, S., Wang, H., & Huang, W. (2017). Two-stage plant species recognition by local mean clustering and Weighted sparse representation classification. Cluster Computing, 20(2), 1517–1525.
Zhang, S., Wang, H., Huang, W., & You, Z. (2018). Plant diseased leaf segmentation and recognition by fusion of superpixel, K-means and PHOG. Optik, 157, 866–872.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Shen, J., Wang, H.H. Fusion effect of SVM in spark architecture for speech data mining in cluster structure. Int J Speech Technol 23, 481–488 (2020). https://doi.org/10.1007/s10772-020-09710-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-020-09710-1