Skip to main content
Log in

Fusion effect of SVM in spark architecture for speech data mining in cluster structure

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

Fusion effect of SVM in the Spark architecture for speech data mining in cluster structure is studied in this manuscript. Based on the information entropy of nodes, the data in clusters are fused to eliminate redundant data and improve the efficiency of information fusion. Information entropy is a statistical form based on the characteristics of information representation, which reflects the average amount of information in information. Based on the Spark platform SVM algorithm, the frequent items with the highest support after each sort are directly recursively obtained, and the transaction data set is allocated to each computing node. The structure of the item head table directly affects the efficiency of the algorithm, so optimizing the structure of the item head table can improve the efficiency of the algorithm in constructing FP-Tree, and then improve the efficiency of the whole algorithm. The proposed speech data mining algorithm can cluster, analyze, and comprehensively detection the saliency information, the detection accuracy is much higher than the state-of-the-art models. The experimental results compared with the latest research have reflected that fact that the proposed model has the better performance and robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Alassaf, N., Gutub, A., Parah, S. A., & Al Ghamdi, M. (2019). Enhancing speed of SIMON: A light-weight-cryptographic algorithm for IoT applications. Multimedia Tools and Applications, 78(23), 32633–32657.

    Article  Google Scholar 

  • Chauhan, D. S., Singh, A. K., Kumar, B., & Saini, J. P. (2019). Quantization based multiple medical information watermarking for secure e-health. Multimedia Tools and Applications, 78(4), 3911–3923.

    Article  Google Scholar 

  • Chen, Q., Zhang, G., Yang, X., Li, S., Li, Y., & Wang, H. H. (2018). Single image shadow detection and removal based on feature fusion and multiple dictionary learning. Multimedia Tools and Applications, 77(14), 18601–18624.

    Article  Google Scholar 

  • Cordero, J. A., Nebro, A. J., Barba-González, C., Durillo, J. J., García-Nieto, J., Navas-Delgado, I., et al. (2016). Dynamic multi-objective optimization with jmetal and spark: A case study. International workshop on machine learning, optimization, and big data (pp. 106–117). Cham: Springer.

    Chapter  Google Scholar 

  • Deng, W., Yao, R., Zhao, H., Yang, X., & Li, G. (2019). A novel intelligent diagnosis method using optimal LS-SVM with improved PSO algorithm. Soft Computing, 23(7), 2445–2462.

    Article  Google Scholar 

  • Gupta, A., Thakur, H. K., Shrivastava, R., Kumar, P., & Nag, S. (2017, November). A big data analysis framework using apache spark and deep learning. In 2017 IEEE international conference on data mining workshops (ICDMW) (pp. 9–16). IEEE.

  • Ibrahim, F., El-Gindy, S. A. E., El-Dolil, S. M., El-Fishawy, A. S., El-Rabaie, E. S. M., Dessouky, M. I., et al. (2019). A statistical framework for EEG channel selection and seizure prediction on mobile. International Journal of Speech Technology, 22(1), 191–203.

    Article  Google Scholar 

  • Kadyan, V., Mantri, A., Aggarwal, R. K., & Singh, A. (2019). A comparative study of deep neural network based Punjabi-ASR system. International Journal of Speech Technology, 22(1), 111–119.

    Article  Google Scholar 

  • Lang, S. M., Bernhardt, T. M., Bakker, J. M., Yoon, B., & Landman, U. (2018). The interaction of ethylene with free gold cluster cations: Infrared photodissociation spectroscopy combined with electronic and vibrational structure calculations. Journal of Physics: Condensed Matter, 30(50), 504001.

    Google Scholar 

  • Maleki, N., Loni, M., Daneshtalab, M., Conti, M., & Fotouhi, H. (2019). SoFA: A spark-oriented fog architecture. In IECON 2019-45th annual conference of the IEEE industrial electronics society (Vol. 1, pp. 2792–2799). IEEE.

  • Ning, J., Yang, J., Jiang, S., Zhang, L., & Yang, M. H. (2016). Object tracking via dual linear structured SVM and explicit feature map. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4266–4274).

  • El Ouahabi, S., Atounti, M., & Bellouki, M. (2019). Toward an automatic speech recognition system for amazigh-tarifit language. International Journal of Speech Technology, 22(2), 421–432.

    Article  Google Scholar 

  • Ozcan, T., & Basturk, A. (2019). Transfer learning-based convolutional neural networks with heuristic optimization for hand gesture recognition. Neural Computing and Applications, 31(12), 8955–8970.

    Article  Google Scholar 

  • Ray, R. B., Kumar, M., & Rath, S. K. (2016). Fast computing of microarray data using resilient distributed dataset of apache spark. Recent advances in information and communication technology 2016 (pp. 171–182). Cham: Springer.

    Google Scholar 

  • Sangaiah, A. K., Medhane, D. V., Han, T., Hossain, M. S., & Muhammad, G. (2019). Enforcing position-based confidentiality with machine learning paradigm through mobile edge computing in real-time industrial informatics. IEEE Transactions on Industrial Informatics, 15(7), 4189–4196.

    Article  Google Scholar 

  • Singh, T., Di Troia, F., Corrado, V. A., Austin, T. H., & Stamp, M. (2016). Support vector machines and malware detection. Journal of Computer Virology and Hacking Techniques, 12(4), 203–212.

    Article  Google Scholar 

  • Song, T., Pang, S., Hao, S., Rodríguez-Patón, A., & Zheng, P. (2019). A parallel image skeletonizing method using spiking neural P systems with weights. Neural Processing Letters, 50(2), 1485–1502.

    Article  Google Scholar 

  • Souza, M. A., Miyake, H., Borello-Lewin, T., da Rocha, C. A., & Frajuca, C. (2019). α-Cluster structure above double-shell closures and α-decay of 104Te. Physics Letters B, 793, 8–12.

    Article  Google Scholar 

  • Sreeyuktha, H. S., & Reddy, J. G. (2019). Partitioning in apache spark. Innovations in computer science and engineering (pp. 493–498). Singapore: Springer.

    Chapter  Google Scholar 

  • Suthakar, U., Magnoni, L., Smith, D. R., & Khan, A. (2016). Optimised lambda architecture for monitoring WLCG using spark and spark streaming. In 2016 IEEE nuclear science symposium, medical imaging conference and room-temperature semiconductor detector workshop (NSS/MIC/RTSD) (pp. 1–2). IEEE.

  • Śmieja, M., & Wiercioch, M. (2017). Constrained clustering with a complex cluster structure. Advances in Data Analysis and Classification, 11(3), 493–518.

    Article  MathSciNet  Google Scholar 

  • Talan, P. P., Sharma, K. U., Nawade, P. P., & Talan, K. P. (2019). An overview of hadoop MapReduce, spark, and scalable graph processing architecture. Recent developments in machine learning and data analytics (pp. 35–42). Singapore: Springer.

    Chapter  Google Scholar 

  • Thakur, S., Singh, A. K., Ghrera, S. P., & Elhoseny, M. (2019). Multi-layer security of medical data through watermarking and chaotic encryption for tele-health applications. Multimedia Tools and Applications, 78(3), 3457–3470.

    Article  Google Scholar 

  • Vapnik, V., & Izmailov, R. (2017). Knowledge transfer in SVM and neural networks. Annals of Mathematics and Artificial Intelligence, 81(1–2), 3–19.

    Article  MathSciNet  Google Scholar 

  • Wang, W., Lilyestrom, W. G., Hu, Z. Y., & Scherer, T. M. (2018). Cluster size and quinary structure determine the rheological effects of antibody self-association at high concentrations. The Journal of Physical Chemistry B, 122(7), 2138–2154.

    Article  Google Scholar 

  • Wottschel, V., Chard, D. T., Enzinger, C., Filippi, M., Frederiksen, J. L., Gasperini, C., et al. (2019). SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis. NeuroImage: Clinical, 24, 102011.

    Article  Google Scholar 

  • Wu, X., Zuo, W., Lin, L., Jia, W., & Zhang, D. (2018). F-SVM: Combination of feature transformation and SVM learning via convex relaxation. IEEE Transactions on Neural Networks and Learning Systems, 29(11), 5185–5199.

    Article  Google Scholar 

  • Xiong, X., Tang, R., & Yang, X. (2019). Finite-time synchronization of memristive neural networks with proportional delay. Neural Processing Letters, 50(2), 1139–1152.

    Article  Google Scholar 

  • Yu, Z., Zhu, X., Wong, H. S., You, J., Zhang, J., & Han, G. (2016). Distribution-based cluster structure selection. IEEE Transactions on Cybernetics, 47(11), 3554–3567.

    Article  Google Scholar 

  • Zhang, S., Wang, H., & Huang, W. (2017). Two-stage plant species recognition by local mean clustering and Weighted sparse representation classification. Cluster Computing, 20(2), 1517–1525.

    Article  Google Scholar 

  • Zhang, S., Wang, H., Huang, W., & You, Z. (2018). Plant diseased leaf segmentation and recognition by fusion of superpixel, K-means and PHOG. Optik, 157, 866–872.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianfei Shen.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shen, J., Wang, H.H. Fusion effect of SVM in spark architecture for speech data mining in cluster structure. Int J Speech Technol 23, 481–488 (2020). https://doi.org/10.1007/s10772-020-09710-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-020-09710-1

Keywords

Navigation