Bigram Based Deep Neural Network for Extremism Detection in Online User Generated Contents in the Kazakh Language

Mussiraliyeva, Shynar; Omarov, Batyrkhan; Bolatbek, Milana; Bagitova, Kalamkas; Alimzhanova, Zhanna

doi:10.1007/978-3-030-88113-9_45

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1463))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1204 Accesses
2 Citations

Abstract

Countering the spread of aggressive information and extremism in the global network is an urgent problem of society and government agencies, which is solved in particular by filtering unwanted Internet resources. A necessary condition for such filtering is the classification of the content of websites, texts and documents of the information flow. Therefore, an urgent problem of information technologies is the classification of texts in natural languages in order to detect extremist texts, such as calls for extremism and other messages that threaten the security of citizens.

Therefore, our research examines the detection of extremist messages in online content in the Kazakh language. To do this, we have collected a corpus of extremist texts from open sources, developed a deep neural network based on bigrams for detecting extremist texts in the Kazakh language. The proposed model has shown high efficiency in comparison with classical methods of machine learning and deep learning.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Applying Deep Learning for Extremism Detection

Use of Natural Language Processing to Identify Inappropriate Content in Text

Machine Learning for Identifying Abusive Content in Text Data

References

Diez-Olivan, A., Del Ser, J., Galar, D., Sierra, B.: Data fusion and machine learning for industrial prognosis: trends and perspectives towards Industry 4.0. Inf. Fusion 50, 92–111 (2019)
Article Google Scholar
Da Li, X., Duan, L.: Big data for cyber physical systems in industry 4.0: a survey. Enterp. Inf. Syst. 13(2), 148–169 (2019). https://doi.org/10.1080/17517575.2018.1442934
Article Google Scholar
Kendzhaeva, B., Omarov, B., Abdiyeva, G., Anarbayev, A., Dauletbek, Y., Omarov, B.: Providing safety for citizens and tourists in cities: a system for detecting anomalous sounds. In: Luhach, A.K., Jat, D.S., Ghazali, K.H.B., Gao, X.-Z., Lingras, P. (eds.) Advanced Informatics for Computing Research: 4th International Conference, ICAICR 2020, Gurugram, India, December 26–27, 2020, Revised Selected Papers, Part I, pp. 264–273. Springer Singapore, Singapore (2021). https://doi.org/10.1007/978-981-16-3660-8_25
Chapter Google Scholar
Sinha, S., Basak, S., Dey, Y., Mondal, A.: An educational Chatbot for answering queries. In: Mandal, J.K., Bhattacharya, D. (eds.) Emerging Technology in Modelling and Graphics. AISC, vol. 937, pp. 55–60. Springer, Singapore (2020). https://doi.org/10.1007/978-981-13-7403-6_7
Chapter Google Scholar
Ait-Mlouk, A., Jiang, L.: KBot: a knowledge graph based chatBot for natural language understanding over linked data. IEEE Access 8, 149220–149230 (2020)
Article Google Scholar
Alshemali, B., Kalita, J.: Improving the reliability of deep neural networks in NLP: a review. Knowle. Based Syst. 191, 105210 (2020). KBot: a Knowledge graph based chatBot for natural language understanding over linked data
Article Google Scholar
Manogaran, G., Varatharajan, R., Priyan, M.K.: Hybrid recommendation system for heart disease diagnosis based on multiple kernel learning with adaptive neuro-fuzzy inference system. Multimedia Tools Appl. 77(4), 4379–4399 (2017). https://doi.org/10.1007/s11042-017-5515-y
Article Google Scholar
Mussiraliyeva, S., Bolatbek, M., Omarov, B., Bagitova, K.: Detection of extremist ideation on social media using machine learning techniques. In: Nguyen, N.T., Hoang, B.H., Huynh, C.P., Hwang, D., Trawiński, B., Vossen, G. (eds.) Computational Collective Intelligence: 12th International Conference, ICCCI 2020, Da Nang, Vietnam, November 30 – December 3, 2020, Proceedings, pp. 743–752. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-63007-2_58
Chapter Google Scholar
Altınel, B., Ganiz, M.C.: Semantic text classification: a survey of past and recent advances. Inf. Process. Manage. 54(6), 1129–1153 (2018)
Article Google Scholar
Murzamadieva, M., Ivashov, A., Omarov, B., Omarov, B., Kendzhayeva, B., Abdrakhmanov, R.: Development of a system for ensuring humidity in sport complexes. In: 2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence), pp. 530–535. IEEE (2021)
Google Scholar
Sinoara, R.A., Camacho-Collados, J., Rossi, R.G., Navigli, R., Rezende, S.O.: Knowledge-enhanced document embeddings for text classification. Knowl. Based Syst. 163, 955–971 (2019)
Article Google Scholar
Ahmad, S., Asghar, M.Z., Alotaibi, F.M. et al.: Detection and classification of social media-based extremist affiliations using sentiment analysis techniques. Human-centric Comput. Inf. Sci. 9(24) (2019). https://doi.org/10.1186/s13673-019-0185-6
Salminen, J., Hopf, M., Chowdhury, S.A., Jung, S.-G., Almerekhi, H., Jansen, B.J.: Developing an online hate classifier for multiple social media platforms. Human-centric Comput. Inf. Sci. 10(1), 1–34 (2020). https://doi.org/10.1186/s13673-019-0205-6
Article Google Scholar
Johnston, A., Marku, A.: Identifying extremism in text using deep learning. In: Pedrycz, W., Chen, S.-M. (eds.) Development and Analysis of Deep Learning Architectures. SCI, vol. 867, pp. 267–289. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-31764-5_10
Chapter Google Scholar
Duwairi, R., Hayajneh, A., Quwaider, M.: A deep learning framework for automatic detection of hate speech embedded in Arabic tweets. Arab. J. Sci. Eng. 46(4), 4001–4014 (2021). https://doi.org/10.1007/s13369-021-05383-3
Article Google Scholar
Sharif, O., Hoque, M.M., Kayes, A.S.M., Nowrozy, R., Sarker, I.H.: Detecting suspicious texts using machine learning techniques. Appl. Sci. 10(18), 6527 (2020). https://doi.org/10.3390/app10186527
Article Google Scholar
Armaan, K, Saini, J.K., Bansal, D.: Detecting radical text over online media using deep learning. Comput. Sci. Math. ArXiv abs/1907.12368 (2019)
Google Scholar
Vk.com – Vkontakte Social Network
Google Scholar
Huang, F., Zhang, S., Zhang, J., Yu, G.: Multimodal learning for topic sentiment analysis in microblogging. Neurocomputing 253, 144–153 (2017). https://doi.org/10.1016/j.neucom.2016.10.086
Article Google Scholar
https://vk.com/dev/methods
Kadhim, A.I.: Survey on supervised machine learning techniques for automatic text classification. Artif. Intell. Rev. 52(1), 273–292 (2019). https://doi.org/10.1007/s10462-018-09677-1
Article MathSciNet Google Scholar
Sun, S., Cao, Z., Zhu, H., Zhao, J.: A survey of optimization methods from a machine learning perspective. IEEE Trans. Cybern. 50(8), 3668–3681 (2019)
Article Google Scholar
Khan, F.A., Ibrahim, A.A., Rais, M.S., Rajpoot, P., Khan, A., Akhtar, M.N.: Performance analysis of supervised learning algorithms based on classification approach. In: 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), pp. 1–6. IEEE (2019). DOI: https://doi.org/10.1109/ICETAS48360.2019.9117394

Download references

Author information

Authors and Affiliations

Al-Farabi Kazakh National University, Almaty, Kazakhstan
Shynar Mussiraliyeva, Batyrkhan Omarov, Milana Bolatbek, Kalamkas Bagitova & Zhanna Alimzhanova

Authors

Shynar Mussiraliyeva
View author publications
You can also search for this author in PubMed Google Scholar
Batyrkhan Omarov
View author publications
You can also search for this author in PubMed Google Scholar
Milana Bolatbek
View author publications
You can also search for this author in PubMed Google Scholar
Kalamkas Bagitova
View author publications
You can also search for this author in PubMed Google Scholar
Zhanna Alimzhanova
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Wrocław University of Science and Technology, Wrocław, Poland
Krystian Wojtkiewicz
VU Amsterdam, Amsterdam, The Netherlands
Jan Treur
University of the West of England, Bristol, UK
Elias Pimenidis
Wrocław University of Science and Technology, Wrocław, Poland
Marcin Maleszka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mussiraliyeva, S., Omarov, B., Bolatbek, M., Bagitova, K., Alimzhanova, Z. (2021). Bigram Based Deep Neural Network for Extremism Detection in Online User Generated Contents in the Kazakh Language. In: Wojtkiewicz, K., Treur, J., Pimenidis, E., Maleszka, M. (eds) Advances in Computational Collective Intelligence. ICCCI 2021. Communications in Computer and Information Science, vol 1463. Springer, Cham. https://doi.org/10.1007/978-3-030-88113-9_45

Download citation

DOI: https://doi.org/10.1007/978-3-030-88113-9_45
Published: 27 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88112-2
Online ISBN: 978-3-030-88113-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Bigram Based Deep Neural Network for Extremism Detection in Online User Generated Contents in the Kazakh Language

Abstract

Access this chapter

Similar content being viewed by others

Applying Deep Learning for Extremism Detection

Use of Natural Language Processing to Identify Inappropriate Content in Text

Machine Learning for Identifying Abusive Content in Text Data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Bigram Based Deep Neural Network for Extremism Detection in Online User Generated Contents in the Kazakh Language

Abstract

Access this chapter

Similar content being viewed by others

Applying Deep Learning for Extremism Detection

Use of Natural Language Processing to Identify Inappropriate Content in Text

Machine Learning for Identifying Abusive Content in Text Data

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation