Skip to main content

Islamic Web Content Filtering and Categorization on Deviant Teaching

  • Conference paper
Recent Advances on Soft Computing and Data Mining

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 287))

Abstract

Currently, process for blocking the deviant teaching website is done manually by Malaysia authorities. In addition there are no Web filtering product offered to filter religion content and especially for Malay language. Web filtering can be used as protection against inappropriate and prevention of misuse of the network and hence, it can be used to filter the content of suspicious websites and alleviate the dissemination of such Web page. The purpose of the paper is to filter the deviant teachings Web page and classify them into three categories which are deviate, suspicious and clean. There are three Term Weighting Scheme techniques were used as feature selection included Term Frequency Inverse Document Frequency (TFIDF), Entropy and Modified Entropy. Support Vector Machine (SVM) will be used for classification process. As a result, M. Entropy shows the most suitable term weighting scheme to use in Islamic web pages filtering rather than TFIDF and Entropy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Heins, M., Cho, C., Goldberg, D.: Internet Filters (2006)

    Google Scholar 

  2. Foot, K.A., Schneider, S.M.: Web Campaigning, p. 263. MIT Press (2006)

    Google Scholar 

  3. Du, R., Safavi-Naini, R., Susilo, W.: Web filtering using text classification. In: The 11th IEEE International Conference on Networks, pp. 325–330 (October 2003)

    Google Scholar 

  4. Lee, Z.S.: Enhanced Feature Selection Method For Illicit Web Content Filtering. Universiti Teknologi Malaysia (2010)

    Google Scholar 

  5. Salleh, S.F.M.: Comparative Study On Term Weighting Schemes As Feature Selection Method For Malay Illicit Web Content Filtering. Universiti Teknologi Malaysia (2012)

    Google Scholar 

  6. Lee, Z.-S., Maarof, M.A., Selamat, A., Shamsuddin, S.M.: Enhance Term Weighting Algorithm as Feature Selection Technique for Illicit Web Content Classification. In: 2008 Eighth Int. Conf. Intell. Syst. Des. Appl., pp. 145–150 (November 2008)

    Google Scholar 

  7. Mazlam, N.: Enhancement of Stemming Process for Malay Illicit Web Content. Universiti Teknologi Malaysia (2012)

    Google Scholar 

  8. Sembok, T.M.T., Bakar, Z.A., Ahmad, F.: Experiments in Malay Information Retrieval. In: 2011 International Conference on Electrical Engineering and Informatics (July 2011)

    Google Scholar 

  9. Fadzli, S.A., Norsalehen, A.K., Syarilla, I.A., Hasni, H., Satar, S.D.M.: Simple rules malay stemmer. In: The International Conference on Informatics and Applications (ICIA 2012), pp. 28–35 (2012)

    Google Scholar 

  10. Cummins, R., O’Riordan, C.: Evolved term-weighting schemes in Information Retrieval: an analysis of the solution space. Artif. Intell. Rev. 26(1-2), 35–47 (2007)

    Article  Google Scholar 

  11. Yang, Y., Pederson, J.O.: A Comparative Study on Feature Selection inText Categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML 1997, pp. 412–420 (1997)

    Google Scholar 

  12. Verikas, A., Bacauskiene, M.: Feature selection with neural networks. Pattern Recognit. Lett. 23(11), 1323–1335 (2002)

    Article  MATH  Google Scholar 

  13. Liu, Y., Loh, H.T., Sun, A.: Imbalanced text classification: A term weighting approach. Expert Syst. Appl. 36(1), 690–701 (2009)

    Article  Google Scholar 

  14. Salton, G., Wong, A., Yang, C.S.: AVector Space Model for Automatic Indexing. Commun. ACM 18(11) (1975)

    Google Scholar 

  15. Selamat, A., Omatu, S.: Web page feature selection and classification using neural networks. Inf. Sci. (Ny). 158, 69–88 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nurfazrina Mohd Zamry .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Zamry, N.M., Maarof, M.A., Zainal, A. (2014). Islamic Web Content Filtering and Categorization on Deviant Teaching. In: Herawan, T., Ghazali, R., Deris, M. (eds) Recent Advances on Soft Computing and Data Mining. Advances in Intelligent Systems and Computing, vol 287. Springer, Cham. https://doi.org/10.1007/978-3-319-07692-8_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07692-8_63

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07691-1

  • Online ISBN: 978-3-319-07692-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics