Skip to main content

Pornography Detection with the Wisdom of Crowds

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8281))

Abstract

With rapid development of the Internet, much attention has been paid to the problem of children exposed to Internet pornography. Existing detection techniques, which mainly focus on pornography content analysis have obtained much success. However, they still meet challenges in practical Web environment due to the great computational costs and the difficulties in dealing with various pornography forms. We attempt to solve this problem from a new perspective with the wisdom of crowds in search engine click-through logs. Inspired by the idea that different pornography Web pages may be oriented by similar search keywords, a label propagation method on click-through bipartite graph is proposed which can locate pornography Web pages from a small set (a few hundreds) of manually labeled seed pages. Experiments performed on datasets collected from both English and Chinese search engines show that the proposed algorithm can identify different forms of Internet pornography both effectively and efficiently.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Guan, S.S.A., Subrahmanyam, K.: Youth internet use: risks and opportunities. Current opinion in Psychiatry 22(4), 351–356 (2009)

    Article  Google Scholar 

  2. Ropelato, J.: Internet pornography statistics. TopTenReviews.com, internetfilter-review (2006), toptenreviews.com/internetpornographystatistics.html (accessed December 3, 2012)

  3. Ybarra, M.L., Mitchell, K.J.: Exposure to internet pornography among children and adolescents: A national survey. CyberPsychology & Behavior 8(5), 473–486 (2005)

    Article  Google Scholar 

  4. Goldstein, M.P.: Congress and the courts battle over the first amendment: Can the law really protect children from pornography on the internet. J. Marshall J. Computer & Info. L. 21, 141 (2002)

    Google Scholar 

  5. Lee, P.Y., Hui, S.C., Fong, A.C.M.: An intelligent categorization engine for bilingual web content filtering. IEEE Transactions on Multimedia 7(6), 1183–1190 (2005)

    Article  Google Scholar 

  6. Ho, W.H., Watters, P.A.: Statistical and structural approaches to filtering internet pornography. In: 2004 IEEE International Conference on Systems, Man and Cybernetics, vol. 5, pp. 4792–4798. IEEE (2004)

    Google Scholar 

  7. Polpinij, J., Sibunruang, C., Paungpronpitag, S., Chamchong, R., Chotthanom, A.: A web pornography patrol system by content-based analysis: In particular text and image. In: IEEE International Conference on Systems, Man and Cybernetics, SMC 2008, pp. 500–505. IEEE (2008)

    Google Scholar 

  8. Resnick, P., Miller, J.: Pics: Internet access controls without censorship. Communications of the ACM 39(10), 87–93 (1996)

    Article  Google Scholar 

  9. Lee, P.Y., Hui, S.C., Fong, A.C.M.: Neural networks for web content filtering. IEEE Intelligent Systems 17(5), 48–57 (2002)

    Article  Google Scholar 

  10. Lee, L.H., Luh, C.J.: Generation of pornographic blacklist and its incremental update using an inverse chi-square based method. Information Processing & Management 44(5), 1698–1706 (2008)

    Article  Google Scholar 

  11. Du, R., Safavi-Naini, R.: andW. Susilo. Web filtering using text classification. In: The 11th IEEE International Conference on Networks, ICON 2003, pp. 325–330. IEEE (2003)

    Google Scholar 

  12. Su, G., Li, J., Ma, Y., Li, S.: Improving the precision of the keyword-matching pornographic text filtering method using a hybrid model. Journal of Zhejiang University-Science A 5(9), 1106–1113 (2004)

    Article  Google Scholar 

  13. Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. Technical report, Technical Report CMU-CALD- 02-107, Carnegie Mellon University (2002)

    Google Scholar 

  14. Pass, G., Chowdhury, A., Torgeson, C.: A picture of search. In: Proceedings of the 1st International Conference on Scalable Information Systems, p. 1. Citeseer (2006)

    Google Scholar 

  15. Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 576–587. VLDB Endowment (2004)

    Google Scholar 

  16. Wei, C., Liu, Y., Zhang, M., Ma, S., Ru, L., Zhang, K.: Fighting against web spam: a novel propagation method based on click-through data. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012, pp. 395–404. ACM, New York (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Luo, C., Liu, Y., Ma, S., Zhang, M., Ru, L., Zhang, K. (2013). Pornography Detection with the Wisdom of Crowds. In: Banchs, R.E., Silvestri, F., Liu, TY., Zhang, M., Gao, S., Lang, J. (eds) Information Retrieval Technology. AIRS 2013. Lecture Notes in Computer Science, vol 8281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45068-6_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-45068-6_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-45067-9

  • Online ISBN: 978-3-642-45068-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics