Abstract
With rapid development of the Internet, much attention has been paid to the problem of children exposed to Internet pornography. Existing detection techniques, which mainly focus on pornography content analysis have obtained much success. However, they still meet challenges in practical Web environment due to the great computational costs and the difficulties in dealing with various pornography forms. We attempt to solve this problem from a new perspective with the wisdom of crowds in search engine click-through logs. Inspired by the idea that different pornography Web pages may be oriented by similar search keywords, a label propagation method on click-through bipartite graph is proposed which can locate pornography Web pages from a small set (a few hundreds) of manually labeled seed pages. Experiments performed on datasets collected from both English and Chinese search engines show that the proposed algorithm can identify different forms of Internet pornography both effectively and efficiently.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Guan, S.S.A., Subrahmanyam, K.: Youth internet use: risks and opportunities. Current opinion in Psychiatry 22(4), 351–356 (2009)
Ropelato, J.: Internet pornography statistics. TopTenReviews.com, internetfilter-review (2006), toptenreviews.com/internetpornographystatistics.html (accessed December 3, 2012)
Ybarra, M.L., Mitchell, K.J.: Exposure to internet pornography among children and adolescents: A national survey. CyberPsychology & Behavior 8(5), 473–486 (2005)
Goldstein, M.P.: Congress and the courts battle over the first amendment: Can the law really protect children from pornography on the internet. J. Marshall J. Computer & Info. L. 21, 141 (2002)
Lee, P.Y., Hui, S.C., Fong, A.C.M.: An intelligent categorization engine for bilingual web content filtering. IEEE Transactions on Multimedia 7(6), 1183–1190 (2005)
Ho, W.H., Watters, P.A.: Statistical and structural approaches to filtering internet pornography. In: 2004 IEEE International Conference on Systems, Man and Cybernetics, vol. 5, pp. 4792–4798. IEEE (2004)
Polpinij, J., Sibunruang, C., Paungpronpitag, S., Chamchong, R., Chotthanom, A.: A web pornography patrol system by content-based analysis: In particular text and image. In: IEEE International Conference on Systems, Man and Cybernetics, SMC 2008, pp. 500–505. IEEE (2008)
Resnick, P., Miller, J.: Pics: Internet access controls without censorship. Communications of the ACM 39(10), 87–93 (1996)
Lee, P.Y., Hui, S.C., Fong, A.C.M.: Neural networks for web content filtering. IEEE Intelligent Systems 17(5), 48–57 (2002)
Lee, L.H., Luh, C.J.: Generation of pornographic blacklist and its incremental update using an inverse chi-square based method. Information Processing & Management 44(5), 1698–1706 (2008)
Du, R., Safavi-Naini, R.: andW. Susilo. Web filtering using text classification. In: The 11th IEEE International Conference on Networks, ICON 2003, pp. 325–330. IEEE (2003)
Su, G., Li, J., Ma, Y., Li, S.: Improving the precision of the keyword-matching pornographic text filtering method using a hybrid model. Journal of Zhejiang University-Science A 5(9), 1106–1113 (2004)
Zhu, X., Ghahramani, Z.: Learning from labeled and unlabeled data with label propagation. Technical report, Technical Report CMU-CALD- 02-107, Carnegie Mellon University (2002)
Pass, G., Chowdhury, A., Torgeson, C.: A picture of search. In: Proceedings of the 1st International Conference on Scalable Information Systems, p. 1. Citeseer (2006)
Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 576–587. VLDB Endowment (2004)
Wei, C., Liu, Y., Zhang, M., Ma, S., Ru, L., Zhang, K.: Fighting against web spam: a novel propagation method based on click-through data. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012, pp. 395–404. ACM, New York (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luo, C., Liu, Y., Ma, S., Zhang, M., Ru, L., Zhang, K. (2013). Pornography Detection with the Wisdom of Crowds. In: Banchs, R.E., Silvestri, F., Liu, TY., Zhang, M., Gao, S., Lang, J. (eds) Information Retrieval Technology. AIRS 2013. Lecture Notes in Computer Science, vol 8281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45068-6_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-45068-6_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45067-9
Online ISBN: 978-3-642-45068-6
eBook Packages: Computer ScienceComputer Science (R0)