Abstract
We present a method to detect automatically pornographic content on the Web. Our method combines techniques from language engineering and image analysis within a machine-learning framework. Experimental results show that it achieves nearly perfect performance on a set of hard cases.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
See http://websearch.about.com/internet/websearch/library/myths for related statistics.
Consult [4] for an evaluation of third-party and self-regulating rating schemes.
An evaluation copy of a commercial product included a ~12,000 strong URL blacklist which contained only 36 out of the 500 pornographic URLs we easily summoned in a day.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R.O. Duda and P.E. Hart. Bayes Decision Theory. Pattern Classification and Scene Analysis, pp. 10–43. John Wiley, 1973.
D. Forsyth. Finding Naked People. Proc. of the 4th European Conference on Computer Vision, Cambridge, England, 1996.
T.M. Mitchell. Bayesian Learning. Machine Learning, pp.154–200. McGraw-Hill, 1997.
Clairview Internet Sheriff, An independent review. Electronic Frontiers Australia. http://www.efa.org.au/Publish/report_isheriff.html
Platform for Internet Content Selection (PICS). http://www.w3.org/PICS
Internet Content Rating Association (ICRA). http://www.icra.org
SafeSurf. http://www.safesurf.com/
P. Greenfield. Technical Aspects of Blocking Internet Content. National Office of the Information Economy, Australia, 1999. http://www.noie.gov.au
T. Minka. An Image Database Browser that Learns from User Interaction. MIT Media Lab TR 365, 1995.
T.K. Leung, M.C. Burl, and P. Perona. Finding Faces in Cluttered Scenes Using Random Labeled Graph Matching. Proc. of the International Conference on Computer Vision, pp. 63–644, 1995.
C.D. Manning and H. Schutze. Foundations of Statistical Natural Language Processing. MIT Press, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chandrinos, K.V., Androutsopoulos, I., Paliouras, G., Spyropoulos, C.D. (2000). Automatic Web Rating: Filtering Obscene Content on the Web. In: Borbinha, J., Baker, T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2000. Lecture Notes in Computer Science, vol 1923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45268-0_50
Download citation
DOI: https://doi.org/10.1007/3-540-45268-0_50
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41023-2
Online ISBN: 978-3-540-45268-3
eBook Packages: Springer Book Archive