Abstract
Newspaper archives are a fundamental working tool for editorial teams. Their exploitation in digital format through the web, and the provision of technology to make this possible, are also important businesses today. The volume of archive contents, and the complexity of human teams that create and maintain them, give rise to diverse management difficulties. We propose the introduction of the emergent semantic-based technologies to improve the processes of creation, maintenance, and exploitation of the digital archive of a newspaper. We describe a platform based on these technologies, that consists of a) a knowledge base associated to the newspaper archive, based on an ontology for the description of journalistic information, b) a semantic search module, and c) a module for content browsing and visualisation based on ontologies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
NewsLibrary, the world’s largest news archive, http://www.newslibrary.com
The British Library, the world’s knowledge, http://www.bl.uk
Baumgart, J.: U. S. Newspaper Archives on the Web, Available at http://www.ibiblio.org/slanews/internet/archives.html
NewsViews Solutions, http://www.newsviewsolutions.com
ActivePaper Archive by Olive Software, http://www.active-paper.com/ap_aparchive.html
LexisNexis for law, public records, company data, government, academic and business news sources, http://www.lexisnexis.com
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American (2001)
IPTC Subject Reference System & NewsML Topicsets, http://www.iptc.org/metadata
Milenium Arcano by Protec, http://www.mileniumcrossmedia.com/Arcano/Arcano.htm
El País - el archivo - Hemeroteca, http://www.elpais.es/archivo/hemeroteca.html
ProQuest Historical Newspapers, http://www.il.proquest.com/products/pt-product-HistNews.shtml
ArchiveIQue by Baseview, http://www.baseview.com/products/archiveique.html
Canto - Digital Asset Management with Cumuluc, http://www.canto.com
DC4, The Digital Collections System, http://www.digitalcollections.biz/dc4.asp
Lassila, O., Swick, R.R.: Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation, February 22(1999), Available at http://www.w3.org/TR/REC-rdf-syntax
Noy, N.F., McGuinness, D.L.: Ontology Development 101: A Guide to Creating Your First Ontology. Stanford Knowledge Systems Laboratory Technical Report KSL-01-05 and Stanford Medical Informatics Technical Report SMI-2001-0880 (2001)
IPTC NewsML, http://www.newsml.org
IPTC News Industry Text Format (NITF), A Solution for Sharing News, http://www.nitf.org
XMLNews, XML and the News Industry, http://www.xmlnews.org
Publishing Requirements for Industry Standard Metadata (PRISM), http://www.prismstandard.org
Noy, N.F., Sintek, M., Decker, C.M., Fergerson, R.W., Musen, M.A.: Creating Semantic Web Contents with Protege-2000. IEEE Intelligent Systems 16(2), 60–71 (2001)
Guha, R., McCool, R., Miller, E.: Semantic search. In: 12th International World Wide Web Conference (WWW 2003), Budapest, Hungary, pp. 700–709 (2003)
Shah, U., Finin, T., Joshi, A., Cost, R.S., Mayfield, J.: Information Retrieval on the Semantic Web. In: 10th International Conference on Information and Knowledge Management (2002)
Contreras, J., Benjamins, V.R., Prieto, J.A., Patón, D., Losada, S., González, D.: Duontology: an Approach to Semantic Portals based on a Domain and Visualisation Ontology. KTWeb, http://www.drecommerce.com/doc/Benjamins-Duontology-a.pdf
Haustein, S., Pleumann, J.: Is participation in the semantic web too difficult? In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, p. 448. Springer, Heidelberg (2002)
Bizer, C.: D2R MAP - A Database to RDF Mapping Language. In: 12th International World Wide Web Conference (WWW 2003), Budapest, Hungary (2003)
Jena 2 – A Semantic Web Framework, http://www.hpl.hp.com/semweb/jena2.htm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Castells, P. et al. (2004). Neptuno: Semantic Web Technologies for a Digital Newspaper Archive. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds) The Semantic Web: Research and Applications. ESWS 2004. Lecture Notes in Computer Science, vol 3053. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25956-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-540-25956-5_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21999-6
Online ISBN: 978-3-540-25956-5
eBook Packages: Springer Book Archive