Abstract
The identification of ”actionable” information in news stories has become a popular area for investigation. News presents some unique challenges for the researcher. The size constraints of a news story often require that full background information is omitted. Although this is acceptable for a human reader, it makes any form of automatic analysis difficult. Computational analysis may require some background information to provide context to news stories. There have been some attempts to identify and store background information. These approaches have tended to use an ontology to represent relationships and concepts present in the background information. The current methods of creating and populating ontologies with background information for news analysis were unsuitable for our future needs.
In this paper we present an automatic construction and population method of a domain ontology. This method produces an ontology which has the coverage of a manually created ontology and the ease of construction of the semi-automatic method. The proposed method uses a recursive algorithm which identifies relevant news stories from a corpus. For each story the algorithm tries to locate further related stories and background information. The proposed method also describes a pruning procedure which removes extraneous information from the ontology. Finally, the proposed method describes a procedure for adapting the ontology over time in response to changes in the monitored domain.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Andrew, B.C.: Media-generated shortcuts: Do newspaper headlines present another roadblock for low-information rationality? The Harvard International Journal of Press/Politics 12(2), 24–43 (2007)
Borsje, J., Levering, L., Frasincar, F.: Hermes: a semantic web-based news decision support system. In: The 23rd ACM Symposium on Applied Computing (SAC 2008), Special Track on Web Technologies (2008)
Emargee, E.: A .net 3.5 class library wrapper for the calais web service, http://www.codeplex.com/CalaisDotNet (consulted in 2009)
Halevy, A., Norvig, P., Pereira, F.: The unreasonable effectiveness of data. Intelligent Systems, IEEE 24(2), 8–12 (2009)
Levering, L., Frasincar, F., Borsje, J.: A semantic web-based approach for building personalized news services. International Journal of E-Business Research (2009)
Lloyd, L., Kechagias, D., Skiena, S.: News and blog analysis with lydia. In: 12 Internation Conference Spire, pp. 161–166 (2005)
McGuinness, D.L., van Harmelen, F., et al.: OWL Web Ontology Language Overview. W3C Recommendation 10, 2004–3 (2004)
McManus, J.: An economic theory of news selection. In: Annual Meeting for Education in Mass Media and Journalism (1988)
Newman, D., Chemudugunta, C., Smyth, P., Steyvers, M.: Analyzing entities and topics in news articles using statistical topic models. In: IEEE International Conference on Intelligence and Security Informatics (2006)
Castells, P., Perdrix, F., Pulido, E., et al.: Newspaper archives on the semantic web. In: HCI related papers of Interaccion 2004, pp. 267–276 (2004)
Reuters. Calais web service, http://opencalais.com/ (consulted in 2009)
Reuters. Calais web service linked data, http://d.opencalais.com/er/company/ralg-tr1r/9e3f6c34-aa6b-3a3b-b221-a07aa7933633.html (consulted in 2009)
The New York Times. New york times news service, http://developer.nytimes.com/docs/ (consulted in 2009)
Vargas-Vera, M., Celjuska, D.: Event recognition on news stories and semi-automatic population of an ontology. In: Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 615–618 (2004)
Weiss, S.M., Indurkhya, N., Zhang, T., Damerau, F.: Text Mining - Predictive Methods for Analyzing Unstructured Information. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Drury, B., Almeida, J.J. (2009). Construction of a Local Domain Ontology from News Stories. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M. (eds) Progress in Artificial Intelligence. EPIA 2009. Lecture Notes in Computer Science(), vol 5816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04686-5_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-04686-5_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04685-8
Online ISBN: 978-3-642-04686-5
eBook Packages: Computer ScienceComputer Science (R0)