Skip to main content

Construction of a Local Domain Ontology from News Stories

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5816))

Abstract

The identification of ”actionable” information in news stories has become a popular area for investigation. News presents some unique challenges for the researcher. The size constraints of a news story often require that full background information is omitted. Although this is acceptable for a human reader, it makes any form of automatic analysis difficult. Computational analysis may require some background information to provide context to news stories. There have been some attempts to identify and store background information. These approaches have tended to use an ontology to represent relationships and concepts present in the background information. The current methods of creating and populating ontologies with background information for news analysis were unsuitable for our future needs.

In this paper we present an automatic construction and population method of a domain ontology. This method produces an ontology which has the coverage of a manually created ontology and the ease of construction of the semi-automatic method. The proposed method uses a recursive algorithm which identifies relevant news stories from a corpus. For each story the algorithm tries to locate further related stories and background information. The proposed method also describes a pruning procedure which removes extraneous information from the ontology. Finally, the proposed method describes a procedure for adapting the ontology over time in response to changes in the monitored domain.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andrew, B.C.: Media-generated shortcuts: Do newspaper headlines present another roadblock for low-information rationality? The Harvard International Journal of Press/Politics 12(2), 24–43 (2007)

    Article  Google Scholar 

  2. Borsje, J., Levering, L., Frasincar, F.: Hermes: a semantic web-based news decision support system. In: The 23rd ACM Symposium on Applied Computing (SAC 2008), Special Track on Web Technologies (2008)

    Google Scholar 

  3. Emargee, E.: A .net 3.5 class library wrapper for the calais web service, http://www.codeplex.com/CalaisDotNet (consulted in 2009)

  4. Halevy, A., Norvig, P., Pereira, F.: The unreasonable effectiveness of data. Intelligent Systems, IEEE 24(2), 8–12 (2009)

    Article  Google Scholar 

  5. Levering, L., Frasincar, F., Borsje, J.: A semantic web-based approach for building personalized news services. International Journal of E-Business Research (2009)

    Google Scholar 

  6. Lloyd, L., Kechagias, D., Skiena, S.: News and blog analysis with lydia. In: 12 Internation Conference Spire, pp. 161–166 (2005)

    Google Scholar 

  7. McGuinness, D.L., van Harmelen, F., et al.: OWL Web Ontology Language Overview. W3C Recommendation 10, 2004–3 (2004)

    Google Scholar 

  8. McManus, J.: An economic theory of news selection. In: Annual Meeting for Education in Mass Media and Journalism (1988)

    Google Scholar 

  9. Newman, D., Chemudugunta, C., Smyth, P., Steyvers, M.: Analyzing entities and topics in news articles using statistical topic models. In: IEEE International Conference on Intelligence and Security Informatics (2006)

    Google Scholar 

  10. Castells, P., Perdrix, F., Pulido, E., et al.: Newspaper archives on the semantic web. In: HCI related papers of Interaccion 2004, pp. 267–276 (2004)

    Google Scholar 

  11. Reuters. Calais web service, http://opencalais.com/ (consulted in 2009)

  12. Reuters. Calais web service linked data, http://d.opencalais.com/er/company/ralg-tr1r/9e3f6c34-aa6b-3a3b-b221-a07aa7933633.html (consulted in 2009)

  13. The New York Times. New york times news service, http://developer.nytimes.com/docs/ (consulted in 2009)

  14. Vargas-Vera, M., Celjuska, D.: Event recognition on news stories and semi-automatic population of an ontology. In: Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 615–618 (2004)

    Google Scholar 

  15. Weiss, S.M., Indurkhya, N., Zhang, T., Damerau, F.: Text Mining - Predictive Methods for Analyzing Unstructured Information. Springer, Heidelberg (2005)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Drury, B., Almeida, J.J. (2009). Construction of a Local Domain Ontology from News Stories. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M. (eds) Progress in Artificial Intelligence. EPIA 2009. Lecture Notes in Computer Science(), vol 5816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04686-5_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04686-5_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04685-8

  • Online ISBN: 978-3-642-04686-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics