skip to main content
10.1145/2814864.2814870acmotherconferencesArticle/Chapter ViewAbstractPublication PagessemanticsConference Proceedingsconference-collections
research-article

Complex event extraction from real-time news streams

Published:16 September 2015Publication History

ABSTRACT

Information overload on news data is a known problem these days. People and organizations have an increasing demand for extraction of relevant information from massive amounts of news data arriving in real-time as news streams. In this paper, we present a novel approach for real-time extraction of news, based on user specifications and by using background knowledge from specific news domains. We create a powerful filtering service which limits the news data to the concrete and essential preferences of a user. In our approach, enrichment of real-time news with background knowledge is a preprocessing step. We use a Complex Event Processor to detect complex events from the enriched articles and match them to the user specified query. Each time a news article is matched, its result is notified to the user immediately. Our experimental evaluation shows that our approach is feasible for detecting news in real-time with high precision and recall.

References

  1. N. Bansal and N. Koudas. Blogscope: A system for online analysis of high volume text streams. In Proceedings of the 33rd International Conference on Very Large Data Bases, VLDB '07. VLDB Endowment, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. I. Cantador, A. Bellogín, and P. Castells. News@hand: A semantic web approach to recommending news. In Proceedings of the 5th International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, AH '08. Springer-Verlag, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. I. Cantador, A. Bellogín, and P. Castells. Ontology-based personalised and context-aware recommendations of news items. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '08. IEEE Computer Society, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. I. Cantador and P. Castells. Semantic contextualisation in a news recommender system. In Workshop on Context-Aware Recommender Systems (CARS-2009), 2009.Google ScholarGoogle Scholar
  5. S. Chakravarthy, V. Krishnaprasad, E. Anwar, and S.-K. Kim. Composite events for active databases: Semantics, contexts and detection. In VLDB '94. Morgan Kaufmann Publishers Inc., 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. S. Das, M. Datar, A. Garg, and S. Rajaram. Google news personalization: Scalable online collaborative filtering. In WWW '07. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Kopetz. Real-Time Systems: Design Principles for Distributed Embedded Applications. Kluwer Academic Publishers, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. A. Laplante. Real-Time Systems Design and Analysis: An Engineer's Handbook. IEEE Press, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann, M. Morsey, P. van Kleef, S. Auer, and C. Bizer. DBpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal, 2014.Google ScholarGoogle Scholar
  10. J. Liu, P. Dolan, and E. R. Pedersen. Personalized news recommendation based on click behavior. In Proceedings of the 15th International Conference on Intelligent User Interfaces, IUI '10. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. C. Luckham. The Power of Events: An Introduction to Complex Event Processing in Distributed Enterprise Systems. Addison-Wesley Longman Publishing Co., Inc., 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Passant and P. N. Mendes. sparqlpush: Proactive notification of data updates in rdf stores using pubsubhubbub. In SFSW, 2010.Google ScholarGoogle Scholar
  14. O. Phelan, K. McCarthy, M. Bennett, and B. Smyth. Terms of a feather: Content-based news recommendation and discovery using twitter. In Advances in Information Retrieval, Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Saif, Y. He, and H. Alani. Semantic sentiment analysis of twitter. In ISWC'12. Springer-Verlag, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Weifeng, H. Di, and C. Juan. An osgi based rfid complex event processing system. In EUC. IEEE, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Complex event extraction from real-time news streams

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            SEMANTICS '15: Proceedings of the 11th International Conference on Semantic Systems
            September 2015
            220 pages
            ISBN:9781450334624
            DOI:10.1145/2814864

            Copyright © 2015 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 16 September 2015

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            SEMANTICS '15 Paper Acceptance Rate22of97submissions,23%Overall Acceptance Rate40of182submissions,22%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader