ABSTRACT
Information overload on news data is a known problem these days. People and organizations have an increasing demand for extraction of relevant information from massive amounts of news data arriving in real-time as news streams. In this paper, we present a novel approach for real-time extraction of news, based on user specifications and by using background knowledge from specific news domains. We create a powerful filtering service which limits the news data to the concrete and essential preferences of a user. In our approach, enrichment of real-time news with background knowledge is a preprocessing step. We use a Complex Event Processor to detect complex events from the enriched articles and match them to the user specified query. Each time a news article is matched, its result is notified to the user immediately. Our experimental evaluation shows that our approach is feasible for detecting news in real-time with high precision and recall.
- N. Bansal and N. Koudas. Blogscope: A system for online analysis of high volume text streams. In Proceedings of the 33rd International Conference on Very Large Data Bases, VLDB '07. VLDB Endowment, 2007. Google ScholarDigital Library
- I. Cantador, A. Bellogín, and P. Castells. News@hand: A semantic web approach to recommending news. In Proceedings of the 5th International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems, AH '08. Springer-Verlag, 2008. Google ScholarDigital Library
- I. Cantador, A. Bellogín, and P. Castells. Ontology-based personalised and context-aware recommendations of news items. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '08. IEEE Computer Society, 2008. Google ScholarDigital Library
- I. Cantador and P. Castells. Semantic contextualisation in a news recommender system. In Workshop on Context-Aware Recommender Systems (CARS-2009), 2009.Google Scholar
- S. Chakravarthy, V. Krishnaprasad, E. Anwar, and S.-K. Kim. Composite events for active databases: Semantics, contexts and detection. In VLDB '94. Morgan Kaufmann Publishers Inc., 1994. Google ScholarDigital Library
- A. S. Das, M. Datar, A. Garg, and S. Rajaram. Google news personalization: Scalable online collaborative filtering. In WWW '07. ACM, 2007. Google ScholarDigital Library
- H. Kopetz. Real-Time Systems: Design Principles for Distributed Embedded Applications. Kluwer Academic Publishers, 1997. Google ScholarDigital Library
- P. A. Laplante. Real-Time Systems Design and Analysis: An Engineer's Handbook. IEEE Press, 1992. Google ScholarDigital Library
- J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann, M. Morsey, P. van Kleef, S. Auer, and C. Bizer. DBpedia - a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal, 2014.Google Scholar
- J. Liu, P. Dolan, and E. R. Pedersen. Personalized news recommendation based on click behavior. In Proceedings of the 15th International Conference on Intelligent User Interfaces, IUI '10. ACM, 2010. Google ScholarDigital Library
- D. C. Luckham. The Power of Events: An Introduction to Complex Event Processing in Distributed Enterprise Systems. Addison-Wesley Longman Publishing Co., Inc., 2001. Google ScholarDigital Library
- C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarDigital Library
- A. Passant and P. N. Mendes. sparqlpush: Proactive notification of data updates in rdf stores using pubsubhubbub. In SFSW, 2010.Google Scholar
- O. Phelan, K. McCarthy, M. Bennett, and B. Smyth. Terms of a feather: Content-based news recommendation and discovery using twitter. In Advances in Information Retrieval, Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2011. Google ScholarDigital Library
- H. Saif, Y. He, and H. Alani. Semantic sentiment analysis of twitter. In ISWC'12. Springer-Verlag, 2012. Google ScholarDigital Library
- H. Weifeng, H. Di, and C. Juan. An osgi based rfid complex event processing system. In EUC. IEEE, 2010. Google ScholarDigital Library
Index Terms
- Complex event extraction from real-time news streams
Recommendations
Complex event processing over distributed probabilistic event streams
With the rapid development of Internet of Things (IoT), enormous events are produced every day. Complex Event Processing (CEP), which can be used to extract high level patterns from raw data, becomes the key part of the IoT middleware. In large-scale ...
Frontex real-time news event extraction framework
KDD '11: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data miningAn ever-growing amount of information relevant for early detection of certain threats can be extracted from on-line news. This led to an emergence of news mining tools to help analysts to digest the overflow of information and to extract valuable ...
High-performance complex event processing over streams
SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of dataIn this paper, we present the design, implementation, and evaluation of a system that executes complex event queries over real-time streams of RFID readings encoded as events. These complex event queries filter and correlate events to match specific ...
Comments