skip to main content
10.1145/2335484.2335512acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
research-article

Approximate semantic matching of heterogeneous events

Published: 16 July 2012 Publication History

Abstract

Event-based systems have loose coupling within space, time and synchronization, providing a scalable infrastructure for information exchange and distributed workflows. However, event-based systems are tightly coupled, via event subscriptions and patterns, to the semantics of the underlying event schema and values. The high degree of semantic heterogeneity of events in large and open deployments such as smart cities and the sensor web makes it difficult to develop and maintain event-based systems. In order to address semantic coupling within event-based systems, we propose vocabulary free subscriptions together with the use of approximate semantic matching of events. This paper examines the requirement of event semantic decoupling and discusses approximate semantic event matching and the consequences it implies for event processing systems. We introduce a semantic event matcher and evaluate the suitability of an approximate hybrid matcher based on both thesauri-based and distributional semantics-based similarity and relatedness measures. The matcher is evaluated over a structured representation of Wikipedia and Freebase events. Initial evaluations show that the approach matches events with a maximal combined precision-recall F1 score of 75.89% on average in all experiments with a subscription set of 7 subscriptions. The evaluation shows how a hybrid approach to semantic event matching outperforms a single similarity measure approach.

References

[1]
Atzori, L., Iera, A., and Morabito, G. The internet of things: A survey. Computer Networks 54, 15 (2010), 2787--2805.
[2]
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., and Ives, Z. DBpedia: A Nucleus for a Web of Open Data. The Semantic Web 4825, (2007), 722--735.
[3]
Aumueller, D., Do, H.-H., Massmann, S., and Rahm, E. Schema and ontology matching with COMA++. Proceedings of the 2005 ACM SIGMOD international conference on Management of data, ACM (2005), 906--908.
[4]
Belkin, N. J. and Croft, W. B. Information filtering and information retrieval: two sides of the same coin? Commun. ACM 35, 12 (1992), 29--38.
[5]
Berners-Lee, T. Linked Data- Design Issues. 2006. http://www.w3.org/DesignIssues/LinkedData.html.
[6]
Botts, M., Percivall, G., Reed, C., and Davidson, J. OGC sensor web enablement: Overview and high level architecture. GeoSensor networks, (2008), 175--190.
[7]
Budanitsky, A. and Hirst, G. Evaluating wordnet-based measures of lexical semantic relatedness. Computational Linguistics 32, 1 (2006), 13--47.
[8]
Curry, E., Hasan, S., Hassan, U. ul, Herstand, M., and O'Riain, S. An Entity-Centric Approach to Green Information Systems. The 19th European Conference on Information Systems (ECIS), (2011).
[9]
Drosou, M., Stefanidis, K., and Pitoura, E. Preference-aware publish/subscribe delivery with diversity. Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, (2009), 6:1--6:12.
[10]
Eugster, P. T., Felber, P. A., Guerraoui, R., and Kermarrec, A. M. The many faces of publish/subscribe. ACM Computing Surveys (CSUR) 35, 2 (2003), 114--131.
[11]
Freitas, A., Curry, E., J. G., O., and O'Riain, S. Querying Heterogeneous Datasets on the Linked Data Web: Challenges, Approaches and Trends. IEEE Internet Computing Special Issue on Internet Scale Data 16, 1 (2012), 24--33.
[12]
Gabrilovich, E. and Markovitch, S. Computing semantic relatedness using wikipedia-based explicit semantic analysis. Proceedings of the 20th international joint conference on Artifical intelligence, (2007), 1606--1611.
[13]
Hasan, S., Curry, E., Banduk, M., and O'Riain, S. Toward Situation Awareness for the Semantic Sensor Web: Complex Event Processing with Dynamic Linked Data Enrichment. the 4th International Workshop on Semantic Sensor Networks 2011 (SSN11), (2011), 60--72.
[14]
Hinze, A., Sachs, K., and Buchmann, A. Event-based applications and enabling technologies. Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, ACM (2009), 1:1--1:15.
[15]
Jiang, J. J. and Conrath, D. W. Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. Proceedings of the International Conference on Re- search in Computational Linguistic, (1998).
[16]
Klyne, G. and Carroll, J. J. Resource Description Framework (RDF): Concepts and Abstract Syntax. 2004. http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/.
[17]
Liu, H. and Jacobsen, H.-A. A-TOPSS: a publish/subscribe system supporting approximate matching. Proceedings of the 28th international conference on Very Large Data Bases, VLDB Endowment (2002), 1107--1110.
[18]
Luckham, D. C. The power of events: an introduction to complex event processing in distributed enterprise systems. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 2002.
[19]
Machanavajjhala, A., Vee, E., Garofalakis, M., and Shanmugasundaram, J. Scalable ranked publish/subscribe. Proc. VLDB Endow. 1, 1 (2008), 451--462.
[20]
Manning, C. D., Raghavan, P., and Schutze, H. Introduction to information retrieval. Cambridge University Press Cambridge, 2008.
[21]
Miller, G. A. WordNet: a lexical database for English. Commun. ACM 38, 11 (1995), 39--41.
[22]
OECD. Machine-to-Machine Communications: Connecting Billions of Devices. OECD Digital Economy Papers No. 192, 2012.
[23]
Patwardhan, S. and Pedersen, T. Using WordNet-based context vectors to estimate the semantic relatedness of concepts. Proceedings of the EACL 2006 Workshop Making Sense of Sense-Bringing Computational Linguistics and Psycholinguistics Together, (2006), 1--8.
[24]
Petrovic, M., Burcea, I., and Jacobsen, H.-A. S-ToPSS: semantic Toronto publish/subscribe system. Proceedings of the 29th international conference on Very large data bases - Volume 29, VLDB Endowment (2003), 1101--1104.
[25]
Pripužić, K., Žarko, I. P., and Aberer, K. Top-k/w publish/subscribe: finding k most relevant publications in sliding time window w. Proceedings of the second international conference on Distributed event-based systems, ACM (2008), 127--138.
[26]
Prud'Hommeaux, E. and Seaborne, A. SPARQL query language for RDF. W3C working draft 4, January (2008).
[27]
Rada, R., Mili, H., Bicknell, E., and Blettner, M. Development and application of a metric on semantic nets. Systems, Man and Cybernetics, IEEE Transactions on 19, 1 (1989), 17--30.
[28]
Wasserkrug, S., Gal, A., Etzion, O., and Turchin, Y. Complex event processing over uncertain data. Proceedings of the second international conference on Distributed event-based systems, (2008), 253--264.
[29]
Zhang, W., Ma, J., and Ye, D. FOMatch: A Fuzzy Ontology-Based Semantic Matching Algorithm of Publish/Subscribe Systems. Computational Intelligence for Modelling Control & Automation, 2008 International Conference on, (2008), 111--117.

Cited By

View all
  • (2022)Multimodal Event Processing: A Neural-Symbolic Paradigm for the Internet of Multimedia ThingsIEEE Internet of Things Journal10.1109/JIOT.2022.31431719:15(13705-13724)Online publication date: 1-Aug-2022
  • (2022)FEDARGOS-V1: A Monitoring Architecture for Federated Cloud Computing InfrastructuresIEEE Access10.1109/ACCESS.2022.323162210(133557-133573)Online publication date: 2022
  • (2022)Weighted propositional configuration logicsInformation and Computation10.1016/j.ic.2020.104647282:COnline publication date: 1-Jan-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DEBS '12: Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems
July 2012
410 pages
ISBN:9781450313155
DOI:10.1145/2335484
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 July 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. approximate event matching
  2. semantic decoupling
  3. semantic event matching

Qualifiers

  • Research-article

Funding Sources

Conference

DEBS '12

Acceptance Rates

Overall Acceptance Rate 145 of 583 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Multimodal Event Processing: A Neural-Symbolic Paradigm for the Internet of Multimedia ThingsIEEE Internet of Things Journal10.1109/JIOT.2022.31431719:15(13705-13724)Online publication date: 1-Aug-2022
  • (2022)FEDARGOS-V1: A Monitoring Architecture for Federated Cloud Computing InfrastructuresIEEE Access10.1109/ACCESS.2022.323162210(133557-133573)Online publication date: 2022
  • (2022)Weighted propositional configuration logicsInformation and Computation10.1016/j.ic.2020.104647282:COnline publication date: 1-Jan-2022
  • (2021)Investigating response time and accuracy in online classifier learning for multimedia publish-subscribe systemsMultimedia Tools and Applications10.1007/s11042-020-10277-xOnline publication date: 9-Jan-2021
  • (2021)Business Intelligence DimensionsBusiness Intelligence10.1007/978-3-030-67032-0_4(51-80)Online publication date: 9-Mar-2021
  • (2020)Object Detection for Unseen Domains while Reducing Response Time using Knowledge Transfer in Multimedia Event ProcessingProceedings of the 2020 International Conference on Multimedia Retrieval10.1145/3372278.3391936(373-377)Online publication date: 8-Jun-2020
  • (2020)Die Anwendung von Machine Learning zur Gewinnung von Erkenntnissen aus DokumentenstapelnKünstliche Intelligenz in Wirtschaft & Gesellschaft10.1007/978-3-658-29550-9_15(275-295)Online publication date: 6-Oct-2020
  • (2020)Weighted PCL over Product Valuation MonoidsCoordination Models and Languages10.1007/978-3-030-50029-0_19(301-319)Online publication date: 10-Jun-2020
  • (2019)On combining probabilistic and semantic similarity-based methods toward off-domain reasoning for situational awareness2019 22th International Conference on Information Fusion (FUSION)10.23919/FUSION43075.2019.9011197(1-8)Online publication date: Jul-2019
  • (2019)A Design of Cyber-Physical System Architecture for Smart CityRecent Trends in Intelligent Computing, Communication and Devices10.1007/978-981-13-9406-5_116(967-973)Online publication date: 2-Oct-2019
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media