Skip to main content

Knowledge-Based Entity Resolution with Contextual Information Defined over a Monoid

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 9344))

Abstract

Entity resolution (aka record linkage) addresses the problem to decide whether two entity representations in a database or stream correspond to the same real-world object. Knowledge-based entity resolution is grounded in knowledge patterns, which combine rules defined by Horn clauses with conditions prescribing when the rule is applicable, and conditions specifying when the application of the rule is not permitted. So far, these positive and negative conditions are expressed as bindings of the variables appearing in the Horn clause. In this paper the condition part of a knowledge pattern is generalised to a context, which is still defined by a positive and a negative part, but for both equations involving operators are permitted. The paper concentrates on conditions over a monoid for the constraints in a context. With this generalisation standard properties of knowledge patterns such as minimality, containment and optimality are investigated, which altogether minimise redundancy and thus optimise the inference of equivalences between entities.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Arasu, A., Götz, M., Kaushik, R.: On active learning of record matching packages. Proceed. SIGMOD 2010, 783–794 (2010)

    Google Scholar 

  2. Bhattacharya, I., Getoor, L.: Collective entity resolution in relational data. TKDD 1(1), 5 (2007)

    Article  Google Scholar 

  3. Christen, P.: Data Matching. Springer, Heidelberg (2012)

    Book  Google Scholar 

  4. Cohen, W.W.: Data integration using similarity joins and a word-based information representation language. ACM TOIS 18(3), 288–321 (2000)

    Article  Google Scholar 

  5. Dong, X., Halevy, A., Madhavan, J.: Reference reconciliation in complex information spaces. Proceed. SIGMOD 2005, 85–96 (2005)

    Google Scholar 

  6. Fellegi, I.P., Sunter, A.B.: A theory for record linkage. J. Am. Statist. Assoc. 64(328), 1183–1210 (1969)

    Article  Google Scholar 

  7. Jaffar, J.: Minimal and complete word unification. J. ACM 37, 47–85 (1990)

    Article  MathSciNet  MATH  Google Scholar 

  8. Makanin, G.: The problem of solvability of equations in a free semigroup. Math. USSR Sb. 32, 129–198 (1977)

    Article  MATH  Google Scholar 

  9. Newcombe, H., Kennedy, J.: Record linkage: making maximum use of the discriminating power of identifying information. Commun. ACM 5(11), 563–566 (1962)

    Article  Google Scholar 

  10. Sarawagi, S., Bhamidipaty, A.: Interactive deduplication using active learning. In: KDD, pp. 269–278 (2002)

    Google Scholar 

  11. Schewe, K.D., Wang, Q.: Knowledge-aware identity services. Knowl. Inf. Syst. 36(2), 335–357 (2013)

    Article  Google Scholar 

  12. Schewe, K.D., Wang, Q.: A theoretical framework for knowledge-based entity resolution. Theor. Comput. Sci. 549, 101–126 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  13. Tejada, S., Knoblock, C.A., Minton, S.: Learning object identification rules for information integration. Inf. Syst. 26(8), 607–633 (2001)

    Article  MATH  Google Scholar 

  14. Verykios, V., Moustakides, G., Elfeky, M.: A bayesian decision model for cost optimal record matching. VLDB J. 12(1), 28–40 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Klaus-Dieter Schewe .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Schewe, KD., Wang, Q., Rady, M. (2015). Knowledge-Based Entity Resolution with Contextual Information Defined over a Monoid. In: Bellatreche, L., Manolopoulos, Y. (eds) Model and Data Engineering. Lecture Notes in Computer Science(), vol 9344. Springer, Cham. https://doi.org/10.1007/978-3-319-23781-7_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-23781-7_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-23780-0

  • Online ISBN: 978-3-319-23781-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics