Automated Country Name Disambiguation for Code Set Alignment

Richardson, Gramm

doi:10.1007/978-3-642-15464-5_66

Gramm Richardson²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6273))

Included in the following conference series:

International Conference on Theory and Practice of Digital Libraries

1619 Accesses
1 Citations

Abstract

Multiple standards and encodings for names of countries, as well as multiple renderings of the country names themselves cause problems for interoperability. This impacts both human and automated processing. This paper describes an automated method for aligning pairs of country code sets by examining the string similarity between the names of the countries in each set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bilenko, M., Mooney, R.J.: Adaptive duplicate detection using learnable string similarity measures. In: KDD 2003: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 39–48. ACM, New York (2003)
Chapter Google Scholar
Cohen, W.W., Ravikumar, P., Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. In: Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web, pp. 73–78 (2003)
Google Scholar
French, J.C., Powell, A.L., Schulman, E.: Using clustering strategies for creating authority files. Journal of the American Society for Information Science 51(8), 774–786 (2000)
Article Google Scholar
Kondrak, G.: N-gram similarity and distance. In: Consens, M.P., Navarro, G. (eds.) SPIRE 2005. LNCS, vol. 3772, pp. 115–126. Springer, Heidelberg (2005)
Chapter Google Scholar
Navarro, G.: A guided tour to approximate string matching. ACM Computing Surveys (CSUR) 33(1), 31–88 (2001)
Article Google Scholar
Piskorski, J., Sydow, M.: String distance metrics for reference matching and search query correction. In: 10th Business Information Systems Conference, pp. 353–365 (2007)
Google Scholar
Siegfried, S.L., Bernstein, J.: Synoname: Getty’s new approach to pattern matching for personal names. Computers and the Humanities 25(4), 211–226 (1991)
Article Google Scholar
van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworths, London (1979)
Google Scholar

Download references

Author information

Authors and Affiliations

U.S. Department of Defense,
Gramm Richardson

Authors

Gramm Richardson
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science, University of Glasgow, 17 Lilybank Gardens, G12 8QQ, Glasgow, UK
Mounia Lalmas & Joemon Jose &
Vienna University of Technology, 1040, Vienna, Austria
Andreas Rauber
Istituto di Scienza e Tecnologia dell’Informazione, Consiglio Nazionale delle Ricerche, Via G Moruzzi 1, 56124, Pisa, Italy
Fabrizio Sebastiani
University of Glasgow, G12 8QQ, Glasgow, Uk
Ingo Frommholz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Richardson, G. (2010). Automated Country Name Disambiguation for Code Set Alignment. In: Lalmas, M., Jose, J., Rauber, A., Sebastiani, F., Frommholz, I. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2010. Lecture Notes in Computer Science, vol 6273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15464-5_66

Download citation

DOI: https://doi.org/10.1007/978-3-642-15464-5_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15463-8
Online ISBN: 978-3-642-15464-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics