Abstract
In this paper, we present a method, that uses domain knowledge, to automatically discover and assign household identifiers to individual historical records. We apply this algorithm on a full count real census (the 1891 Canadian census) to assign household identifiers to all the records.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Statistics Canada website, URL: http://www.statcan.gc.ca/pub/11-630-x/11-630-x2015008-eng.htm.
References
Winkler, W.E.: Overview of record linkage and current research directions. Statistical Research Division Report (2006)
Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: a survey. IEEE Trans. Knowl. Data Eng. 19, 116 (2007)
On, B.W., Koudas, N., Lee, D., Srivastava, D.: Group linkage. In: IEEE International Conference on Data Engineering, pp. 496–505 (2007)
Antonie, L., Inwood, K., Lizotte, D.J., Ross, J.A.: Tracking people over time in 19th century Canada for longitudinal analysis. Mach. Learn. 95(1), 129–146 (2014)
Richards, L.: Disambiguating Multiple Links in Historical Record Linkage, MSc thesis, University of Guelph (2014)
Fu, Z., Boot, H.M., Christen, P., Zhou, J.: Automatic record linkage of individuals and households in historical census data. Int. J. Humanit. Arts Comput. 8(2), 204–225 (2014)
Ruggles, S.: Linking historical censuses: a new approach. Hist. Comput. 14(1–2), 213–224 (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Antonie, L., Grewal, G., Inwood, K., Zarti, S. (2017). Automatic Household Identification for Historical Census Data. In: Mouhoub, M., Langlais, P. (eds) Advances in Artificial Intelligence. Canadian AI 2017. Lecture Notes in Computer Science(), vol 10233. Springer, Cham. https://doi.org/10.1007/978-3-319-57351-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-57351-9_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57350-2
Online ISBN: 978-3-319-57351-9
eBook Packages: Computer ScienceComputer Science (R0)