Skip to main content

The Use of Uncertainty to Choose Matching Variables in Statistical Matching

  • Conference paper
  • First Online:
Soft Methods for Data Science (SMPS 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 456))

Included in the following conference series:

Abstract

Statistical matching aims at combining information available in distinct sample surveys referred to the same target population. The matching is usually based on a set of common variables shared by the available data sources. For matching purposes just a subset of all the common variables should be used, the so called matching variables. The paper presents a novel method for selecting the matching variables based on the analysis of the uncertainty characterizing the matching framework. The uncertainty is caused by unavailability of data for estimating parameters describing the association/correlation between variables not jointly observed in a single data source. The paper focuses on the case of categorical variables and presents a sequential procedure for identifying the most effective subset of common variables in reducing the overall uncertainty.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agresti A (2013) Categorical data analysis, 3rd edn. Wiley, New York

    MATH  Google Scholar 

  2. Agresti A, Yang MC (1987) An empirical investigation of some effects of sparseness in contingency tables. Comput Stat Data Anal 5:9–21

    Article  MATH  Google Scholar 

  3. Bishop YM, Fienberg SE, Holland PW (1975) Discrete Multivariate Analysis: Theory and Practice. MIT. Press, Cambridge, MA. Paperback edition

    Google Scholar 

  4. Cohen ML (1991) Statistical matching and microsimulation models. In: Citro, H (ed) Improving information for social policy decisions: The uses of microsimulation modeling, vol II Technical papers, Washington D.C

    Google Scholar 

  5. Conti PL, Marella D, Scanu M (2012) Uncertainty analysis in statistical matching. J Official Stat 28:69–88

    Google Scholar 

  6. D’Orazio M, Di Zio M, Scanu M (2006) Statistical matching: theory and practice. Wiley, Chichester

    Book  MATH  Google Scholar 

  7. D’Orazio M (2016) StatMatch: statistical matching (aka data fusion). R package version 1.2.4 http://CRAN.R-project.org/package=StatMatch

  8. Särndal CE, Swensson B, Wretman J (1992) Model assisted survey sampling. Springer, New York

    Book  MATH  Google Scholar 

  9. Vantaggi B (2008) Statistical matching of multiple sources: a look through coherence. In J Approximate Reasoning 49:701–711

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marco Di Zio .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this paper

Cite this paper

D’Orazio, M., Di Zio, M., Scanu, M. (2017). The Use of Uncertainty to Choose Matching Variables in Statistical Matching. In: Ferraro, M., et al. Soft Methods for Data Science. SMPS 2016. Advances in Intelligent Systems and Computing, vol 456. Springer, Cham. https://doi.org/10.1007/978-3-319-42972-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42972-4_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42971-7

  • Online ISBN: 978-3-319-42972-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics