Skip to main content

Uplift Modeling

  • Reference work entry
  • First Online:
Encyclopedia of Machine Learning and Data Mining
  • 260 Accesses

Abstract

Uplift modeling is a machine learning technique which aims at predicting, on the level of individuals, the gain from performing a given action with respect to refraining from taking it. Examples include medical treatments and direct marketing campaigns where the rate of spontaneous recovery and the background purchase rate need to be taken into account to assess the true gains from taking an action. Uplift modeling addresses this problem by using two training sets: the treatment dataset containing data on objects on which the action has been taken and the control dataset containing data on objects left untreated. A model is then built which predicts the difference between outcomes after treatment and without it conditional on available predictor variables. An obvious approach to uplift modeling is to build two separate models on both training sets and subtract their predictions. In many cases, better results can be obtained with models which predict the difference in outcomes directly. A popular class of uplift models are decision trees with splitting criteria favoring tests which promote differences between treatment and control groups. Ensemble methods have proven to be particularly useful in uplift modeling, often leading to significant increases in performance over the base learners. Linear models, such as logistic regression and support vector machines, have also been adapted to this setting. Dedicated methods, such as uplift or qini curves, are necessary for evaluating uplift models. Application of the methodology to survival data and scenarios with more than one possible action have also been considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 699.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 949.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  • Guelman L, Guillén M, Pérez-Marín AM (2012) Random forests for uplift modeling: an insurance customer retention case. In: Modeling and simulation in engineering, economics and management. Lecture notes in business information processing (LNBIP), vol 115. Springer, Heidelberg, pp 123–133

    Google Scholar 

  • Hansotia B, Rukstales B (2002) Incremental value modeling. J. Interact Mark 16(3):35–46

    Article  Google Scholar 

  • Holland PW (1986) Statistics and causal inference. J Am Stat Assoc 81(396):945–960

    Article  MathSciNet  MATH  Google Scholar 

  • Jaroszewicz S, Rzepakowski P (2014) Uplift modeling with survival data. In: ACM SIGKDD workshop on health informatics (HI-KDD’14), New York

    Google Scholar 

  • Jaśkowski M, Jaroszewicz S (2012) Uplift modeling for clinical trial data. In: ICML 2012 workshop on machine learning for clinical data analysis, Edinburgh

    Google Scholar 

  • Kuusisto F, Santos Costa V, Nassif H, Burnside E, Page D, Shavlik J (2014) Support vector machines for differential prediction. In: ECML-PKDD, Nancy

    Book  Google Scholar 

  • Lai Y-T, Wang K, Ling D, Shi H, Zhang J (2006) Direct marketing when there are voluntary buyers. In: Sixth International Conference on Data Mining, 2006 (ICDM’06), IEEE, Los Alamitos, pp 922–927. http://www.comp.hkbu.edu.hk/iwi06/icdm/

  • Larsen K (2011) Net lift models: optimizing the impact of your marketing. In: Predictive analytics world, workshop presentation, San Francisco

    Google Scholar 

  • Lo VSY (2002) The true lift model—a novel data mining approach to response modeling in database marketing. SIGKDD Explor 4(2):78–86

    Article  Google Scholar 

  • Radcliffe NJ, Surry PD (1999) Differential response analysis: modeling true response by isolating the effect of a single action. In: Proceedings of credit scoring and credit control VI. Credit Research Centre, University of Edinburgh Management School

    Google Scholar 

  • Radcliffe NJ, Surry PD (2011) Real-world uplift modelling with significance-based uplift trees. Portrait Technical Report TR-2011-1, Stochastic Solutions

    Google Scholar 

  • Robins J (1994) Correcting for non-compliance in randomized trials using structural nested mean models. Commun Stat—Theory Methods 23(8):2379–2412

    Article  MathSciNet  MATH  Google Scholar 

  • Rzepakowski P, Jaroszewicz S (2010) Decision trees for uplift modeling. In: Proceedings of the 10th IEEE international conference on data mining (ICDM), Sydney, pp 441–450

    Google Scholar 

  • Rzepakowski P, Jaroszewicz S (2012) Decision trees for uplift modeling with single and multiple treatments. Knowl Inf Syst 32:303–327

    Article  Google Scholar 

  • Siegel E, Davenport TH (2013) Predictive analytics: the power to predict who will click, buy, lie, or die. Wiley, Hoboken

    Google Scholar 

  • Sołtys M, Jaroszewicz S, Rzepakowski P (2015) Ensemble methods for uplift modeling. Data Min Knowl Discov 29(6):1531–1559

    Article  MathSciNet  Google Scholar 

  • Zaniewicz Ł, Jaroszewicz S (2013) Support vector machines for uplift modeling. In: The first IEEE ICDM workshop on causal discovery (CD 2013), Dallas

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Szymon Jaroszewicz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media New York

About this entry

Cite this entry

Jaroszewicz, S. (2017). Uplift Modeling. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_911

Download citation

Publish with us

Policies and ethics