Double Machine Learning at Scale to Predict Causal Impact of Customer Actions

More, Sushant; Kotwal, Priya; Chappidi, Sujith; Mandalapu, Dinesh; Khawand, Chris

doi:10.1007/978-3-031-43427-3_31

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14174))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

883 Accesses

Abstract

Causal Impact (CI) measurement is broadly used across the industry to inform both short- and long-term investment decisions of various types. In this paper, we apply the double machine learning (DML) methodology to estimate average and conditional average treatment effects across 100s of customer action types for ecommerce and digital businesses and 100s of millions of customers that can be used in decisions supporting those busiensses. We operationalize DML through a causal machine learning library. It uses distributed computation on Spark and is configured via a flexible, JSON-driven model configuration approach to estimate causal impacts at scale (i.e., across hundred of actions and millions of customers). We outline the DML methodology and implementation. We show examples of average treatment effect and conditional average treatment effect (i.e., customer-level) estimates values along with confidence intervals. Our validation metrics show a \(2.2\%\) gain over the baseline methods and a 2.5X gain in the computational time. Our contribution is to advance the scalable application of CI, while also providing an interface that allows faster experimentation, ability to onboard new use cases, and improved accessibility of underlying code for partner teams.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
We use placeholder X,Y,Z to maintain business confidentiality.
2.
We anonymize actions to preserve business confidentiality.

References

Chernozhukov, V., et al.: Double/debiased machine learning for treatment and structural parameters. Economet. J. 21(1), C1–C68 (2018). https://doi.org/10.1111/ectj.12097
Article MathSciNet MATH Google Scholar
Sekhon, J.: The Neyman-Rubin Model of Causal Inference and Estimation via Matching Methods. The Oxford Handbook of Political Methodology (2007)
Google Scholar
Holland, P.W.: Statistics and causal inference. J. Am. Stat. Assoc. 81(396), 945–960 (1986). https://doi.org/10.1080/01621459.1986.10478354
Article MathSciNet MATH Google Scholar
Neyman, J.: Sur les applications de la theorie des probabilites aux experiences agricoles: essai des principes. Master’s thesis, excerpts reprinted in English. Stat. Sci. 5, 463–472 (1923)
Google Scholar
Rubin, D.: Causal inference using potential outcomes. J. Am. Stat. Assoc. 81(396), 945–960 (2005). https://doi.org/10.1080/01621459.1986.10478354
Article Google Scholar
Chernozhukov, V., Goldman, M., Semenova, V., Taddy, M.: Orthogonal Machine Learning for Demand Estimation: High Dimensional Causal Inference in Dynamic Panels. ArXiv:1712.09988 [Stat] (2017)
Huber, P.J.: The behavior of maximum likelihood estimates under nonstandard conditions. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 5, pp. 221–233 (1967)
Google Scholar
White, H.: A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica 48(4), 817–838 (1980)
Article MathSciNet MATH Google Scholar
Horvitz, D.G., Thompson, D.J.: A generalization of sampling without replacement from a Finite Universe. J. Am. Stat. Assoc. 47(260), 663–685 (1952)
Article MathSciNet MATH Google Scholar
Nie, X., Wager, S.: Quasi-oracle estimation of heterogeneous treatment effects. ArXiv:1712.04912 [Econ, Math, Stat] (2017)
Kennedy, E.H.: Optimal doubly robust estimation of heterogeneous causal effects. ArXiv:2004.14497 [math.ST] (2020)
Robins, J.M., Mark, S.D.: Estimating exposure effects by modelling the expectation of exposure conditional on confounders. Biometrics 48, 479–495 (1992). MR1173493
Article MathSciNet MATH Google Scholar
Ruth, C., et al.: Long-Term Outcomes Of Manitoba’s Insight Mentoring Program: A Comparative Statistical Analysis. Manitoba Centre for Health Policy, Winnipeg, MB (2015)
Google Scholar
Austin, P.C., Stuart, E.A.: Moving towards best practice when using inverse probability of treatment weighting (IPTW) using the propensity score to estimate causal treatment effects in observational studies. Stat. Med. 34, 3661–3679 (2015)
Article MathSciNet Google Scholar
Hirano, K., Imbens, G.W.: Estimation of causal effects using propensity score weighting: an application to data on right heart catheterization. Health Serv. Outcomes Res. Method. 2, 259–278 (2001)
Article Google Scholar
Abadie, A., Imbens, G.: On the failure of bootstrap for matching estimators. Econometrica 76(6), 1537–1157 (2008)
Article MathSciNet MATH Google Scholar
Victor, C., Demirer, M., Duflo, E., Fernandez-Val, I.: Generic machine learning inference on heterogenous treatment effects in randomized experiments. aXiv:1712.04802v6 [stat.ML] (2022)
Google Scholar
Knaus, M.C., Lechner, M., Strittmatter, A.: Machine learning estimation of heterogeneous causal effects: empirical monte carlo evidence. Economet. J. 24(1), 134–161 (2021)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Amazon, Seattle, WA, USA
Sushant More, Priya Kotwal, Sujith Chappidi, Dinesh Mandalapu & Chris Khawand

Authors

Sushant More
View author publications
You can also search for this author in PubMed Google Scholar
Priya Kotwal
View author publications
You can also search for this author in PubMed Google Scholar
Sujith Chappidi
View author publications
You can also search for this author in PubMed Google Scholar
Dinesh Mandalapu
View author publications
You can also search for this author in PubMed Google Scholar
Chris Khawand
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sushant More .

Editor information

Editors and Affiliations

CENTAI, Turin, Italy
Gianmarco De Francisci Morales
NYU and Two Sigma, New York, NY, USA
Claudia Perlich
Netflix, Los Angeles, CA, USA
Natali Ruchansky
Telefonica Research, Barcelona, Spain
Nicolas Kourtellis
Politecnico di Torino, Turin, Italy
Elena Baralis
CENTAI, Turin, Italy
Francesco Bonchi

Ethics declarations

Ethical Implication

The data presented in the paper is completely anonymized. It cannot be used for inference of personal information of any kind.

The method presented in this paper falls in the domain of observational causal inference. Observational causal inference methods are used to gauge impact of things already happened. The inference methods by itself do not aid in any wrong doing. But in the unfortunate case of bad things happening to an individual (e.g., unfair economic policy/ smoking/ abuse), the causal methods can help identify the impact and help guide the recovery methods. In that sense, work presented here can be used to seek justice for the victim.

Of course, as a society we want to make sure that we do not subject individuals to an unscrupulous treatment to extract the causal impact of that treatment. Because the impact of such treatment could be adverse in some cases. But again the work presented here is used to analyze the aftermath of an action/ treatment. The type of treatments a person can be subjected to is outside the scope of current work.

Appendices

Appendix

A Sample JSON Config

We show a snippet of JSON config in Fig. 11. We can swap the specified models in the outcome and propensity step with any ML model. Likewise we can easily configure pre/post-processing steps and hyperparameters through JSON files.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

More, S., Kotwal, P., Chappidi, S., Mandalapu, D., Khawand, C. (2023). Double Machine Learning at Scale to Predict Causal Impact of Customer Actions. In: De Francisci Morales, G., Perlich, C., Ruchansky, N., Kourtellis, N., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14174. Springer, Cham. https://doi.org/10.1007/978-3-031-43427-3_31

Download citation

DOI: https://doi.org/10.1007/978-3-031-43427-3_31
Published: 17 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43426-6
Online ISBN: 978-3-031-43427-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Double Machine Learning at Scale to Predict Causal Impact of Customer Actions