skip to main content
10.1145/3308558.3313452acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Improving Treatment Effect Estimators Through Experiment Splitting

Published:13 May 2019Publication History

ABSTRACT

We present a method for implementing shrinkage of treatment effect estimators, and hence improving their precision, via experiment splitting. Experiment splitting reduces shrinkage to a standard prediction problem. The method makes minimal distributional assumptions, and allows for the degree of shrinkage in one metric to depend on other metrics. Using a dataset of 226 Facebook News Feed A/B tests, we show that a lasso estimator based on repeated experiment splitting has a 44% lower mean squared predictive error than the conventional, unshrunk treatment effect estimator, a 18% lower mean squared predictive error than the James-Stein shrinkage estimator, and would lead to substantially improved launch decisions over both.

References

  1. Hirotogu Akaike. 1998. Information theory and an extension of the maximum likelihood principle. In Selected Papers of Hirotugu Akaike. Springer, 199-213.Google ScholarGoogle Scholar
  2. Michael L Anderson and Jeremy Magruder. 2017. Split-sample strategies for avoiding false discoveries. Technical Report. National Bureau of Economic Research.Google ScholarGoogle Scholar
  3. Susan Athey and Guido Imbens. 2016. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences 113, 27(2016), 7353-7360.Google ScholarGoogle ScholarCross RefCross Ref
  4. Susan Athey, Julie Tibshirani, and Stefan Wager. 2016. Generalized random forests. arXiv preprint arXiv:1610.01271(2016).Google ScholarGoogle Scholar
  5. Susan Athey and Stefan Wager. 2017. Efficient policy learning. arXiv preprint arXiv:1702.02896(2017).Google ScholarGoogle Scholar
  6. Eduardo M Azevedo, Alex Deng, Jose Luis Montiel Olea, Justin Rao, and E Glen Weyl. 2018. The A/B Testing Problem. In Proceedings of the 2018 ACM Conference on Economics and Computation. ACM, 461-462. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Gerard Biau. 2012. Analysis of a random forests model. Journal of Machine Learning Research 13, Apr (2012), 1063-1095. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Thomas Blake and Dominic Coey. 2014. Why marketplace experimentation is harder than it seems: The role of test-control interference. In Proceedings of the fifteenth ACM conference on Economics and computation. ACM, 567-582. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Lawrence D Brown. 2008. In-season prediction of batting averages: A field test of empirical Bayes and Bayes methodologies. The Annals of Applied Statistics(2008), 113-152.Google ScholarGoogle Scholar
  10. Lawrence D Brown and Eitan Greenshtein. 2009. Nonparametric empirical Bayes and compound decision approaches to estimation of a high-dimensional vector of normal means. The Annals of Statistics(2009), 1685-1704.Google ScholarGoogle Scholar
  11. Bradley P Carlin and Thomas A Louis. 2010. Bayes and empirical Bayes methods for data analysis. Chapman and Hall/CRC. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. George Casella. 1985. An introduction to empirical Bayes data analysis. The American Statistician 39, 2 (1985), 83-87.Google ScholarGoogle Scholar
  13. Victor Chernozhukov, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, and James Robins. 2018. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal 21, 1 (2018), C1-C68.Google ScholarGoogle ScholarCross RefCross Ref
  14. Victor Chernozhukov, Whitney Newey, and James Robins. 2018. Double/de-biased machine learning using regularized Riesz representers. arXiv preprint arXiv:1802.08667(2018).Google ScholarGoogle Scholar
  15. Alex Deng. 2015. Objective bayesian two sample hypothesis testing for online controlled experiments. In Proceedings of the 24th International Conference on World Wide Web. ACM, 923-928. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Bradley Efron. 2011. Tweedie's formula and selection bias. J. Amer. Statist. Assoc. 106, 496 (2011), 1602-1614.Google ScholarGoogle ScholarCross RefCross Ref
  17. Bradley Efron. 2012. Large-scale inference: empirical Bayes methods for estimation, testing, and prediction. Cambridge University Press.Google ScholarGoogle Scholar
  18. Bradley Efron and Trevor Hastie. 2016. Computer age statistical inference. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Bradley Efron and Carl Morris. 1973. Stein's estimation rule and its competitors-an empirical Bayes approach. J. Amer. Statist. Assoc. 68, 341 (1973), 117-130.Google ScholarGoogle Scholar
  20. Bradley Efron and Carl Morris. 1975. Data analysis using Stein's estimator and its generalizations. J. Amer. Statist. Assoc. 70, 350 (1975), 311-319.Google ScholarGoogle ScholarCross RefCross Ref
  21. Bradley Efron, Carl Morris, 1976. Multivariate empirical Bayes and estimation of covariance matrices. The Annals of Statistics 4, 1 (1976), 22-32.Google ScholarGoogle ScholarCross RefCross Ref
  22. Bradley Efron, Robert Tibshirani, John D Storey, and Virginia Tusher. 2001. Empirical Bayes analysis of a microarray experiment. Journal of the American statistical association 96, 456(2001), 1151-1160.Google ScholarGoogle ScholarCross RefCross Ref
  23. Marcel Fafchamps and Julien Labonne. 2017. Using Split Samples to Improve Inference on Causal Effects. Political Analysis 25, 4 (2017), 465-482.Google ScholarGoogle ScholarCross RefCross Ref
  24. Jerome Friedman, Trevor Hastie, Holger Höfling, Robert Tibshirani, 2007. Pathwise coordinate optimization. The Annals of Applied Statistics 1, 2 (2007), 302-332.Google ScholarGoogle ScholarCross RefCross Ref
  25. F. Hayashi. 2011. Econometrics. Princeton University Press.Google ScholarGoogle Scholar
  26. William James and Charles Stein. 1961. Estimation with quadratic loss. In Proceedings of the fourth Berkeley symposium on mathematical statistics and probability, Vol. 1. 361-379.Google ScholarGoogle Scholar
  27. Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, and Nils Pohlmann. 2013. Online controlled experiments at large scale. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1168-1176. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Colin L Mallows. 1973. Some comments on Cp. Technometrics 15, 4 (1973), 661-675.Google ScholarGoogle Scholar
  29. Whitney K Newey and James R Robins. 2018. Cross-fitting and fast remainder rates for semiparametric estimation. arXiv preprint arXiv:1801.09138(2018).Google ScholarGoogle Scholar
  30. Alexander Peysakhovich and Dean Eckles. 2018. Learning Causal Effects From Many Randomized Experiments Using Regularized Instrumental Variables. In Proceedings of the 2018 World Wide Web Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 699-707. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Herbert Robbins. 1956. An Empirical Bayes Approach to Statistics. In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics. The Regents of the University of California.Google ScholarGoogle ScholarCross RefCross Ref
  32. Charles M Stein. 1956. Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. In Proc. Third Berkeley Symp. Math. Statist. Probab., 1956, Vol. 1. Univ. California Press, 197-206.Google ScholarGoogle ScholarCross RefCross Ref
  33. Charles M Stein. 1962. Confidence sets for the mean of a multivariate normal distribution. Journal of the Royal Statistical Society. Series B (Methodological) (1962), 265-296.Google ScholarGoogle Scholar
  34. Charles M Stein. 1981. Estimation of the mean of a multivariate normal distribution. The annals of Statistics(1981), 1135-1151.Google ScholarGoogle Scholar
  35. William E Strawderman. 1971. Proper Bayes minimax estimators of the multivariate normal mean. The Annals of Mathematical Statistics 42, 1 (1971), 385-388.Google ScholarGoogle ScholarCross RefCross Ref
  36. Mark J Van der Laan and Sherri Rose. 2011. Targeted learning: causal inference for observational and experimental data. Springer Science & Business Media.Google ScholarGoogle Scholar
  37. Stefan Wager and Susan Athey. 2017. Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc.just-accepted (2017).Google ScholarGoogle Scholar
  38. Stefan Wager, Wenfei Du, Jonathan Taylor, and Robert J Tibshirani. 2016. High-dimensional regression adjustments in randomized experiments. Proceedings of the National Academy of Sciences 113, 45(2016), 12673-12678.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    WWW '19: The World Wide Web Conference
    May 2019
    3620 pages
    ISBN:9781450366748
    DOI:10.1145/3308558

    Copyright © 2019 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 May 2019

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate1,899of8,196submissions,23%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format