DOI: 10.1145/2740908.2743062

Objective Bayesian Two Sample Hypothesis Testing for Online Controlled Experiments

Published: 18 May 2015

Abstract

As A/B testing gains wider adoption in industry, more people are beginning to realize the limitations of traditional frequentist null hypothesis statistical testing (NHST). The large number of search results for the query "Bayesian A/B testing" shows how quickly interest in the Bayesian perspective is growing. In recent years, some have also argued that Bayesian A/B testing should replace frequentist NHST and is strictly superior in all aspects. Our goal here is to clarify this myth by looking at both the advantages and the issues of Bayesian methods. In particular, we propose an objective Bayesian A/B testing framework with which we hope to bring together the best of Bayesian and frequentist methods. Unlike traditional methods, this method requires historical A/B test data from which to objectively learn a prior. We have successfully applied this method at Bing, using thousands of experiments to establish the priors.
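
The idea of learning a prior from historical experiments can be illustrated with a minimal empirical Bayes sketch. This is not the paper's model, which the abstract does not spell out. It assumes, purely for illustration, that true treatment effects follow a single normal prior, that each experiment reports an effect estimate with a known standard error, and that the prior is fitted by the method of moments. The function names `learn_normal_prior` and `posterior` are hypothetical.

```python
from statistics import mean, pvariance


def learn_normal_prior(historical_effects, historical_ses):
    """Method-of-moments fit of an assumed N(mu0, tau2) prior on true effects.

    historical_effects: observed effect estimates from past experiments.
    historical_ses: their standard errors (same order, same length).
    """
    mu0 = mean(historical_effects)
    # Variance of the observed estimates = prior variance + sampling noise.
    total_var = pvariance(historical_effects, mu0)
    noise_var = mean([se ** 2 for se in historical_ses])
    # Clamp at zero: the moment estimate can go negative on noisy data.
    tau2 = max(total_var - noise_var, 0.0)
    return mu0, tau2


def posterior(delta_hat, se, mu0, tau2):
    """Normal-normal conjugate update for one new experiment.

    Returns the posterior mean and variance of the true effect.
    """
    if tau2 == 0.0:
        # Degenerate prior: all mass sits at mu0.
        return mu0, 0.0
    weight = tau2 / (tau2 + se ** 2)  # share of trust placed in the new data
    post_mean = mu0 + weight * (delta_hat - mu0)
    post_var = weight * se ** 2
    return post_mean, post_var
```

With a prior fitted this way, a new experiment's estimate is shrunk toward the historical mean, and the shrinkage is stronger when the new experiment is noisier relative to the spread of past effects.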

    Published In

    WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web
    May 2015
    1602 pages
    ISBN: 9781450334730
    DOI: 10.1145/2740908

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. A/B testing
    2. Bayesian statistics
    3. controlled experiments
    4. empirical Bayes
    5. multiple testing
    6. objective Bayes
    7. optional stopping
    8. prior

    Qualifiers

    • Other

    Conference

    WWW '15
    Sponsor:
    • IW3C2

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Cited By

    • (2021) On Post-selection Inference in A/B Testing. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 2743-2752. DOI: 10.1145/3447548.3467129. Online publication date: 14-Aug-2021
    • (2021) Online Experimentation with Surrogate Metrics: Guidelines and a Case Study. Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 193-201. DOI: 10.1145/3437963.3441737. Online publication date: 8-Mar-2021
    • (2021) Comparative Probability Metrics: Using Posterior Probabilities to Account for Practical Equivalence in A/B tests. The American Statistician, pp. 1-34. DOI: 10.1080/00031305.2021.2000495. Online publication date: 2-Nov-2021
    • (2018) A note on Type S/M errors in hypothesis testing. British Journal of Mathematical and Statistical Psychology 72:1, pp. 1-17. DOI: 10.1111/bmsp.12132. Online publication date: 23-Mar-2018
    • (2017) A/B Testing at Scale. Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1395-1397. DOI: 10.1145/3077136.3082060. Online publication date: 7-Aug-2017
    • (n.d.) False Discovery in A/B Testing. SSRN Electronic Journal. DOI: 10.2139/ssrn.3718802
    • (n.d.) (Implication of Financial Reforms in China and Vietnam for North Korea). SSRN Electronic Journal. DOI: 10.2139/ssrn.2782307