Objective Bayesian Two Sample Hypothesis Testing for Online Controlled Experiments

Author:

Alex Deng

WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web

Page 913

https://doi.org/10.1145/2740908.2743062

Published: 18 May 2015

Abstract

As A/B testing gains wider adoption in industry, more people are beginning to realize the limitations of traditional frequentist null hypothesis statistical testing (NHST). The large number of search results for the query "Bayesian A/B testing" shows how much interest in the Bayesian perspective is growing. In recent years, some have also argued that Bayesian A/B testing should replace frequentist NHST and is strictly superior in all aspects. Our goal here is to address this myth by looking at both the advantages and the issues of Bayesian methods. In particular, we propose an objective Bayesian A/B testing framework that we hope brings together the best of Bayesian and frequentist methods. Unlike traditional methods, this method requires historical A/B test data from which to objectively learn a prior. We have successfully applied this method to Bing, using thousands of experiments to establish the priors.

Index Terms

Objective Bayesian Two Sample Hypothesis Testing for Online Controlled Experiments
1. Mathematics of computing
  1. Probability and statistics
    1. Statistical paradigms
      1. Statistical graphics

Published In

WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web

May 2015

1602 pages

ISBN: 9781450334730

DOI: 10.1145/2740908

General Chairs:
Aldo Gangemi (National Research Council, Italy & Paris 13 University-CNRS, France)
Stefano Leonardi (Sapienza University of Rome, Italy)
Alessandro Panconesi (Sapienza University of Rome, Italy)

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Other

Conference

WWW '15

Sponsor:

IW3C2

WWW '15: 24th International World Wide Web Conference

May 18 - 22, 2015

Florence, Italy

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
