Semi-parametric analysis of multi-rater data

Rogers, Simon; Girolami, Mark; Polajnar, Tamara

doi:10.1007/s11222-009-9125-z

Semi-parametric analysis of multi-rater data

Published: 24 April 2009

Volume 20, pages 317–334, (2010)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

Simon Rogers¹,
Mark Girolami¹ &
Tamara Polajnar¹

250 Accesses
8 Citations
Explore all metrics

Abstract

Datasets that are subjectively labeled by a number of experts are becoming more common in tasks such as biological text annotation where class definitions are necessarily somewhat subjective. Standard classification and regression models are not suited to multiple labels and typically a pre-processing step (normally assigning the majority class) is performed. We propose Bayesian models for classification and ordinal regression that naturally incorporate multiple expert opinions in defining predictive distributions. The models make use of Gaussian process priors, resulting in great flexibility and particular suitability to text based problems where the number of covariates can be far greater than the number of data instances. We show that using all labels rather than just the majority improves performance on a recent biological dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Albert, J., Chib, S.: Sequential ordinal modeling with applications to survival data. Biometrics 57, 829–836 (2001)
Article MathSciNet Google Scholar
Albert, J.H., Chib, S.: Bayesian analysis of binary and polychotomous response data. J. Am. Stat. Assoc. 88(422), 669–679 (1993)
Article MATH MathSciNet Google Scholar
Bickel, S., Brefeld, U., Faulstich, L., Hakenberg, J., Leser, U., Plake, C., Scheffer, T.: A support vector machine classifier for gene name recognition. In: EMBO Workshop: A Critical Assessment of Text Mining Methods in Molecular Biology, Granada, Spain, March 2004
Chu, W., Ghahramani, Z.: Gaussian processes for ordinal regression. J. Mach. Learn. Res. 6, 1–48 (2005)
MathSciNet Google Scholar
Cohen, K., Fox, L., Ogren, P., Hunter, L.: Corpus design for biomedical natural language processing. In: Proceedings of the ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases (Jan. 2005)
Cowles, M.K.: Accelerating Monte Carlo Markov Chain convergence for cumulative-link generalized linear models. Stat. Comput. 6, 101–111 (1996)
Article Google Scholar
Dawid, A.P., Skene, A.M.: Maximum likelihood estimation of observer error-rates using the em algorithm. Appl. Stat. 28(1), 20–28 (1979)
Article Google Scholar
Gelman, A., Carlin, J., Stern, H., Rubin, D.: Bayesian Data Analysis. Chapman&Hall, London (2004)
MATH Google Scholar
Girolami, M., Rogers, S.: Variational Bayesian multinomial probit regression with Gaussian process priors. Neural Comput. 18(8), 1790–1817 (2006). doi:10.1162/neco.2006.18.8.1790
Article MATH MathSciNet Google Scholar
Girolami, M., Zhong, M.: Data integration for classification problems emplying Gaussian process priors. Adv. Neural Inf. Process. Syst. 21 (2007)
Johnson, V.: An alternative to traditional GPA for evaluating student performance. Stat. Sci. 12(4), 251–269 (1997)
Article Google Scholar
Johnson, V., Albert, J.: Ordinal Data Modeling. books.google.com (Jan. 1999)
Johnson, V.E.: On Bayesian analysis of multirater ordinal data: An application to automated essay grading. J. Am. Stat. Assoc. 91(433), 42–51 (1996)
Article MATH Google Scholar
Rogers, S., Girolami, M.: Multi-class semi-supervised learning with the ε-truncated multinomial probit Gaussian process. J. Mach. Learn. Res. Workshop Conf. Proc. 1, 17–32 (2007)
Google Scholar
Smyth, P., Fayyad, U., Burl, M., Perona, P., Baldi, P.: Inferring ground truth from subjective labelling of venus images. Adv. Neural Inf. Process. Syst. 7 (1995)
Uebersax, J.S.: Statistical modeling of expert ratings on medical treatment appropriateness. J. Am. Stat. Assoc. 88(422), 421–427 (1993)
Article Google Scholar
Versley, Y.: Disagreement dissected: Vagueness as a source of ambiguity in nominal (co-) reference. In: Ambiguity in Anaphora Workshop Proceedings (2006)
Wilbur, W.J., Rzhetsky, A., Shatkay, H.: New directions in biomedical text annotation: definitions, guidelines and corpus construction. BMC Bioinf. 7, 356–356 (2006)
Article Google Scholar
Williams, C.K., Barber, D.: Bayesian classification with Gaussian processes. IEEE Trans. Pattern Anal. Mach. Intell. 20(12), 1342–1351 (1998)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing Science, University of Glasgow, Glasgow, G12 8QQ, UK
Simon Rogers, Mark Girolami & Tamara Polajnar

Authors

Simon Rogers
View author publications
You can also search for this author in PubMed Google Scholar
Mark Girolami
View author publications
You can also search for this author in PubMed Google Scholar
Tamara Polajnar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Simon Rogers.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rogers, S., Girolami, M. & Polajnar, T. Semi-parametric analysis of multi-rater data. Stat Comput 20, 317–334 (2010). https://doi.org/10.1007/s11222-009-9125-z

Download citation

Received: 09 May 2008
Accepted: 01 April 2009
Published: 24 April 2009
Issue Date: July 2010
DOI: https://doi.org/10.1007/s11222-009-9125-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Semi-parametric analysis of multi-rater data

Abstract

Access this article

Similar content being viewed by others

Inferring Rater Agreement with Ordinal Classification

Evaluation of a Confidence Interval Approach for Relative Agreement in a Crossed Three-Way Random Effects Model

Interrater reliability estimators tested against true interrater reliabilities

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Semi-parametric analysis of multi-rater data

Abstract

Access this article

Similar content being viewed by others

Inferring Rater Agreement with Ordinal Classification

Evaluation of a Confidence Interval Approach for Relative Agreement in a Crossed Three-Way Random Effects Model

Interrater reliability estimators tested against true interrater reliabilities

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation