Finite-sample investigation of likelihood and Bayes inference for the symmetric von Mises–Fisher distribution
Introduction
Data points as three-dimensional rotation matrices arise in investigations in materials science (e.g., crystal orientations in metals Mackenzie, 1957, Bingham et al., 2009a) as well as studies of human kinematics (Rancourt et al., 2000), and the matrix Fisher is the most widely referenced distribution for modeling such observations (Downs, 1972). A large literature exists on likelihood methodology for this distribution, but there has been considerably less work on Bayes analyses. For example, advances in maximum likelihood inference for the matrix Fisher distribution have been made by Khatri and Mardia (1977) and Jupp and Mardia (1979), with further work done by Prentice (1986), Mardia and Jupp (2000), and Chikuse (2003). Chang and Rivest (2001) also developed -estimation connected to likelihood inference. In terms of Bayes inference for the matrix Fisher distribution, Chang and Bingham (1996) outlined an approach for stipulating informative priors, while in an unpublished work, Camano-Garcia (unpublished) considered Gibbs samplers for general Langevin (or matrix Fisher) distributions.
Despite these developments and the clear popularity of the matrix Fisher distribution, relatively little appears to be known about the potential benefits of Bayes inference for this distribution or about the relative finite-sample performances of likelihood and Bayes methods for this model. For example, the Bayes development in Chang and Bingham (1996) targeted large-sample approximations of posterior distributions, often with informative priors amenable to such approximations. We aim instead to explore a practical implementation of Bayes methods based on non-informative priors, under the common parametrization of the symmetric matrix Fisher used by Chang and Bingham (1996).
This matrix Fisher distribution is useful for modeling symmetric random errors around a fixed “central location” , which is a 3×3 rotation matrix that constitutes an unusual model parameter but a key model summary in the existing literature. We show that Bayes methods offer computationally straightforward inference for this and, in even small samples, can produce inference regions with frequentist coverage rates that match nominal credible levels. Additionally, while a likelihood approach can provide a confidence region for through a usual chi-square calibration, the likelihood function itself does not “order” the matrix-valued parameter space for in a geometrically clear way that easily conveys the notion of inference set “size”. As a consequence, likelihood confidence regions have mathematically convenient set-theoretic definitions, but do not clearly convey information about statistical precision. We illustrate, however, that non-informative Bayes regions for may be constructed to have a simple geometrical structure that indicates precision (size) while retaining accurate frequentist coverages. In these ways, the Bayes methods can provide improved frequentist inference for the symmetric matrix Fisher distribution.
Beyond the one-sample problem, we also build and illustrate Bayes inference in one-way random effects models based on the symmetric matrix Fisher distribution. Our intent in this is to stimulate greater interest in Bayes methods for the Fisher model by demonstrating how such methods extend naturally from the one-sample situation to more complex inference scenarios. The Bayes framework may be more tractable than a purely likelihood approach for some problems and may open new inference possibilities for the matrix Fisher distribution.
Section 2 provides a preliminary framework, including a constructive description of the symmetric matrix Fisher distribution and a likelihood formulation. Although we focus on the matrix Fisher distribution in particular, the Bayes and likelihood methodologies presented here can apply to a larger class of distributions for random rotations which, as explained in Section 2, have the same kind of “constructive” definition and parametrization as the symmetric matrix Fisher. Asymptotic distributional results for one-sample likelihood inference are given based on an Euler angle representation of , and simulations illustrate these approximations applied to the finite-sample distributions of likelihood statistics. Section 3 describes the Bayes methods for the one-sample problem using non-informative priors and a Metropolis–Hastings within Gibbs algorithm. Numerical studies compare the finite-sample properties of likelihood and Bayes inference regions for where we impose a shape constraint on these regions to facilitate interpretation. Simulations indicate that the Bayes regions tend to have better coverage properties, even in samples as small as . Section 4 then examines the performance of Bayes methods for one-way random effects models built on the symmetric matrix Fisher distribution, while Section 5 illustrates the matrix methods with human kinematics data. Section 6 provides some conclusions.
Section snippets
Constructive definition and model density
We first give a simple construction for generating random rotations from the symmetric Fisher model, from which the matrix density follows. Let represent the set of 3×3 rotation matrices (orthogonal matrices that preserve the right hand rule). The symmetric matrix Fisher distribution may be characterized through a location parameter and a spread parameter , where the model itself (denoted by here) describes the deviation of random orientations from a common “central”
One-sample Bayes analyses for the distribution
With established basics of likelihood inference in the symmetric matrix Fisher model, Section 3.1 describes a problem in calibrating likelihood regions for the location parameter which have both a meaningful geometry and correct coverage accuracy. This helps to motivate the Bayes analyses for the one-sample models based on the distribution in Section 3.2. Non-informative priors are discussed along with an approach for simulation from the posterior. A simulation study in Section 3.3
One-way random effects Bayes analyses for the Fisher model
For a three-dimensional rotation version of a one-way random effects model, suppose that there are groups of random rotations and that matrix-valued observations within a group arise as for independent of . Here parameter quantifies the between-group variation, quantifies the within-group variation (with large values indicating small variation), and represent unobservable random orientations for “centering” the groups of
Application to human kinematics
Rancourt et al. (2000) discuss an experiment in which an infra-red camera system was used to obtain orientations of the wrist, elbow, and shoulder of eight individuals when drilling into a vertically positioned metal plane at six different locations. Each subject performed the drilling at each location five times. Of interest here is the variation of the five repetitions as well as the variation between different subjects when performing the same drilling task. (Since the infra-red emitting
Conclusion
For one-sample inference on the symmetric matrix Fisher distribution, we have examined both likelihood and non-informative Bayes inference and found that Bayes methods produce competitive point and set estimates. In particular, the frequentist coverage rates of Bayes regions are close to nominal credible levels in even small samples, and the Bayes approach allows simple calibration of geometrically interpretable conic regions for the model location parameter . As expected, the coverage rates
Acknowledgement
The first author was supported by the NSF grant DMS #0502347 EMSW21-RTG awarded to the Department of Statistics, Iowa State University.
References (20)
- et al.
A statistical model for random rotations
Journal of Multivariate Analysis
(2006) - et al.
Modeling and inference for measured crystal orientations and a tractable class of symmetric distributions for rotations in 3 dimensions
Journal of the American Statistical Association
(2009) - et al.
Bayes one-sample and one-way random effects analyses for 3-D orientations with application to materials science
Bayesian Analysis
(2009) - et al.
Bayesian spherical regression
- et al.
-estimation for location and regression parameters in group models: A case study using stiefel manifolds
The Annals of Statistics
(2001) - Camano-Garcia, G., 2006. Statistics on stiefel manifolds. Ph.D. Thesis. Iowa State University, Ames, IA...
Statistics on Special Manifolds
(2003)Orientation statistics
Biometrika
(1972)- et al.
Bayesian Data Analysis
(2004) - et al.
Maximum likelihood estimators for the matrix von Mises–Fisher and Bingham distributions
The Annals of Statistics
(1979)
Cited by (10)
One-sample Bayes inference for symmetric distributions of 3-D rotations
2014, Computational Statistics and Data AnalysisCitation Excerpt :Note that for the vM-UARS case, the published work demonstrates that the Bayes methodology is completely superior to pseudo-likelihood methods.) Between this paper and the existing literature (Bingham et al., 2009b, 2010a, and Qiu et al., submitted for publication), we have established a complete one-sample non-informative Bayes methodology which is reasonable and effective for 8 parametric UARS symmetric distributions for 3-D rotations, 5 that are regular statistical models and 3 that are non-regular models. Bingham et al. (2009b) demonstrated the possibility of Bayes one-way random effects analyses and Bingham et al. (2010b) demonstrated the possibility of Bayes analysis of hierarchical models for the vM-UARS class that can easily be extended to all classes discussed here.
Algebraic analysis of rotation data
2019, arXivQuantifying spread in three-dimensional rotation data: comparison of nonparametric and parametric techniques
2015, Journal of Statistical Distributions and Applications