Abstract
We propose a very simple algorithm in order to estimate the geometric median, also called spatial median, of multivariate (Small (1990)) or functional data (Gervini (2008)) when the sample size is large. A simple and fast iterative approach based on the Robbins-Monro algorithm (Duflo (1997)) as well as its averaged version (Polyak and Juditsky (1992)) are shown to be effective for large samples of high dimension data. They are very fast and only require O(Nd) elementary operations, where N is the sample size and d is the dimension of data. The averaged approach is shown to be more effective and less sensitive to the tuning parameter. The ability of this new estimator to estimate accurately and rapidly (about thirty times faster than the classical estimator) the geometric median is illustrated on a large sample of 18902 electricity consumption curves measured every half an hour during one week.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
CADRE, B. (2001): Convergent estimators for the L 1 median of Banach valued random variable. Statistics, 35, 509-521.
CHAOUCH, M., GOGA, C. (2010): Design-Based Estimation for Geometric Quantiles. Accepted for publication in Comput. Statist. and Data Analysis.
CHAUDHURI, P. (1996): On a geometric notion of quantiles for multivariate data. J. Amer. Statist. Assoc., 91, 862-871.
DIPPON, J., WALK, H. (2006): The averaged Robbins-Monro method for linear problems in a Banach space. J. Theoret. Probab. 19, (2006), 166-189.
DUFLO, M. (1997): Random Iterative Models. Springer Verlag, Heidelberg.
GERVINI, D. (2008): Robust functional estimation using the spatial median and spherical principal components. Biometrika, 95, 587-600.
GOWER, J.C. (1974): The mediancentre. Applied Statistics, 23, 466-470.
HUBER, P.J., RONCHETTI, E.M. (2009): Robust Statistics. John Wiley & Sons, second edition.
KEMPERMAN, J.H.D. (1987): The median of finite measure of a Banach space. In Statistical data analysis based on the L 1-norm and related methods, eds Y. Dodge, North-Holland, Amsterdam, 217-230.
KUSHNER, H.J, YIN, G.G. (2003): Stochastic Approximation and Recursive Algorithms and Applications. Springer Verlag, New York.
KOENKER, R. (2005). Quantile regression. Cambridge University Press.
POLYAK, B.T., JUDITSKY, A.B. (1992): Acceleration of Stochastic Approximation. SIAM J. Control and Optimization, 30, 838-855.
SMALL, C.G. (1990): A survey of multidimensional medians. Int. Statist. Inst. Rev., 58, \textit {263-277}.
VARDI, Y., ZHANG, C.H. (2000): The multivariate L 1-median and associated data depth. Proc. Natl. Acad. Sci. USA, 97, 1423-1426.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cardot, H., Cénac, P., Chaouch, M. (2010). Stochastic Approximation for Multivariate and Functional Median. In: Lechevallier, Y., Saporta, G. (eds) Proceedings of COMPSTAT'2010. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2604-3_40
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2604-3_40
Published:
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2603-6
Online ISBN: 978-3-7908-2604-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)