Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Non-asymptotic robustness analysis of regression depth median

Published: 01 January 2024 Publication History

Abstract

The maximum depth estimator (aka depth median) (β R D ∗) induced from regression depth (RD) of Rousseeuw and Hubert (1999) is one of the most prevailing estimators in regression. It possesses outstanding robustness similar to the univariate location counterpart. Indeed, β R D ∗ can, asymptotically, resist up to 33% contamination without breakdown, in contrast to the 0% for the traditional (least squares and least absolute deviations) estimators (see Van Aelst and Rousseeuw (2000)). The results from Van Aelst and Rousseeuw (2000) are pioneering, yet they are limited to regression-symmetric populations (with a strictly positive density), the ϵ-contamination, maximum-bias model, and in asymptotical sense. With a fixed finite-sample size practice, the most prevailing measure of robustness for estimators is the finite-sample breakdown point (FSBP) (Donoho and Huber, 1983). Despite many attempts made in the literature, only sporadic partial results on FSBP for β R D ∗ were obtained whereas an exact FSBP for β R D ∗ remained open in the last twenty-plus years. Furthermore, is the asymptotic breakdown value 1 / 3 (the limit of an increasing sequence of finite-sample breakdown values) relevant in the finite-sample practice? (Or what is the difference between the finite-sample and the limit breakdown values?). Such discussions are yet to be given in the literature. This article addresses the above issues, revealing an intrinsic connection between the regression depth of β R D ∗ and the newly obtained exact FSBP. It justifies the employment of β R D ∗ as a robust alternative to the traditional estimators and demonstrates the necessity and the merit of using the FSBP in finite-sample real practice.

References

[1]
Rousseeuw P.J., Hubert M., Regression depth (with discussion), J. Amer. Statist. Assoc. 94 (1999) 388–433.
[2]
Van Aelst S., Robustness of deepest regression, J. Multivariate Anal. 73 (2000) 82–106.
[3]
Van Aelst S., The deepest regression method, J. Multivariate Anal. 81 (2002) 138–166.
[4]
Donoho D.L., Huber P.J., The notion of breakdown point, in: Bickel P.J., Doksum K.A., Jr. J.L. Hodges (Eds.), A Festschrift for Erich L. Lehmann, Wadsworth, Belmont, CA, 1983, pp. 157–184.
[5]
Rousseeuw P.J., Least median of squares regression, J. Amer. Statist. Assoc. 79 (1984) 871–880.
[6]
Rousseeuw P.J., Leroy A., Robust Regression and Outlier Detection, Wiley, New York, 1987.
[7]
Donoho D.L., Breakdown properties of multivariate location parameters and dispersion matrices, Ann. Statist. 20 (1992) 1803–1827.
[8]
Mizera I., On depth and deep points: a calculus, Ann. Statist. 30 (2002) 1681–1736.
[9]
J.L. Jr. Hodges, Efficiency in normal samples and tolerance of extreme values for some estimates of location, in: Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, 1967, pp. 163–168.
[10]
Hampel F.R., Contributions to the theory of robust estimation, (Ph.D. thesis) University of California, Berkeley, 1968.
[11]
Hampel F.R., A general qualitative definition of robustness, Ann. Math. Stat. 42 (1971) 1887–1896.
[12]
Hampel F.R., Ronchetti E.M., Rousseeuw P.J., Stahel W.A., Robust Statistics. The Approach Based on Influence Functions, John Wiley & Sons, New York, 1986.
[13]
Huber P.J., Robust Statistics, Wiley, New York, 1981.
[14]
Donoho D.L., Breakdown properties of multivariate location estimators, (Ph.D. qualifying paper) Dept. Statistics, Harvard University, 1982.
[15]
Lopuhaä H.P., Breakdown points of affine equivariant estimators of multivariate location and covariance matrices, Ann. Statist. 19 (1991) 229–248.
[16]
Maronna R.A., Yohai V.J., The breakdown point of simultaneous general M estimates of regression and scale, J. Amer. Statist. Assoc. 86 (1991) 699–703.
[17]
Lopuhaä H.P., Highly efficient estimators of multivariate location with high breakdown point, Ann. Statist. 20 (1992) 398–413.
[18]
Tyler D.E., Finite sample breakdown points of projection based multivariate location and scatter statistics, Ann. Statist. 22 (1994) 1024–1044.
[19]
Müller C.H., Breakdown points for designed experiments, J. Statist. Plann. Inference 45 (1995) 413–427.
[20]
Ghosh S.K., Sengupta D., On multivariate monotonic measures of location with high breakdown point, Sankhyā A 61 (1999) 362–380.
[21]
Davies P.L., Asymptotic behavior of S-estimates of multivariate location parameters and dispersion matrices, Ann. Statist. 15 (1987) 1269–1292.
[22]
Davies P.L., The asymptotics of S-estimators in the linear regression model, Ann. Statist. 18 (1990) 1651–1675.
[23]
Davies P.L., Aspects of robust linear regression, Ann. Statist. 21 (1993) 1843–1899.
[24]
Davies P.L., Breakdown and groups, Ann. Statist. 33 (2005) 977–988.
[25]
Maronna R.A., Martin R.D., Yohai V.J., Robust Statistics: Theory and Methods, John Wiley & Sons, 2006.
[26]
Liu X., Zuo Y., Finite sample breakdown point of Tukey’s halfspace median, Sci. China Math. 60 (2017) 861–974.
[27]
Zuo Y., Some quantitative relationships between two types of finite sample breakdown point, Statist. Probab. Lett. 51 (2001) 369–375.
[28]
Zuo Y., Large sample properties of the regression depth induced medain, Stat. Probab. Lett. 166 (2020).
[29]
Zuo Y., On general notions of depth for regression, Statist. Sci. 36 (1) (2021) 142–157.
[30]
Rousseeuw P.J., Struyf A., Characterizing angular symmetry and regression symmetry, J. Statist. Plann. Inference 122 (2004) 161–173.
[31]
Zuo Y., Robustness of deepest projection regression depth functional, Statist. Papers 62 (2021) 1167–1193.
[32]
Huber P.J., Finite sample breakdown of M- and P-estimators, Ann. Statist. 12 (1984) 119–126.
[33]
Davies P.L., The breakdown point — exampoles and counterexamples, Revstat. – Stat. J. 5 (2007) 1–17.
[34]
Tukey J.W., Mathematics and the picturing of data, in: James R.D (Ed.), Proceeding of the International Congress of Mathematicians, Vol. 2, Canadian Mathematical Congress, Montreal, 1975, pp. 523–531.
[35]
Chen Z., Robustness of the half-space median, J. Stat. Plan. Inference 46 (1995) 175–184.
[36]
Chen Z., Tyler D.E., The influence function and maximum bias of Tukeys median, Ann. Statist. 30 (2002) 1737–1759.
[37]
Rousseeuw P.J., Struyf A., Computing location depth and regression depth in higher dimensions, Stat. Comput. 8 (1998) 193–203.
[38]
Liu X., Zuo Y., Computing halfspace depth and regression depth, Comm. Statist. Simulation Comput. 43 (2014) 969–985.
[39]
Segaert P., Hubert M., Rousseeuw P.J., Vakili K., Depth Measures in Multivariate, Regression and Functional Settings, 2023, R, Package ‘mrfDepth’.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Journal of Multivariate Analysis
Journal of Multivariate Analysis  Volume 199, Issue C
Jan 2024
215 pages

Publisher

Academic Press, Inc.

United States

Publication History

Published: 01 January 2024

Author Tags

  1. primary
  2. secondary

Author Tags

  1. Finite-sample breakdown point
  2. Maximum regression depth estimator
  3. Non-asymptotic robustness analysis
  4. Regression depth
  5. Regression median

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Jan 2025

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media