Abstract
This paper considers weighted composite quantile estimation of the single-index model with missing covariates at random. Under some regularity conditions, we establish the large sample properties of the estimated index parameters and link function. For the parametric part, the large sample properties show that the estimator based on the estimated selection probability has a smaller limiting variance than the one based on the true selection probability. For the estimated link function, by contrast, the large sample properties indicate that whether the weights are estimated or known has no effect on the asymptotic variance. Simulation studies and a real data analysis are presented to illustrate the behavior of the proposed estimators.
References
Chaudhuri P, Doksum K, Samarov A (1997) On average derivative quantile regression. Ann Stat 25:715–744
Guo X, Xu W, Zhu L (2014) Multi-index regression models with missing covariates at random. J Multivar Anal 123:345–363
Härdle W, Stoker TM (1989) Investigating smooth multiple regression by the method of average derivatives. J Am Stat Assoc 84:986–995
Hjort NL, Pollard D (2011) Asymptotics for minimisers of convex processes. arXiv:1107.3806
Horvitz DG, Thompson DJ (1952) A generalization of sampling without replacement from a finite universe. J Am Stat Assoc 47:663–685
Jiang R, Zhou Z, Qian W, Shao W (2012) Single-index composite quantile regression. J Korean Stat Soc 41:323–332
Kai B, Li R, Zou H (2010) Local composite quantile regression smoothing: an efficient and safe alternative to local polynomial regression. J R Stat Soc Ser B 72:49–69
Kai B, Li R, Zou H (2011) New efficient estimation and variable selection methods for semiparametric varying-coefficient partially linear models. Ann Stat 39:305–332
Knight K (1998) Limiting distributions for L1 regression estimators under general conditions. Ann Stat 26:755–770
Koenker R, Bassett G (1978) Regression quantiles. Econometrica 46:33–50
Kong E, Xia Y (2014) An adaptive composite quantile approach to dimension reduction. Ann Stat 42:1657–1688
Li KC (1991) Sliced inverse regression for dimension reduction. J Am Stat Assoc 86:316–327
Li KC (1992) On principal Hessian directions for data visualization and dimension reduction: another application of Stein’s lemma. J Am Stat Assoc 87:1025–1039
Li T, Yang H (2016) Inverse probability weighted estimators for single-index models with missing covariates. Commun Stat-Theory Methods 45:1199–1214
Li J, Li Y, Zhang R (2017) B spline variable selection for the single index models. Stat Pap 58:691–706
Liang H (2008) Generalized partially linear models with missing covariates. J Multivar Anal 99:880–895
Liang H, Wang S, Robins JM, Carroll RJ (2004) Estimation in partially linear models with missing covariates. J Am Stat Assoc 99:357–367
Little RJ, Rubin DB (1987) Statistical analysis with missing data. Wiley, New York
Liu H, Yang H (2017) Estimation and variable selection in single-index composite quantile regression. Commun Stat-Simul Comput 46:7022–7039
Liu H, Yang H, Xia X (2017) Robust estimation and variable selection in censored partially linear additive models. J Korean Stat Soc 46:88–103
Lv Y, Zhang R, Zhao W, Liu J (2014) Quantile regression and variable selection for the single-index model. J Appl Stat 41:1565–1577
Mack YP, Silverman BW (1982) Weak and strong uniform consistency of kernel regression estimates. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 61:405–415
Peng H, Huang T (2011) Penalized least squares for single index models. J Stat Plan Inference 141:1362–1379
Sherwood B, Wang L, Zhou XH (2013) Weighted quantile regression for analyzing health care cost data with missing covariates. Stat Med 32:4967–4979
Wang CY, Wang S, Gutierrez RG, Carroll RJ (1998) Local linear regression for generalized linear models with missing data. Ann Stat 26:1028–1050
Wang CY, Chen HY (2001) Augmented inverse probability weighted estimator for Cox missing covariate regression. Biometrics 57:414–419
Wong H, Guo S, Chen M, Ip WC (2009) On locally weighted estimation and hypothesis testing of varying-coefficient models with missing covariates. J Stat Plan Inference 139:2933–2951
Wu T, Yu K, Yu Y (2010) Single-index quantile regression. J Multivar Anal 101:1607–1621
Xia Y, Tong H, Li WK, Zhu LX (2002) An adaptive estimation of dimension reduction space. J R Stat Soc Ser B 64:363–410
Xia Y, Härdle W (2006) Semi-parametric estimation of partially linear single-index models. J Multivar Anal 97:1162–1184
Yang H, Liu HL (2016) Penalized weighted composite quantile estimators with missing covariates. Stat Pap 57:69–88
Zou H, Yuan M (2008) Composite quantile regression and the oracle model selection theory. Ann Stat 36:1108–1126
Acknowledgements
The authors sincerely thank the Editor, the Associate Editor and two Reviewers for their helpful comments and suggestions, which led to a significant improvement of this paper. Liu’s work is supported by the National Natural Science Foundation of China (Grant No. 11761020), the China Postdoctoral Science Foundation (Grant No. 2017M623067), the Open Foundation of Guizhou Provincial Key Laboratory of Public Big Data (Grant No. 2017BDKFJJ030), the Scientific Research Foundation for Young Talents of the Department of Education of Guizhou Province (Grant No. 2017104), and the Science and Technology Foundation of Guizhou Province (Grant No. QKH20177222). Yang’s work is supported by the National Natural Science Foundation of China (Grant No. 11671059). Peng’s work is supported by the National Natural Science Foundation of China (Grant No. 61662009) and the Science and Technology Foundation of Guizhou Province (Grant No. QKH20183001).
Appendix
Lemma 1
Let \((X_{1}, Y_{1})\), \((X_{2}, Y_{2}),\ldots , (X_{n}, Y_{n})\) be independent and identically distributed random vectors, where the \(Y_{i}\) are scalar random variables. Furthermore, assume that \(E|Y|^{s}<\infty \) and \(\sup _{x}\int |y|^{s}f(x,y)dy<\infty \), where \(f(\cdot ,\cdot )\) denotes the joint density of (X, Y). Let K be a bounded positive function with bounded support, satisfying the Lipschitz condition. Given that \(n^{2\varepsilon -1}h\rightarrow \infty \) for some \(\varepsilon <1-s^{-1}\), then
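In the form of Mack and Silverman (1982) used here (a sketch of their uniform rate; the supremum is taken over a compact set on which the design density is bounded away from zero),
\[
\sup _{x}\Big |\frac{1}{n}\sum _{i=1}^{n}\Big \{K_{h}(X_{i}-x)Y_{i}-E\big [K_{h}(X_{i}-x)Y_{i}\big ]\Big \}\Big |=O_{p}\Big (\Big \{\frac{\log (1/h)}{nh}\Big \}^{1/2}\Big ),
\]
where \(K_{h}(\cdot )=K(\cdot /h)/h\).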
Proof
This follows immediately from the result obtained by Mack and Silverman (1982). \(\square \)
Lemma 2
(Quadratic Approximation Lemma) Suppose \(A_{n}(s)\) is convex and can be represented as \(\frac{1}{2}s^{T}Vs+U^{T}_{n}s+C_{n}+r_{n}(s)\) , where V is symmetric and positive definite, \(U_{n}\) is stochastically bounded, \(C_{n}\) is arbitrary, and \(r_{n}(s)\) goes to zero in probability for each s. Then \(\alpha _{n}\), the minimizer of \(A_{n}(s)\), is only \(o_{p}(1)\) away from \(\beta _{n}=-V^{-1}U_{n}\), the minimizer of \(\frac{1}{2}s^{T}Vs+U^{T}_{n}s+C_{n}\). If also \(U_{n}\xrightarrow {d} U\), then \(\alpha _{n}\xrightarrow {d} -V^{-1}U\).
The proof of Lemma 2 is available from Hjort and Pollard (2011). The proof of the Quadratic Approximation Lemma indicates that the positive definiteness of the matrix V can be relaxed to positive semi-definiteness, or even to the existence of a generalized inverse; see Wu et al. (2010).
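As a purely numerical illustration of Lemma 2 (a minimal sketch with arbitrarily chosen V, \(U_{n}\) and remainder, not part of the proofs), the minimizer of a convex process of the stated form stays close to \(-V^{-1}U_{n}\):

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

# Hypothetical ingredients, chosen only for illustration:
# a symmetric positive definite V, a stochastically bounded U_n,
# and a small convex remainder r_n(s) that vanishes for each fixed s.
V = np.array([[2.0, 0.3],
              [0.3, 1.0]])
U_n = rng.normal(size=2)
n = 10_000

def A_n(s):
    # A_n(s) = (1/2) s'Vs + U_n's + C_n + r_n(s), with C_n = 0 and r_n(s) = |s|_1 / n
    return 0.5 * s @ V @ s + U_n @ s + np.abs(s).sum() / n

alpha_n = minimize(A_n, x0=np.zeros(2)).x   # minimizer of the full convex process
beta_n = -np.linalg.solve(V, U_n)           # minimizer of the quadratic part

print("alpha_n =", alpha_n)
print("beta_n  =", beta_n)
print("gap     =", np.linalg.norm(alpha_n - beta_n))  # small; shrinks as n grows
```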
Lemma 3
Suppose that Conditions (C1)–(C7) hold. Let \({\hat{\beta }}^{0}\) be the initial estimator of \(\beta _{0}\). As \(h\rightarrow 0\) and \(\delta _{n}\rightarrow 0\), we have
where \({\tilde{\beta }}={\hat{\beta }}^{0}-\beta _{0}\), \(\delta _{n}=\sqrt{\log (1/h)/(nh)}\) and \(\delta _{\beta }=|{\hat{\beta }}^{0}-\beta _{0}|\).
Proof
Recall that \(({\hat{a}}_{WCQR1}(u,{\hat{\beta }}^{0}),...,{\hat{a}}_{WCQRq}(u,{\hat{\beta }}^{0}),{\hat{g}}_{WCQR}'(u,{\hat{\beta }}^{0}))\) are obtained by minimizing the following objective function
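In our notation this is the inverse-probability-weighted local linear composite quantile loss (a sketch written to be consistent with the definitions below, where \(\rho _{\tau }(r)=r\{\tau -I(r<0)\}\) denotes the check loss of Koenker and Bassett 1978),
\[
\sum _{k=1}^{q}\sum _{i=1}^{n}\frac{V_{i}}{\pi (Y_{i})}\,\rho _{\tau _{k}}\big \{Y_{i}-a_{k}-b\,(X_{i}^{T}{\hat{\beta }}^{0}-u)\big \}K_{i}(u),
\]
minimized over \((a_{1},...,a_{q},b)\),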
where \(K_{i}(u)=K_{h}(X_{i}^{T}{\hat{\beta }}^{0}-u).\) Let \({\hat{\theta }}=\sqrt{nh}\{({\hat{a}}_{WCQR1}(u,{\hat{\beta }}^{0})-g(u)-c_{1}),...,({\hat{a}}_{WCQRq}(u,{\hat{\beta }}^{0})-g(u)-c_{q}),h({\hat{g}}_{WCQR}'(u,{\hat{\beta }}^{0})-g'(u))\}\), \(r_{i}(u)=g(X_{i}^{T}\beta _{0})-g(u)-g'(u)(X_{i}^{T}{\hat{\beta }}^{0}-u)\), \(r^{0}_{i}(u)=g(X_{i}^{T}{\hat{\beta }}^{0})-g(u)-g'(u)(X_{i}^{T}{\hat{\beta }}^{0}-u)\), \(\eta _{ik}=I(\epsilon _{i}-c_{k}<0)-\tau _{k}\), \(\eta _{ik}(u)=I(\epsilon _{i}-c_{k}+r_{i}(u)<0)-\tau _{k}\), \(X_{ik}=(e_{k},(X_{i}^{T}{\hat{\beta }}^{0}-u)/h)^{T}\), \(X^{*}_{k}=(e_{k},(X^{T}{\hat{\beta }}^{0}-u)/h)^{T}\), and \(e_{k}\) is a q-vector with 1 in the kth position and 0 elsewhere. Then, \({\hat{\theta }}\) is also the minimizer of
By Knight (1998), for any \(x\ne 0\), we have
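namely Knight's identity for the check loss \(\rho _{\tau }\),
\[
\rho _{\tau }(x-y)-\rho _{\tau }(x)=y\{I(x<0)-\tau \}+\int _{0}^{y}\{I(x\le t)-I(x\le 0)\}\,dt .
\]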
Then, we can get
where
and
Since \(B_{n,k}(\theta )\) is a sum of i.i.d. random variables of the kernel form, by Lemma 1 we obtain
Let \({\hat{\chi }}^{0}\) be the \(\sigma \)-field generated by \(\{X_{1}^{T}{\hat{\beta }}^{0},...,X_{n}^{T}{\hat{\beta }}^{0}\}\). Since \({\hat{\beta }}^{0}\) is a \(\sqrt{n}\)-consistent estimator of \(\beta _{0}\), we have \(f_{U}(.)=f_{U_{0}}(.)(1+o_{p}(1))\). Hence, the expectation of \(\sum _{k=1}^qB_{n,k}(\theta )\) is
where \(S=diag(C,\mu _{2}c)\), \(C=diag(f_{\epsilon }(c_{1}),...,f_{\epsilon }(c_{q}))\), \(c=\sum _{k=1}^qf_{\epsilon }(c_{k})\), and \(\mu _{2}=\int t^{2}K(t)dt\) denotes the second moment of the kernel. Hence, we have
By Lemma 2, we have
Letting \(W_{n1}=\frac{1}{\sqrt{nh}}\sum _{k=1}^q \sum _{i=1}^n\frac{V_{i}}{\pi (Y_{i})}\eta _{ik}K_{i}(u)X_{ik}\), we obtain \(Var(W_{n}-W_{n1})=o_{p}(1)\), which implies
Then, we get
Combining (A.4) and (A.5), we obtain
Considering \(E[W_{n}]\), we have
It is easy to show that
and
Therefore,
Combining (A.6) and (A.7), we complete the proof of Lemma 3. \(\square \)
Lemma 4
Suppose that Conditions (C1)–(C7) hold. As \(h\rightarrow 0\) and \(\delta _{n}\rightarrow 0\), we have
where \(\Delta _{(n,h)}=(h^{2}+1/(nh)+h^{2}\delta _{\beta }+\delta _{\beta }^{2}+h^{3})\), \({\tilde{\beta }}\), \(\delta _{n}\) and \(\delta _{\beta }\) are defined in Lemma 3.
Proof
Note that \(({\hat{a}}_{NWCQR1}(u,{\hat{\beta }}^{0}),...,{\hat{a}}_{NWCQRq}(u,{\hat{\beta }}^{0}),{\hat{g}}_{NWCQR}'(u,{\hat{\beta }}^{0}))\) are obtained by minimizing the following objective function
Let \({\hat{\theta }}^{*}=\sqrt{nh}\{({\hat{a}}_{NWCQR1}(u,{\hat{\beta }}^{0})-g(u)-c_{1}),...,({\hat{a}}_{NWCQRq}(u,{\hat{\beta }}^{0})-g(u)-c_{q}),h({\hat{g}}_{NWCQR}'(u,{\hat{\beta }}^{0})-g'(u))\}\). Then, \({\hat{\theta }}^{*}\) is also the minimizer of
where
and
Notice that
where \(I_{3}\) denotes the first summation and \(I_{4}\) the second. From the proof of Lemma 3, we have \(I_{3}=\frac{f_{U_{0}}(u)}{2}\theta ^{*T}S\theta ^{*}+o_{p}(1)=O_{p}(1)\). Using the fact that \(\sup _{y}|{\hat{\pi }}(y)-\pi (y)|=o_{p}(1)\), we have \(I_{4}=o_{p}(1)\). It then follows that
Recalling the definition of \(W_{n}^{*}\), we have
where
\(W_{n}\) and \(W_{n1}\) are given in Lemma 3.
Recalling the definition of \({\hat{\pi }}(y)\), we have
where \(O(h^{2})\) and \(O_{p}(1/\sqrt{nh^{-1}})\) do not depend on \(V_{j}\), \(j=1,...,n\).
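For concreteness, a kernel-smoothed (Nadaraya–Watson type) estimator of the selection probability, consistent with the smoothing kernel \(L_{h}\) appearing below, can be computed as in the following sketch (the Gaussian kernel and the toy selection model are illustrative choices only):

```python
import numpy as np

def nw_selection_prob(y_obs, v_obs, y_eval, h):
    """Kernel (Nadaraya-Watson) estimate of pi(y) = P(V = 1 | Y = y).

    y_obs : (n,) observed responses Y_1,...,Y_n
    v_obs : (n,) missingness indicators V_1,...,V_n (1 = covariate observed)
    y_eval: points at which pi is estimated
    h     : bandwidth of the smoothing kernel L_h(t) = L(t/h)/h
    """
    y_eval = np.atleast_1d(y_eval)
    # Gaussian kernel, used here purely for illustration
    L = lambda t: np.exp(-0.5 * t**2) / np.sqrt(2.0 * np.pi)
    w = L((y_eval[:, None] - y_obs[None, :]) / h) / h          # L_h(y - Y_j)
    return (w @ v_obs) / np.clip(w.sum(axis=1), 1e-12, None)   # sum_j V_j L_h / sum_j L_h

# toy usage: responses always observed, covariates missing with probability depending on Y
rng = np.random.default_rng(1)
y = rng.normal(size=500)
pi_true = 1.0 / (1.0 + np.exp(-(0.5 + y)))     # hypothetical selection model
v = rng.binomial(1, pi_true)
print(nw_selection_prob(y, v, np.array([-1.0, 0.0, 1.0]), h=0.3))
```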
Noting that \(E\sum _{k=1}^q[\eta _{jk}(u)K_{j}(u)X_{jk}|Y_{j}]=O_{p}(h)\), we have
By calculating the mean and the variance, we can get
Then, we have
For \(I_{6}\), we can get
Denote \(D_{ij}=\sum _{k=1}^q \frac{1}{nf_{Y}(Y_{i})\pi (Y_{i})}L_{h}(Y_{j}-Y_{i})\eta _{ik}(u)K_{i}(u)X_{ik}\). For given \(Y_{j}\), \(D_{ij}\) is independent of \(D_{lj}\) for \(i\ne l\). Hence, we have
Then, \(I_{6}\) can be rewritten as
where
Combining (A.11)–(A.13), we have
By Lemma 2, we complete the proof of Lemma 4. \(\square \)
Proof of Theorem 3.1
When the selection probability is known, let \({\hat{\beta }}_{WCQR}^{1}\) and \({\hat{c}}_{WCQRk}^{1}\), \(k=1,...,q\), denote the estimators of \(\beta _{0}\) and \(c_{k}\), \(k=1,...,q\), obtained in the first iteration of the algorithm proposed in Sect. 2.1. Then \({\hat{\beta }}_{WCQR}^{1}\) and \({\hat{c}}_{WCQRk}^{1}\), \(k=1,...,q\), minimize the following loss function:
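In the notation of Sect. 2.1, this is the inverse-probability-weighted composite quantile loss in \((\beta ,c_{1},...,c_{q})\) (a sketch written so as to match the quantities \(r_{ik}\) and \(N_{i}\) defined below),
\[
\sum _{k=1}^{q}\sum _{i=1}^{n}\frac{V_{i}}{\pi (Y_{i})}\,\rho _{\tau _{k}}\Big \{Y_{i}-c_{k}-{\hat{g}}_{WCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})-{\hat{g}}'_{WCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})\,X_{i}^{T}(\beta -{\hat{\beta }}^{0})\Big \};
\]
indeed, with \(Y_{i}=g(X_{i}^{T}\beta _{0})+\epsilon _{i}\), the argument of \(\rho _{\tau _{k}}\) equals \(\epsilon _{i}-r_{ik}-N_{i}^{T}(\beta -\beta _{0})\).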
Let \({\hat{\zeta }}_{WCQR}=\sqrt{n}({\hat{\beta }}_{WCQR}^{1}-\beta _{0})\), \({\hat{v}}_{WCQRk}=\sqrt{n}({\hat{c}}_{WCQRk}^{1}-c_{k})\), \(r_{ik}=c_{k}+{\hat{g}}_{WCQR}(X^{T}_{i}{\hat{\beta }}^{0},{\hat{\beta }}^{0})-g(X^{T}_{i}\beta _{0})+{\hat{g}}'_{WCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})(X_{i}^{T}\beta _{0}-X_{i}^{T}{\hat{\beta }}^{0})\) and \(N_{i}={\hat{g}}'_{WCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})X_{i}\). Then, \({\hat{\zeta }}_{WCQR}\) and \({\hat{v}}_{WCQRk}\), \((k=1,...,q)\) minimize the following:
where
and
Letting \(M_{n1}=\frac{1}{\sqrt{n}}\sum _{k=1}^q \sum _{i=1}^n \frac{V_{i}}{\pi (Y_{i})}[I(\epsilon _{i}< c_{k})-\tau _{k}]N_{i}\), we can get \(Var(M_{n}-M_{n1}|{\hat{\chi }}^{0},X)=o_{p}(1)\). Then,
With the help of Lemma 3 and Condition (C9), we have
Thus, we have
For \(I_{7}\), we have
where \(C_{0}=cE[g'(X^{T}\beta _{0})^2E(X|X^{T}\beta _{0})E(X|X^{T}\beta _{0})^{T}]\) and \(c\) is defined in Lemma 3.
Considering \(I_{8}\), we have
where \(C_{n}=\frac{1}{\sqrt{n}}\sum _{k=1}^q\sum _{j=1}^n\frac{V_{j}}{\pi (Y_{j})}\eta _{jk}{\hat{g}}'(X_{j}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})E[X_{j}|X^{T}_{j}\beta _{0}]\).
Combining (A.15)–(A.19), we obtain
By Lemma 2, we have
Recalling the definition of \({\hat{\zeta }}_{WCQR}\), we have
Denote \({\tilde{\Gamma }}=\Gamma ^{-\frac{1}{2}} C_{0}\Gamma ^{-\frac{1}{2}}\) and let \({\hat{\beta }}_{WCQR}^{k}\) be the estimator of \(\beta _{0}\) obtained in the kth iteration. For each k, replacing \({\hat{\beta }}_{WCQR}^{1}\) and \({\hat{\beta }}^{0}\) by \({\hat{\beta }}_{WCQR}^{k+1}\) and \({\hat{\beta }}_{WCQR}^{k}\), respectively, Equation (A.20) still holds. Letting \(\vartheta ^{k}=\Gamma ^{\frac{1}{2}}({\hat{\beta }}_{WCQR}^{k}-\beta _{0})\), we have
As in Xia and Härdle (2006), the convergence of our algorithm follows. For sufficiently large k, we have
Then,
Recalling the definition of \({\tilde{\Gamma }}\), (A.21) can be rewritten as
where \(D=\Gamma -C_{0}\).
By the central limit theorem, we have
where \(D_{1}\) is given in Theorem 3.1.
Combining Equations (A.22) and (A.23), we obtain the asymptotic normality of \({\hat{\beta }}_{WCQR}\). \(\square \)
Proof of Theorem 3.2
Let \({\hat{\beta }}_{NWCQR}^{1}\) and \({\hat{c}}_{NWCQRk}^{1}\), \(k=1,...,q\), denote the estimators obtained in the first iteration when the estimated selection probability is used; they are obtained by minimizing the following objective function:
Let \({\hat{\zeta }}_{NWCQR}=\sqrt{n}({\hat{\beta }}_{NWCQR}^{1}-\beta _{0})\), \({\hat{v}}_{NWCQRk}=\sqrt{n}({\hat{c}}_{NWCQRk}^{1}-c_{k})\), \(N^{*}_{i}={\hat{g}}'_{NWCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})X_{i}\) and \(r^{*}_{ik}=c_{k}+{\hat{g}}_{NWCQR}(X^{T}_{i}{\hat{\beta }}^{0},{\hat{\beta }}^{0})-g(X^{T}_{i}\beta _{0})+{\hat{g}}'_{NWCQR}(X_{i}^{T}{\hat{\beta }}^{0},{\hat{\beta }}^{0})(X_{i}^{T}\beta _{0}-X_{i}^{T}{\hat{\beta }}^{0})\), \(k=1,...,q\). Then, \({\hat{\zeta }}_{NWCQR}\) and \({\hat{v}}_{NWCQRk}\), \(k=1,...,q\), minimize the following
where
and \(\Gamma \) is given in the proof of Theorem 3.1. Note that
where
and
Using Lemma 4 and Conditions (C8)–(C9), and following the proof of Theorem 3.1, we have
where
Considering \(I_{9}\), we have
Using Conditions (C8)–(C9), we have
where
Combining (A.26)–(A.29), we have
By the central limit theorem, we have
where \(D_{2}\) is given in Theorem 3.2. Following the proof of Theorem 3.1, we obtain the asymptotic normality of \({\hat{\beta }}_{NWCQR}\). \(\square \)
Proof of Theorem 3.3
By Theorem 3.1, we can see that \({\hat{\beta }}_{WCQR}\) is a \(\sqrt{n}\)-consistent estimator of \(\beta _{0}\). Following the proof of Lemma 3, the asymptotic normality of \({\hat{g}}(u,{\hat{\beta }}_{WCQR})\) is obtained. \(\square \)
Proof of Theorem 3.4
By Theorem 3.2, we can see that \({\hat{\beta }}_{NWCQR}\) is a \(\sqrt{n}\)-consistent estimator of \(\beta _{0}\). Suppose that Conditions (C1)–(C7) and (C10) hold. Then, by Lemma 4, we have
where C is defined in Lemma 3,
and
By calculating the expectation and variance, we obtain the asymptotic normality of \({\hat{g}}(u,{\hat{\beta }}_{NWCQR})\). \(\square \)
Cite this article
Liu, H., Yang, H. & Peng, C. Weighted composite quantile regression for single index model with missing covariates at random. Comput Stat 34, 1711–1740 (2019). https://doi.org/10.1007/s00180-019-00886-y