Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Preprint
Article

The Consistency of the Distribution Function Conditional Estimate and Application on the Consistency and Asymptotic Normality of the Conditional Hazard Function Estimate for High Dimensional Quasi-Associated Data

Altmetrics

Downloads

154

Views

57

Comments

0

Submitted:

02 August 2023

Posted:

03 August 2023

You are already at the latest version

Alerts
Abstract
The objective of this study is to examine a nonparametric estimate , using the kernel approach, of the conditional distribution function of a scalar response variable that is given a random variable whose values take place in a separable real Hilbert space. The observations will be dependent on one another in a quasi-associated fashion. The pointwise practically perfect consistencies with rates of this estimator are established by us under some broad conditions. The study’s major objective is to investigate the convergence rate of the proposed estimator and its application in the convergence rate and asymptotic normality of the hazard function. The asymptotic normality of the developed estimator is established precisely. Simulation studies were conducted to investigate the behavior of the asymptotic property in the context of finite sample data.
Keywords: 
Subject: Computer Science and Mathematics  -   Mathematics

1. Introduction

Mathematics and statistical analysis techniques have been shown to be of considerable relevance in a variety of scientific sectors in recent years, including engineering, the economics, clinical medicine, and healthcare. In particular, the application of methods of mathematical and statistical analysis to engineering, the economics, healthcare, and clinical medicine, as well as demonstrating how these approaches may assist in such vital areas as comprehension, prediction, correlation, diagnosis, therapy, and data processing.
Functional data is the subject of this research. [3] excerpt: Statistics’ functional data analysis (FDA) analyzes infinite-dimensional variables including curves, sets, and pictures. The "Big Data" revolution has spurred its rapid expansion over the past 20 years.
This may be demonstrated by researching the topic’s past (for an example, see [1]). In [24], the topics of density and mode estimation for normed vector space data are discussed. In addition to that, he discussed the problem of excessive dimensionality in functional data and offered potential remedies. Nonparametric models were investigated for use in regression estimation in [22].
The treatment of functional data today typically involves contemporary theory. For example, the reference [18] presented the consistency rates of a variety of conditional distribution functionals, such as the regression function, conditional cumulative distribution, and conditional density, uniformly over a subset of the explanatory variable. These functionals include the conditional cumulative distribution and conditional density. The conditional cumulative distribution as well as the conditional density are both included in these functionals.
Additional examples include the conditional cumulative distribution and the conditional density. Uniformly in bandwidth (UIB) consistency was extended to the ergodic scenario, and the rates of consistency for different functional nonparametric models were investigated in [26]. The regression function was a part of these models, a conditional hazard function, conditional distribution, and conditional density.
In the field of statistical mathematics, in recent years, there has been a surge of curiosity on the statistical analysis of functional data. These numbers are used in econometrics, medicine, environmental science, and many other fields. In the statistical functional, [22] made the first attempt to estimate the conditional density function and its derivatives . In addition, they were the ones who were the first to do so in the scientific community.
These authors reached an extent of convergence in the case i.i.d. that was really near to being finished. Since this work was published, much more research has been done on estimating the conditional density and its derivatives, especially for computing the conditional mode. Since the publication of this work, the conditional mode has been the subject of a significant amount of research.
In point of fact, [19] demonstrated that a kernel estimator of the conditional mode will almost certainly converge to the true value. They did this by taking into account data that included a-mixing. The conditional density is maximized by the random variable that corresponds to this mode, which is defined as.
The point that nullifies the kernel density estimator derivative was used by [16,17] to estimate the conditional mode. This was done in order to determine the conditional mode. The outcomes were comparable using this approach.
The latter put more of an emphasis on the estimator’s asymptotic normality, which was provided in both the iid and mixed circumstances respectively. Both scenarios included mixing. [27] was able to identify with what level of accuracy the terms that dominate the quadratic error that is produced by the kernel density estimator.
We suggest that the reader check out [28] for further information on the topic of the smoothing parameter that should be used in the process of estimating the conditional density in relation to the functional explanatory variable.
The concept of a quasi-association variable refers to a variable that exhibits some degree of association with another variable, Examples of research that processed data under both positive and negative dependent random variables are [34], [32], and [33]. [14] were the first people to offer the concept of quasi-association for the purpose of conducting an analysis of real-valued stochastic occurrences. This is a very striking illustration of the idea of weak reliance. It was used by [7] to real valued random fields, and it provides a unified technique for the study of families of positively dependent as well as negatively dependent random variable families.
According to our knowledge, the nonparametric estimation of quasi-associated random variables is addressed in a vanishingly small number of published papers. The study conducted by [13] focuses on a limit theorem for quasi associated Hilbertian random variables. Meanwhile, the research conducted by [5] explores asymptotic results for an M-estimator of the regression function for quasi-associated processes; and [36], who investigated both quasi-associated processes and asymptotic results. [9] investigated the asymptotic normality of this final estimator as part of their study, which focused on the single-index structure of the conditional hazard function.
To solve relative regression, [31] explored the nonparametric estimate for linked random variables. it was found to be significant by [10] and [10]. Both of these results were found to be significant. Both Daoudi and Mechab were responsible for conducting these investigations in their own separate ways.
It is important to keep in mind that the phenomena that our results are connected to the model’s functional space in some manner, just as they are related to every previous asymptotic statistics functional nonparametric finding.
In this research, we examine the conditional distribution function within the context of the quasi-associated situation, focusing on its asymptotic bias and dispersion. This study examined pointwise consistency for nonparametric conditional density function estimates.
This objective was successfully accomplished through the conduct of research on the rate of the almost-complete convergence (a.co.) for nonparametric estimates of the conditional distribution function.
The following is the structure of our paper: In the following section, we will describe our model. In the part 3, we present the hypotheses and remarks, along with a few notations, followed by a discussion of supplementary findings and supporting in paragraph 4. The Main Results are found in Section 5. The part 6 covers the application on consistency rates and the user is referring to the concepts of consistency and asymptotic normality in the context of estimating a conditional hazard function. he establishment of confidence bands for estimations in part 7. In Section 8, we will analyze and assess the behavior of our asymptotic normality results, on finite sample data. In conclusion, Auxiliary results and proofs are presented in Section 9.

2. Model and Estimator

To commence, we provide a precise delineation of quasi-association pertaining to random variables that possess values within a separable Hilbert space.
Consider a separable Hilbert space ( H , < . , . > ) furnished with an orthonormal basis. The sequence of elements is represented by e k , k 1 . Let ( R n ) n N be a sequence of real random variables with values in H . The statement is made that this sequence exhibits quasi-association in relation to the basis. The user defines e k as a variable. The term "quasi-associated" is used to describe the sequence e k if, for any positive integer d, the d-dimensional sequence { ( < R i , e j 1 > , . . . , < R i , e j d > ) , i N } is also quasi-associated.
In this analysis, we will examine a group of n quasi-associated random variables, which we will represent as W i = ( R i , S i ) 1 i n . The random variables mentioned exhibit the same distribution as the random variable W = ( R , S ) , which represents values in a separable real Hilbert space denoted as H × R . This Hilbert space is equipped with an inner product denoted as < . , . > , which generates the norm. The user’s text does not contain any meaningful information. The semi-metric d is considered, as given by. r , r H / d ( x , x ) = r r . A point r in the H space, its fixed neighborhood N r , and a compact subset of R all have the notation S For each r N r , there is a S such that R = r . Using a sample of n dependent observations from random variables with the same distribution as W : = ( R , S ) , the conditional distribution function F r ( s ) is estimated.
We present the F ^ r estimator of F r , a kernel type estimator, defined as:
F ^ r ( s ) = i = 1 n K ( h H 1 d ( r , R i ) ) H ( h H 1 ( s S i ) ) K ( h H 1 d ( r , R i ) ) s R
As n increases to infinity, the sequence of positive real integers h K = h K , n (resp. h H = h H , n ) converges to zero.
Let K denote the kernel and H represent a particular distribution function.

3. Hypotheses and Notations

When there is no chance of misunderstanding, we shall designate any strictly positive generic constants along the actual paper by the notation l or/and l . This will only happen when there is no risk of mistake. w in the process signifies a fixed point in H , and N w stands for a fixed neighborhood of w. Taking into account the fact that the random couple { ( W i , T i ) , i N } is a process that is stationary.
Let λ k denote the covariance coefficient, which may be found by using the equation:
λ k = s u p s k | i j | s λ i , j .
Where
λ i , j = k = 1 l = 1 | C o v ( W i k , W j l ) | + k = 1 | C o v ( W i k , T j ) | + l = 1 | C o v ( W i , X j l ) | + | C o v ( T i , T j ) | .
The k t h component of W i , as represented by R i k , where R i k : = < W i , e k > is the definition of R k i . Let’s denote the ball as B ( w , h ) : = { r H / d ( w , w ) < h } . This represents the ball with a center of w and a radius h , when h > 0 .
For the purpose of establishing the virtually full convergence of the estimator of the F ^ r ( s ) , the following assumptions are required, and in order to have a functional explanation, we need to incorporate he following assumptions:
(P1)
P ( R B ( r , h ) ) = ϕ r ( h ) > 0 and the function ϕ r ( h ) is a differentiable at 0.
(P2)
The conditional distribution function F r ( s ) satisfies the Holder condition, that is: ( r 1 , r 2 ) N r × N x , ( s 1 , s 2 ) S 2
| F r 1 ( s 1 ) F r 2 ( s 2 ) | l d z 1 ( r 1 , r 2 ) + | s 1 s 2 | z 2 , z 1 > 0 , z 2 > 0 .
S is a fixed compact subset of R .
(P3)
The kernel, denoted by H, is a differentiable function, and its inverse, denoted by H , is a positive, bounded, and Lipschitzian continuous function:
| t | z 2 H ( t ) d t < a n d H 2 ( t ) d t < .
(P4)
For K, we have the necessary conditions for it to be a bounded continuous Lipschitz function, which are:
l [ 0 , 1 ] ( . ) < K ( . ) < l [ 0 , 1 ] ( . )
[ 0 , 1 ] is referred to as an indicator function.
(P5)
A quasi-association exists between the sequence of random pairings ( R i , S i ) , where i N , and the covariance coefficient λ k , k N , as long as the conditions are met.
a > 0 , l > 0 , t h e n λ k l e a k
(P6)
The joint distribution functions are defined so that they hold for each and every pair ( i , j ) .
Ψ i , j ( h ) = P [ ( R i , R j ) B ( r , h ) × B ( r , h ) ) ]
satisfy :
sup i j Ψ i , j ( h ) = O ( ϕ r 2 ( h k ) ) > 0 .
(P7)
The sequences of positive values that make up the bandwidths h K and h H fulfill the following equations when j is equal to 0 and 1:
lim n log 5 ( n ) n h H j ϕ r ( h k ) = 0 .

4. Remarks Concerning the Hypotheses

The attribute of concentration of the explanatory variable is denoted by the assumption (P1) in the context of tiny balls. The condition (P2) is applied to our model in order to regulate the regularity of the functional space. They are essential for calculating the bias component of the convergence rates and are needed to be present. Both the (P3) and (P4) hypotheses center their attention on the cumulative function H as well as the kernels K H , and K , respectively. In particular, this assumption is utilized to get rid of the word "bias" in the conclusion of asymptotic normalcy. The fifth assumption, denoted as P5, is widely regarded as a classical assumption in functional estimation within spaces of both finite and infinite dimensions. The hypothesis (P6) is a notation that refers to a structural condition that is applied to the quasi-associated data. In order for us to demonstrate the asymptotic normality of our model when it is exposed to quasi-association, it is required for us to have the assumption (P7). Following is a definition of the asymptotic behavior of the joint distribution of the pair ( R i , R j ).

5. Main Results

Theorem 1. 
Under hypotheses ( P 1 ) ( P 7 ) , we have:
F ^ r ( s ) F r ( s ) = O ( h K b 1 + h K b 2 ) + O L o g ( n ) n ϕ R ( h K ) 1 2
The following decomposition, together with the lemmas that are listed below, provide the foundation for the proof of Theorem 1:
F ^ r ( s ) F r ( s ) = 1 F ^ D r F ^ N r ( s ) E F ^ N r ( s ) F r ( s ) E F ^ N r ( s ) + F r ( s ) F ^ D r F ^ D r E F ^ D r
We can write:
F ^ r ( s ) = F ^ N r ( s ) F ^ D r
Where:
F ^ N r ( s ) = 1 n E [ K 1 ( r ) ] i = 1 n K i ( r ) H i ( s )
and:
F ^ D ( r ) = 1 n E [ K 1 ( r ) ] i = 1 n K i ( r )
with :
K i ( r ) = K ( h K 1 d ( r , R i ) ) a n d H i ( s ) = H ( h H 1 ( s S i ) )
Lemma 1. 
Under hypotheses ( P 1 ) - ( P 4 ) and ( P 6 ) :
1 F ^ D r ( s ) F ^ N r ( s ) E F ^ N r ( s ) = O a . c o L o g ( n ) n ϕ R ( h K ) 1 2
Corollary 1. 
Under hypotheses ( P 1 ) - ( P 4 ) and ( P 6 ) , we have:
i = 1 P | F ^ D r ( s ) | < 1 / 2 <
Lemma 2. 
Under hypotheses ( P 1 ) - ( P 6 ) , we have:
1 F ^ D r ( s ) F r ( s ) E F ^ N r ( s ) = O ( h K b 1 + h K b 2 )
Lemma 3. 
Under hypotheses ( P 1 ) - ( P 3 ) and ( P 6 ) , we have:
F ^ D r ( s ) E F ^ D r ( s ) = O a . c o l o g ( n ) n ϕ r ( h K ) 1 2

6. Application on the Conditional Hazard Function Estimate

Using our most important result, The estimator of the conditional hazard function Z ^ r ( s ) exhibits convergence behavior , it was examined in our analysis. This estimator is defined as follows:
Z ^ r ( s ) = f ^ r ( s ) 1 F ^ r ( s ) s R ,
where f ^ r ( s ) is an estimate of the conditional density given by
f ^ r ( s ) = h H 1 i = 1 n K ( h K 1 d ( r , R i ) ) H ( h H 1 ( s S i ) ) i = 1 n K ( h K 1 d ( r , R i ) ) s R
This is the derivative of H expressed as H .

6.1. Almost Complete Convergence with the Rate

Theorem 2. 
Assuming (P1)-(P7) and Theorem 1. of [23], we obtain:
| Z ^ r ( s ) Z r ( s ) | = O h K b 1 + h K b 2 + O a . c o log n n h H ϕ r ( h K ) 1 2 .
The proof of Theorem 2 is the sum of this paper’s proof and theorem’s proof in [23]

6.2. Asymptotic Normality

Theorem 3. 
Assuming (P1)-(P7) and Theorem 1. of [10] and [11], we obtain: for any r A
n h H ϕ ( r , h K ) Z ^ r ( s ) Z r ( s ) D N ( 0 , σ h 2 ( r ) ) a s   n
where
A = { r H , f r ( s ) ( 1 F r ( s ) ) 0 }
σ h 2 ( r ) = C 2 Z r ( s ) C 1 2 ( 1 F ( s , r ) ) H 2 ( t ) d t ,
In
C j = K ( 1 ) 0 1 ( K j ) ( s ) β ( r , s ) d s , f o r j = 1 , 2 .
The symbol D denotes convergence in distribution.

7. Confidence Bands

One important aspect in statistical analysis is the establishment of confidence bands for estimations. These confidence bands provide a range of values within which we can be confident that the true value lies. By calculating and interpreting these confidence bands, we can gain a better understanding of the uncertainty associated with our estimations. The objective of this section is to utilize our asymptotic normality result (Theorem 3) to create confidence intervals for the actual value of Z r ( s ) for a specified curve with the format < X , X > = < x , x > . Estimation using nonparametric methods relies on the asymptotic variance, which is determined by a number of unknown functions. Regarding our situation, we have
σ Z 2 ( r ) = C 2 Z r ( s ) C 1 2 ( 1 F r ( s ) )
The variables Z r ( s ) , F r ( s ) , C 1 , and C 2 are not known in advance and must be approximated during the practical implementation. It is possible to derive confidence bands even when σ Z 2 ( r ) is functionally given. An estimate for the asymptotic standard deviation σ ^ Z 2 ( r ) may be produced using the estimators Z ^ ( r ) ( s ) , F ^ ( r ) ( s ) , C ^ 1 , and C ^ 2 for Z ( r ) ( s ) , F ( r ) ( s ) , C 1 , and C 2 , respectively.
σ ^ Z 2 ( r ) = C ^ 2 Z ^ r ( s ) C ^ 1 2 ( 1 F ^ r ( s ) ) .
The constants C 1 and C 2 are estimated empirically in the following manner:
C ^ 1 = 1 n ϕ ( r , h K ) i = 1 n K ( h K 1 d ( r , R i ) ) , C ^ 2 = 1 n ϕ ( r , h K ) i = 1 n K 2 ( h K 1 d ( r , R i ) )
where ϕ ( r , h K ) = 1 n i = 1 n I { | < r R i > | < h K } .
Now the asymptotic confidence band at asymptotic level 1 ζ for Z r ( s ) is given by
Z ^ r ( s ) u 1 ζ 2 σ ^ Z 2 ( r ) n ϕ ( r , h K ) 1 / 2 , Z ^ r ( s ) + u 1 ζ 2 σ ^ Z 2 ( r ) n ϕ ( r , h K ) 1 / 2
where u 1 ζ 2 denotes the 1 ζ 2 quantile of the standard normal distribution.

8. A simulation Study

In this part, we examine how our asymptotic normality conclusions behave across data from a limited sample.
Our primary purpose is to demonstrate how simple the conditional hazard function is to develop and to study the effect of dependency on this asymptotic characteristic.
We produce functional observations for this purpose by examining the following functional nonparametric model:
Z i = r ( W i ) + ϵ i for i = 1 , , n
Where ϵ i N ( 0 , . 5 ) . The linear process with quasi-associated variables is well known to satisfy the requirement (H7).
As a result, the functional regressor with quasi-associates shown below is constructed.
Z i ( t ) = j = i + 1 i + m Γ j ( t )
Where
Γ j ( t ) = s j t 2 + h j t + g j t [ 0 , 1 ]
( s j ) j N ( 0 , 1 2 ) (resp. ( h j ) j N ( 1 , 1 2 ) and ( g j ) j N ( 1 , 1 2 ) ).
The Figure 1 shows the Z i ’s curves discretized in the same 100-point grid in [0, 1] for three separate m=1 (independent case), and 10 values
Furthermore, the regression operator computes the scalar variable Z i :
r ( w ) = 5 0 1 exp W ( t ) d t .
Given W = w , we may derive the theoretical conditional from the error distribution ( ϵ i ). The distribution function of Z is supplied by e i ’s distribution shifted by r ( w ) . As a result, the theoretical conditional hazard function is simple to compute. To demonstrate this function’s asymptotic normality, we fix one curve, w = W 0 , and z = Z 0 , from the created data, then collect m independent n-samples of the same data and compute the quantity:
n h H ϕ ( r , h K ) σ ^ h K 1 ( r ) Z ^ r ( s ) Z r ( s )
where σ ^ h K is the previous section’s standard deviation estimation.
The collected sample is next tested for normality. While the cross-validation approach determines , We utilized a quadratic kernel K represented as: the bandwidth parameters h H and h K dependent on the number of nearest neighbors and we have used quadratic kernel K expressed by
K ( t ) = 3 2 ( 1 t 2 ) I [ 0 , 1 ] .
The Figure 2 depicts a Graph of the collected sample vs a typical normal distribution at varied m=1 and 10 values.
It is evident that our estimator is simple to use and behaves well in practice. Furthermore, the pace of asymptotic normality convergence is substantially related to data correlation. It diminishes in particular with respect to m values.

9. Auxiliary Results and Proofs

Corollary 2. 
19 (See, [13]). Let ( R n ) n N be a quasi-associated sequence of random variables with values in H . Let h B L ( H I ) L and M B L ( H J ) L , for some finite disjoint subsets I , J N . Then
C o v ( h ( R i , i I ) , M ( R j , j J ) ) L i p ( h ) L i p ( M ) i I j J k = 1 l = 1 C o v ( R i k , R j l )
where ( B L ( H u ; u > 0 ) is the set of bounded Lipschitz functions h : H u R and L is the set of bounded functions.
Corollary 3.(See, [25]). Let R 1 , . . . , R n the real random variables such that E ( R j ) = 0 and P ( R j M ) = 1 for all j = 1 , . . . , n and some M < , Let σ n 2 = V a r ( i = 1 n Δ i ) .
Assume, furthermore, that there exist K < and β > 0 such that, for all u-uplets ( s 1 , . . . , s u ) N u , ( t 1 , . . . , t v ) N v with 1 s 1 s u t 1 t v n .
The following inequality is fulfilled:
c o v ( R s 1 . . . R s u , R t 1 . . . R t v ) K 2 M u + v 2 v e β ( t 1 s u ) .
Then,
P ( j = 1 n R j > t ) exp { t 2 / 2 A n + B n 1 / 3 t 5 / 2 }
for some
A n σ n 2
and
B n = ( 16 n K 2 9 A n ( 1 e β ) 1 ) 2 ( K M ) 1 e β .
The proof of Lemma 1
We put:
Δ i = 1 n h H E [ K 1 ] χ ( R i , S i ) , 1 i n .
where :
ψ ( R i , S i ) = K ( h K 1 d ( r , R i ) ) H ( s S i ) E [ K 1 H 1 ] 1 i n ,
R i H , S i R : E ( Δ i ) = 0 and,
| F ^ N r ( s ) E F ^ N r ( s ) | = i = 1 n Δ i
Moreover, we can write :
| | ψ | | 2 C | | K | | | | H | |
and :
L i p ( ψ ) C h K 1 | | H | | L i p ( K ) + h H 1 | | K | | L i p ( H ) .
In order to use the lemma that was developed by [25], we need to do an evaluation of the variance term V a r ( i = 1 n Δ i ) as well as the covariance term C o v i = 1 u Δ s i , j = 1 v Δ t j . Both of these terms are denoted by the following for all ( s 1 , . . . , s u ) N u , ( t 1 , . . . , t v ) N v with 1 s 1 . . . s u t 1 . . . t v n .
We explore the following cases with respect to the covariance term: if t 1 = s u . Utilizing the Reality that
E [ | K 1 H 1 | ] = O a . c o ϕ r ( h K )
and
E [ | K 1 | ] = O a . c o ϕ r ( h K )
we have:
| C o v i = 1 u Δ s i , j = 1 v Δ t j | C n E [ K 1 ] u + v E ψ | R 1 , S 1 | u + v C | | K | | | | H | | n E [ K 1 ] u + v E | K 1 H 1 | ϕ r ( h K ) C n ϕ r ( h K ) u + v
If it t 1 > s u , Under ( P 5 ) , the quasi-association yields:
| C o v i = 1 u Δ s i , j = 1 v Δ t j | h K 1 L i p ( K ) + h H 1 L i p ( H ) n E [ K 1 ] 2 × C n E [ K 1 ] u + v 2 i = 1 u j = 1 v λ s i , t j ( h K 1 L i p ( K ) + h H 1 L i p ( H ) ) 2 C n E [ K 1 ] u + v v λ t 1 s u ( h K 1 L i p ( K ) + h H 1 L i p ( H ) ) 2 C ϕ r ( h K ) u + v v e α ( t 1 s u )
On the other hand, if we take into account ( P 6 ) , we have:
| C o v i = 1 u Δ s i , j = 1 v Δ t j | C | | K | | | | H | | n E [ K 1 ] u + v 2 E | Δ s u , Δ t 1 | + E | Δ s u | E | Δ t 1 | C | | K | | | | H | | n E [ K 1 ] u + v 2 C n E [ K 1 ] × h H ( s u p i j P ( ( R i , R j ) B ( r , h K ) × B θ ( r , h K ) + P ( R 1 B ( r , h K ) ) 2 ) ) C h H ϕ r ( h K ) u + v ( ϕ r ( h K ) ) 2
In addition, by taking a γ p o w e r of (10), and a ( 1 γ ) p o w e r of (11), we are able to derive an upper-bound of the tree terms as follows, for: 1 s 1 . . . s u t 1 . . . . t v n ,
| C o v i = 1 u Δ s i , j = 1 v Δ t j | h H ϕ r ( h K ) C n h H ϕ r ( h K ) u + v
Second, we entered the following information for the V a r ( i = 1 n Δ i ) variance term for all 1 i n :
| V a r i = 1 u Δ s i , j = 1 v Δ t j | = 1 n E [ K 1 ] 2 i = 1 n j = 1 n C o v ( K i H i , K j H j ) = 1 n E [ K 1 ] 2 V a r ( K 1 H 1 ) T 1 + 1 n E [ K 1 ] 2 i = 1 n j = 1 , i j n C o v ( K i H i , K j H j ) T 2
For the first term T 1 , we have :
V a r ( K 1 H 1 ) = E ( K 1 2 H 1 2 ) ( E ( K 1 H 1 ) ) 2
Then,
E [ K 1 2 H 1 2 ] = E [ K 1 2 E [ H 1 2 / X 1 ] ]
As a result, considering (P2) and (P3) and integrating over the real component y gives us the following:
E H 1 2 R 1 = O h H
As for all j 1 :
I E [ K 1 j ] = O ϕ x ( h K ) .
Then,
I E [ K 1 2 H 1 2 ] = O ( ϕ x ( h K ) ) .
It follows that:
1 n ( I E [ K 1 ] ) 2 2 V a r ( K 1 H 1 ) = O ( n ϕ x ( h K ) ) .
Regarding the covariance term in ( r e f 26 ) , the following decomposition will be utilized.
i = 1 n j = 1 , i j n C o v ( K i H i , K j H j ) = i = 1 n j = 1 , 0 < i j m n n C o v ( K i H i , K j H j ) I + i = 1 n j = 1 , i j > m n n C o v ( K i H i , K j H j ) I I
where ( m n ) is an infinite series of positive integers as n tends to infinity. The ( P 1 ) - ( P 3 ) and ( P 6 ) Presuppositions
we obtain, if i j :
I n m n max i j E K i H i K j H j + E K 1 H 1 2 C n m n ϕ r 2 h K + ϕ r h K 2 error ϕ r 2 h K .
H and K are both bounded and Lipschitz kernels, we obtain:
I I h K 1 L i p ( K ) + h H 1 L i p ( H ) 2 i = 1 u j = 1 i j > m n v λ i , j C h K 1 L i p ( K ) + h H 1 L i p ( H ) 2 i = 1 u j = 1 i j > m n v λ i , j C n h K 1 L i p ( K ) + h H 1 L i p ( H ) 2 λ m n C n h K 1 L i p ( K ) + h H 1 L i p ( H ) 2 e α m n .
Then, by (20) and (21), we get
j = 1 , i j n C o v ( K i H i , K j H j ) C n m n ( h H 2 ϕ r 2 ( h K ) ) + n ( h K 1 L i p ( K ) + h H 1 L i p ( H ) ) 2 e α m n
By choosing :
m n = log h K 1 Lip ( K ) + h H 1 Lip ( H ) 2 α ϕ r 2 h K
We get :
1 ϕ r ( h K ) j = 1 , i j n C o v ( K i H i , K j H j ) n + 0 .
Finally, we get this result by integrating the previous three results (13), (18), and (22).
V a r i = 1 n Δ i = O 1 n ϕ r ( h K )
Therefore, the criteria of the lemma are satisfied by the variables Δ i , where i = 1 , . . . , n .
K n = C n ϕ r ( h K ) , M n = C n ϕ r ( h K ) , a n d A n = V a r i = 1 n Δ i
Thus,
P F ^ N r ( s ) E F ^ N r ( s ) > η log n n ϕ r h K = P i = 1 n Δ i > η log n n ϕ r h K exp η 2 log n n ϕ r h K Var i = 1 n Δ i + log 5 / 6 n n ϕ r h K ( 7 / 6 ) exp η 2 log n C + ( log n ) 5 / 6 n ϕ r h K ( 7 / 6 ) C exp C η 2 log n
by ( P 7 ) . Last but not least, with an appropriate selection of η , Borel-Cantelli’s lemma makes it possible to conclude the proof of Lemma 1.
The proof of Corollary 1
We have:
| F D R | 1 2 | F D R 1 | > 1 2
Therefor,
P | F D R | 1 2 P | F D R 1 | > 1 2 P | F D X E F D X | > 1 2
For E ( F D R ) = 1 , we apply the result of Lemma 1 we show that :
P | F D R | 1 2 <
The proof of the Corollary 1 has finished.
The proof of Lemma 2
We have :
E F ^ N R ( S ) F R ( S ) = 1 n E K 1 i = 1 n E K i H i ( y ) F R ( S ) = 1 E K 1 E K 1 H 1 s S i h H F R ( S ) = 1 E K 1 E K 1 E H 1 h H 1 s S i / R F R ( S )
We derive the following by making use of the stationarity of the data, the conditioning by the variable that is doing the explaining, and the customary change in the variable t = s u h H :
E H 1 h H 1 s S i / R = R H s u h H f R ( u ) d u = R H ( 1 ) ( t ) F R s h H t d t
and we deduce ,
E H 1 h H 1 s S i / R F R ( s ) = R H ( 1 ) ( t ) F R s h H t d t F R ( s ) R H ( 1 ) ( t ) F R s h H t F R ( s ) d t
So, by (H2) we get
E H 1 h H 1 s S i / R F R ( s ) A R R H ( 1 ) h K b 1 + | t | b 2 h H b 2 d t
This inequality holds true everywhere in y and after substituting in (29) and simplifying the expression E K 1 we get that:
E F ^ N R ( s ) F R ( s ) A R h K b 1 R H ( 1 ) ( t ) d t + h H b 2 R | t | b 2 H ( 1 ) ( t ) d t
In conclusion, the evidence of Lemma 2 is provided by Hypothesis (P4) and Corollary 1.
Substituting for ψ ( . , . ) in Lemma 1’s proof of Lemma 3 yields a simple application of Lemma 3.
ψ ( R i ) = K ( h K 1 d ( r , R i ) ) E [ K 1 ] , R i H .
The proof of Theorem 1 has finished. □

Author Contributions

Author Contributions Conceptualization, Z.C.M. and D.H.; methodology, Z.C.M. and D.H.; software, D.H.; validation, Z.C.M. and D.H.; formal analysis, Z.C.M., F. A. and D.H.; investigation, Z.C.M., F. A. and D.H.; resources, Z.C.M., F. A. and D.H.; data curation, D.H.; writing—original draft preparation, D.H.; writing—review and editing, Z.C.M., F. A. and D.H.; visualization, D.H.; supervision, Z.C.M. and F. A.; project administration, Z.C.M.; funding acquisition, Z.C.M. and F.A.; All authors have read and agreed to the published version of the manuscript.

Funding

This research project was funded by (1) Princess Nourah bint Abdulrahman University Researchers Supporting Project Number (PNURSP2023R358), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia, and (2) The Deanship of Scientific Research at King Khalid University this work through Small group Research Project under grant number R.G.P. 1/366/44.

Informed Consent Statement

Not applicable

Data Availability Statement

Not applicable

Data Availability Statement

Not applicable

Acknowledgments

The authors express their gratitude to the Editors, the Associate Editor, and the anonymous reviewers for their insightful comments and suggestions, which significantly enhanced the overall quality of a previous iteration of this manuscript. The authors acknowledge the project funding: Princess Nourah bint Abdulrahman University Researchers Supporting Project Number (PNURSP2023R358), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia; The Deanship of Scientific Research at King Khalid University through the Research Groups Program under grant number R.G.P. 1/366/44.

Conflicts of Interest

No conflict of interest.

References

  1. Araujo, A.; Giné, E. The Central Limit Theorem for Real and Banach Valued Random Variables; Wiley Series in Probability and Mathematical Statistics; JohnWiley and Sons: New York, NY, USA; Chichester, UK; Brisbane, Australia, 1980; p. xiv+233. [Google Scholar]
  2. Aït Saidi, A.; Ferraty, F.; Kassa, R. Cross-validated estimation inthe single functional index model. Statistics 2008, 42, 475–494. [Google Scholar] [CrossRef]
  3. Aneiros, G.; Cao, R.; Fraiman, R.; Genest, C.; Vieu, P. Recent advances in functional data analysis and high-dimensional statistics. . Multivar. Anal 2019, 170, 3–9. [Google Scholar] [CrossRef]
  4. Attaoui, S.; Laksaci, A.; Ould-Saïd, E. A note on the conditional densityestimate in the single functional index model. Statist. Probab. 2011, 81, 45–53. [Google Scholar] [CrossRef]
  5. Attaoui, S.; Laksaci, A.; Ould-Saïd, E. Asymptotic Results for an Mestimatorof the Regression Function for Quasi-Associated Processes. FunctionalStatistics and Applications. Contributions to Statistics 2015. [Google Scholar] [CrossRef]
  6. Attaoui, S.; Ling, N. Asymptotic results of a nonparametric conditionalcumulative distribution estimator in the single functional indexmodeling for time series data with applications. Metrika 2016, 79, 485511. [Google Scholar] [CrossRef]
  7. Bulinski, A.; Suquet, C. Normal approximation for quasi-associatedrandom fields. Statist. Probab. Lett 2001, 54, 215–226. [Google Scholar] [CrossRef]
  8. Bulinski, A.; Shabanovich, E. Asymptotical behaviour of some functionalsof positively and negatively dependent random fields. (in Russian). Fundam. Appl. Math 1998, 4, 479–492. [Google Scholar]
  9. Daoudi, H.; Mechab, B.; Chikr Elmezouar, Z. Asymptotic normality ofa conditional hazard function estimate in the single index for quasi-associateddata, Communications in Statistics - Theory and Methods, 2020 49:3, 513–530.
  10. Daoudi, H.; Mechab, B. , Asymptotic Normality of the Kernel Estimate of Conditional Distribution Function for the quasi-associated data. Pakistan Journal of Statistics and Operation Research 2019, 15(4), 999–1015. [Google Scholar]
  11. Daoudi, H.; Mechab, B.; Benaissa, S.; Rabhi., A. Asymp-totic normality of the nonparametric conditional density function esti-mate with functional variables for the quasi-associated data. Interna-tional Journal of Statistics and Economics 2019, 20(3), 94–106. [Google Scholar]
  12. Doob, J.L. StochasticProcesses, John Wiley and Sons, New York, 1953.
  13. Douge, L. Théorèmes limites pour des variables quasi-associées hilbertiennes. Ann. I.S.U.P, 2010, 54, 51–60. [Google Scholar]
  14. Doukhan, P. Louhichi,S., A new weak dependence condition andapplications to moment inequalities. Stoch. Proc. Appl 1999, 84, 313–342. [Google Scholar] [CrossRef]
  15. Esary, J. D. Proschan,F., Walkup,D.,.Association of random variables with applications. Ann. Math. Statist 1967, 38, 1466–1476. [Google Scholar] [CrossRef]
  16. Ezzahrioui, M.; Ould-Saïd, E. Asymptotic normality of nonparametric estimators of the conditional mode for functional data. Technical report 2005, 249. [Google Scholar] [CrossRef]
  17. Ezzahrioui, M.; Ould-Saïd, E. On the asymptotic properties of a nonparametric estimator of the conditional mode for functional dependent data. Preprint 2008, 277. [Google Scholar]
  18. Ferraty, F.; Laksaci, A.; Tadj, A.; Vieu, P. Rate of uniform consistency for nonparametric estimates with functional variables. J. Statist. Plann. Inference 2010, 140, 335–352. [Google Scholar] [CrossRef]
  19. Ferraty, F.; Peuch, A.; Vieu, P. Modèle à indice fonctionnel simple. C. R. A. S.Mathématiques Paris 2003, 336, 1025–1028. [Google Scholar] [CrossRef]
  20. Ferraty, F.; Rabhi, A.; Vieu, P. Estimation nonparametric de lafonction de hasard avec variable explicative fonctionnelle. Rev. Roumaine Math. Pures Appl 2008, 53, 1–18. [Google Scholar]
  21. Ferraty, F.; Mas, A.; Vieu, P. Advances in nonparametric regression for functional variables, Aust. and New Zeal. J. of Statist 2007, 49, 1–20. [Google Scholar]
  22. Ferraty, F.; Vieu., P. Nonparametric functional data analysis: theoryand practice, Springer Series in Statistics: New York, USA, 2006.
  23. Bouaker, I.; Belguerna, A.; Daoudi, H. The consistency of the kernel estimation of the Function conditional density for associated Data in high-dimensional statistics, Journal of science and arts, 22:2, 247–256, 2022.
  24. Gasser, T.; Hall, P.; Presnell, B. Nonparametric estimation of the mode of a distribution of random curves. J. R. Stat. Soc. Ser. B Stat. Methodol. 1998, 60, 681–691. [Google Scholar] [CrossRef]
  25. Kallabis, R.S.; Neumann, M.H. An exponential inequality under weakdependence. Bernoulli 2006, 12, 333–350. [Google Scholar] [CrossRef]
  26. Kara-Zaitri, L.; Laksaci, A.; Rachdi, M.; Vieu, P. ,Uniform in bandwidth consistency for various kernel estimators involving functional data. J. Nonparametr. Stat 2017, 29, 85–107. [Google Scholar] [CrossRef]
  27. Laksaci, A. Convergence en moyenne quadratique de l’estimateur à noyau de la densité conditionnelle avec variable explicative fonctionnelle. Ann. I.S.U.P 2007, 51, 69–80. [Google Scholar]
  28. Laksaci, A.; Maref, F. Estimation non paramétrique de quantiles conditionnels pour des variables fonctionnelles spatialement dépendantes. C. R. Math. Acad. Sci. Paris 2009, 347, 1075–1080. [Google Scholar] [CrossRef]
  29. Laksaci, A.; Mechab, B. Estimation non-paramétrique de la fonctionde hasard avec variable explicative fonctionnelle : cas des données spatiales. Rev. Roumaine Math. Pures Appl 2010, 55, 35–51. [Google Scholar]
  30. Laksaci,A. ; Mechab, B. Conditional hazard estimate for functionalrandom fields. Journal of Statistical Theory and Practice 2014, 8, 192–200. [Google Scholar] [CrossRef]
  31. Laksaci, A.; Mechab, W. Nonparametric relative regression for associatedrandom variables. Metron 2016. [Google Scholar] [CrossRef]
  32. Matula, P. A note on the almost sure convergence of sums of negatively dependent random variables. Statist. Probab. Lett 1992, 15, 209–213. [Google Scholar] [CrossRef]
  33. Roussas, G. G. Positive and negative dependence with some statistical applications In : Asymptotics, Nonparametrics and Time Series (S. Ghosh,Ed). Marcell Dekker, Inc., New York, 1999; pp. 757–788.
  34. Newman, C.M. , Asymptotic independence and limit theorems for positivelyand negatively dependent random variables : In Inequalities in Statisticsand Probability, IMS Lect. Notes-Monographs Series 1984, 5, 127–140. [Google Scholar]
  35. Quintela-del-RÃ o, A. Hazard function given of functional variable: Nonparametricestimation under strong mixing conditions. J. Nonparametr. Stat 2008, 20, 413–430. [Google Scholar] [CrossRef]
  36. Tabti, H.; Ait Saïdi, A. Estimation and simulation of conditionalhazard function in the quasi-associated framework when the observationsare linked via a functional single-index structure, Commu. Stat. Theory and Methods 2017, 47, 816–838. [Google Scholar]
Figure 1. A sample of 200 curves.
Figure 1. A sample of 200 curves.
Preprints 81357 g001
Figure 2. The QQ-plot of the obtained sample.
Figure 2. The QQ-plot of the obtained sample.
Preprints 81357 g002
Table 1. Kolmogorov-Smirnov analysis.
Table 1. Kolmogorov-Smirnov analysis.
m 1 10
P-value 0.88 0.43
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated