A Kendall Correlation Coefficient Between Functional Data: Dalia Valencia Rosa E. Lillo Juan Romo
A Kendall Correlation Coefficient Between Functional Data: Dalia Valencia Rosa E. Lillo Juan Romo
A Kendall Correlation Coefficient Between Functional Data: Dalia Valencia Rosa E. Lillo Juan Romo
https://doi.org/10.1007/s11634-019-00360-z
REGULAR ARTICLE
Abstract
Measuring dependence is a very important tool to analyze pairs of functional data.
The coefficients currently available to quantify association between two sets of curves
show a non robust behavior under the presence of outliers. We propose a new robust
numerical measure of association for bivariate functional data. We extend in this paper
Kendall coefficient for finite dimensional observations to the functional setting. We
also study its statistical properties. An extensive simulation study shows the good
behavior of this new measure for different types of functional data. Moreover, we
apply it to establish association for real data, including microarrays time series in
genetics.
1 Introduction
Many processes currently used in different fields of science and research lead to
random observations that can be analyzed as curves. We can also find a large amount
of data for which it would be more appropriate to use some interpolation techniques
and consider them as functional data. This approach turns out to be essential when
data have been observed at different time intervals. Examples of functional data are
B Dalia Valencia
djvalega@gmail.com
Rosa E. Lillo
rosaelvira.lillo@uc3m.es
Juan Romo
juan.romo@uc3m.es
1 Department of Statistics, Universidad Carlos III de Madrid, Calle Madrid 126, Madrid, Spain
2 Department of Statistics, UC3M-Santander Big Data Institute, Universidad Carlos III de Madrid,
Madrid, Spain
123
1084 D. Valencia et al.
found in areas such as meteorology, where, for example, the ozone level measured
during a day is a curve; finance, where, for example, an asset price takes values at very
close time instants, and medicine, where the observed gene expressions over time can
also be considered as realizations of random curves.
Several multivariate methods have been extended to functional data. Multivariate
techniques such as regression (Cardot et al. 1999; He et al. 2000), analysis of variance
(Cuevas et al. 2004; Delicado 2007), principal components (Pezulli and Silverman
1993), generalized linear model (Escabias et al. 2004) and depth (López-Pintado and
Romo 2007, 2009) have already been extended to a functional context. Other useful
methodologies can be found in Ramsay and Silverman (2005). However, there are
still some concepts that have not been fully explored for functional data: measures of
association and dependence structures between curves, for example.
Leurgans et al. (1993) considered the canonical correlation between two sets of
curves. This technique provides a pair of functions called canonical variates and the
sample correlation among these variates leads to the canonical correlation between the
two sets of curves. He et al. (2000) propose an alternative way of finding the canonical
correlation through the extension of multivariate analysis ideas. Opgen-Rhein and
Strimmer (2006) proposed an estimator of the dynamic correlation that provides a
measure of similarity between pairs of functional observations. It is based on the
concept of dynamical correlation introduced by Dubin and Müller (2005) to analyze
a nonparametric method to quantify the covariation of components of multivariate
longitudinal observations.
In this paper, we extend the Kendall τ correlation coefficient to the functional
framework. Kendall’s τ allows dependence to be measured in the bivariate case through
the definition of concordance, which is based on the idea of order. Since there exists
no total order among functions, we use preorders that allow us to sort the functional
observations and count the concordant and discordant pairs of a bivariate sample of
curves. Once a preorder is introduced, the functional τ coefficient can be defined in a
way similar to the bivariate τ coefficient. We show that it fulfils natural properties for a
dependence measure and also establish the consistency of the sample version. Finally,
we illustrate with simulated and real data the performance of this new dependence
measure as well as its robustness, which is a basic characteristic of the Kendall τ in
its bivariate version. We analyze a data set corresponding to a microarray time series
from a human T-cell experiment with 58 genes, 10 time points and 44 replications.
We obtain the functional τ for each pair of genes and construct the partial correlation
matrix to compare the gene network resulting from functional τ with those from
dynamical correlation.
This paper is organized as follows. In Sect. 2, the functional τ is defined extending
the concept of concordance for bivariate random variables. Section 3 is devoted to
some properties of this correlation coefficient and to studying convergence results.
A summary of the classic techniques, simulation results and sensitivity analysis are
given in Sect. 4. Section 5 contains a study of dependence using the genes data set.
In Sect. 6, we present a robustness empirical study. Finally, in Sect. 7, we outline the
main conclusions of this paper.
123
A Kendall correlation coefficient between functional data 1085
S
τ̂ = ,
n
2
123
1086 D. Valencia et al.
It follows easily that, for constant functions defined in the same compact interval
I , both preorders are equivalent to the usual ordering on the real line. Given any
preorder definition among functions, we may define the concordance concept between
functions.
It is clear that Un ∈ B. Hence, the U -statistic (5) with a B-valued kernel Φ is called a
U B-statistic. In particular, if B = R, it is called a U R-statistic and if B = H , where
H is a real separable Hilbert space, it is called a U H -statistic.
123
A Kendall correlation coefficient between functional data 1087
The following theorem provides an asymptotic result, which will be very useful in
what follows.
Theorem 1 (Borovskikh 1996, page 73) Assume that the B-valued kernel Φ is such
that E Φ < ∞. Then,
Un → θ a.s n → ∞,
and
E||Un − θ || → 0.
Φ[(xi , yi ), (x j , y j )] = 2I (xi ≺ x j , yi ≺ y j ) + 2I (x j ≺ xi , y j ≺ yi ) − 1,
3 Properties of functional
123
1088 D. Valencia et al.
are a natural extension of the well known properties of the bivariate τ itself (Kendall
1938).
Let (X (t), Y (t)) be a bivariate stochastic process. Then,
1. τ (X (t), Y (t)) = τ (Y (t), X (t)).
2. −1 ≤ τ (X (t), Y (t)) ≤ 1.
3. τ (−X (t), Y (t)) = −τ (X (t), Y (t)).
4. τ (X (t), g(X (t))) = 1, for any monotone increasing function g.
5. τ (X (t), g(X (t))) = −1, for any monotone decreasing g.
6. If X (t) and Y (t) are stochastically independent, then τ (X (t), Y (t)) = 0.
7. The correlation coefficient functional is invariant under strictly increasing and
continuous transformations of the functional variables,
τ̂n → τ a.s. as n → ∞
Φ[(xi , yi ), (x j , y j )] = 2I (xi ≺ x j , yi ≺ y j ) + 2I (x j ≺ xi , y j ≺ yi ) − 1,
which represents the kernel for the τ̂n belongs to the interval [−1, 3]. Then, the func-
tional τ̂ , given in (4) and expressed as the U B-statistic (6), has associated a kernel Φ
such that E Φ is finite. Therefore, from Theorem 1, we have that, if Φ is such that
E Φ < ∞, then the U B-statistic will converge almost surely to the parameter τ .
Observe that Theorem 2 is valid in general for any well-defined preorder ().
To illustrate how the functional τ̂ works in simulated functional samples with
different kinds of dependence, we provide some examples. From now on, τ̂1 , τ̂2 denote
the functional τ̂ when (1) and (2) preorders are considered, respectively. Consider five
joint realizations of the processes X (t) = t 2 + Z 1 and Y (t) = −(t + Z 2 )2 − 8t + Z 2 ,
where (Z 1 , Z 2 ) follow a bivariate standard normal distribution with correlation σ12
representing the random part of the processes. The bivariate functional sample shown
in Fig. 1 was generated with a high value of σ12 close to 1. Each pair of curves is
represented by the same color. In this first case, the ordering for the preorder (1) in
the first group is (red > cyan > green > blue > magenta), and for the second group
123
A Kendall correlation coefficient between functional data 1089
7 2
First
0 First
6 Second
Second
Third
−2 Third
5 Fourth
Fourth
Fifth
−4 Fifth
4
−6
3
−8
2
−10
1 −12
0 −14
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
12 8
First 7 First
10 Second Second
6
Third Third
5
8 fourth fourth
fifth 4 fifth
6 3
2
4
1
0
2
−1
0 −2
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
it is (cyan > green > red > blue > magenta). In both panels, the cyan and green
curves are in the same relative position with respect to the other curves. The blue and
magenta curves are also in the same position in both groups. In this case τ̂1 = 0.6.
For the ordering with preorder (2), in the first group it is (red > cyan > green > blue
> magenta), and for the second group it is (green > cyan > red > blue > magenta).
In both panels, blue and magenta curves are in the same position in the two groups.
At the same time, the remainder of the curves are almost completely ordered in the
opposite way and τ̂2 = 0.4. Figure 2 shows another example where both coefficients
take the same value.
In this Section, we illustrate the performance of the functional τ with both preorders
given in Eqs. 1 and 2, as well as its behavior with respect to other dependence mea-
sures already introduced in the literature. We briefly describe two of them (dynamical
correlation and canonical correlation) and, in order to compare our results with these
dependence measures, we carry out a simulation study.
123
1090 D. Valencia et al.
A commonly used technique to find the correlation between two groups of func-
tions is the dynamical correlation, which is a measure of similarity between two curves.
Dubin and Müller (2005) introduced the dynamical correlation as the following infor-
mal idea: “if both trajectories tend to be mostly on the same side of their time average
(a constant) then the dynamical correlation is positive; if the opposite occurs, then
dynamical correlation is negative”. Opgen-Rhein and Strimmer (2006) proposed an
estimator for the dynamical correlation considering functional data instead of lon-
gitudinal data. We will use the estimator of the dynamical correlation proposed in
Opgen-Rhein and Strimmer (2006), which is a slightly revised version of the dynam-
ical correlation introduced in Dubin and Müller (2005):
1 s
n
ρ̂d = xi (t), yis (t) ,
n−1
i=1
where
x c (t)
x s (t) = ,
n
1
n−1 i=1 xic (t), xic (t)
and x c (t) are functions centered in space and time simultaneously, i.e.,
1
n
x (t) = x(t) − x̄(t), 1,
c
wher e x̄(t) = xi (t),
n
i=1
and · is the usual inner product for functions x(t), y(t) = I x(t)y(t)dt. As we
can see, ρ̂d is an estimator of the population dynamical correlation
ρd = E X S (t), Y S (t) ,
(ξ(s), η(s))
such that ξ X i and ηYi are well correlated with one another and the sample cor-
relation between these variables will be what in Leurgans et al. (1993) was called the
canonical correlation between the two variables or groups. In a formal way, consider
n observed pairs of curves (xi (t), yi (t)) with t in the same finite interval I and all
integrals are taken over I (see Ramsay and Silverman 2005, page 204). Given canon-
ical variates ξ and η, the canonical correlation was defined by Leurgans et al. (1993)
as the sample squared correlation of ξ xi and ηyi , i.e.,
123
A Kendall correlation coefficient between functional data 1091
2
cov ξ xi , ηyi
ρ̂c (ξ, η) = ,
var ξ xi + λ||D 2 ξ ||2 var ηyi + λ||D 2 η||2
2 2
where λ is a positive smoothing parameter and ||D 2 f ||2 = D f , that is, the inte-
grated squared curvature of f that quantifies its roughness. Having a pair of canonical
variables with fairly smooth weight functions and correlations that are not excessively
low is necessarily a good choice for the smoothing parameter. This parameter can be
chosen subjectively, but can also be selected through a cross-validation score if an
automatic procedure is required.
As usual in FDA, calculations related to functional data are performed by observing
the data in certain points of the functions. Hence, it is necessary before introducing
the simulations that are going to be shown in the paper to define the finite dimensional
version of these coefficients and the preorders given in Definition (1). Denote by
t1 , . . . , td the time points from I where the n sampled functions xi = {xi (t)|t ∈
I }, i = 1, . . . , n have been observed. Then,
– x1 (t) m x2 (t) ⇔ max(x1 (t1 ), . . . , x1 (td )) ≤ max(x2 (t1 ), . . . , x2 (td )).
−t1 n−1 −t1
– x1 (t) i x2 (t) ⇔ td2d [x1 (t1 )+x1 (td )+2 i=2 x1 (ti )] ≤ td2d [x2 (t1 )+x2 (td )+
n−1
2 i=2 x2 (ti )].
In the same way we can order the functions yi (t). The last expression corresponds
to the composite trapezoidal rule of numerical integration, which has been used for
calculating the values of the integrals. Then, the coefficients are calculated as
−1
n
n
τ̂ = 2I (xi ≺ x j and yi ≺ y j ) + 2I (x j ≺ xi and y j ≺ yi ) − 1.
2
i< j
123
1092 D. Valencia et al.
The data are simulated in the following way. Consider the bivariate stochastic process
(X (t), Y (t)) = [ f 1 (t, Z 1 ), f 2 (t, Z 2 )], where (Z 1 , Z 2 ) represents the random part of
the process, a bivariate standard normal distribution with correlation σ12 . We consider
a different structure for the functions f i , i = 1, 2 as well as different values for σ12 . In
each case, 50 realizations of the process (X (t), Y (t)) are generated, where the paths
are discretized taking d = 50 points over the interval [0, 1] and calculating the mea-
sures of dependence previously mentioned. This procedure is carried out 100 times
and the results reported refer to the average and deviation over the 100 setups.
Table 2 presents the average of the measures τ̂1 and τ̂2 as well as ρ̂c and ρ̂d , which
denote the canonical correlation and dynamical correlation, respectively. The value in
brackets reports the standard deviation of the measures considered. We also include,
in each case, the value of the correlation σ12 . We can see that the coefficients τ̂1 and
τ̂2 in some cases take different values, which is a consequence of the preorders not
sorting the data in the same way. In the case of processes in which one of them is an
increasing transformation of the other, both coefficients take value 1, which confirms
the perfect dependence between the processes considered. However, this fact does not
occur in the measures used for comparison, see for example rows 3 and 4 in Table 2.
Indeed, the value of ρ̂d in row 4 does not reflect the true dependence between those
processes, which is positive and perfect. Observe that a similar conclusion can be
drawn when the dependence is perfect but negative as may be seen in row 5. There,
only our coefficients were able to capture the negative perfect dependence. Note also
that in the independent case (row 11), our coefficients reflect this fact better than the
other measures. Finally, the standard deviation of τ̂2 in most cases is the smallest
among the other measures.
We can see that the canonical correlation ρ̂c is always positive, which means that it
does not capture the direction of the dependence. This is because it seeks variability
in the two sets of curves that maximize the sample correlation between the pairs
of canonical variates. Dynamical correlation ρ̂d reflects the average of individual
similarities rather than considering the set of curves as a sample of the same population.
This makes the dynamical correlation capture changes at an individual performance
level, while Kendall’s coefficient detects changes at a more general level, since it is
inspired in the bivariate coefficient, which is based on evaluating the relationship of
the two sets of data.
Thus, the functional τ̂ is appropriate to indicate how related two functional variables
are, regardless of the shape of their realizations. This coefficient measures the joint
tendency of the variables to have increasing or decreasing behavior.
In the bivariate case, the decision of which coefficient to use depends on several
factors, such as the type of measurement scale in which each variable is expressed,
the nature of the distributions (continuous or discrete) and if the dependence sought
is linear or nonlinear. Also, each coefficient is designed so that the dependence that it
is able to capture measures different aspects such as concordance and discordance in
the data or the ordering relation of a group of data with respect to another, etc. Having
several coefficients to measure dependency from different points of view is therefore
natural in the bivariate case, as happens in the functional case. The final user of these
measures will make her decisions based on the joint information offered by the pull
of coefficients available to him. In the case of functional data, as in the multivariate
123
Table 2 Dependence measures in simulated data
123
1093
1094 D. Valencia et al.
case, this diversity of coefficients increases because there is no total order of the data,
but partial orders that preclude a canonical definition of the dependency measures.
This is exactly what happens for example in the definition of depth measurements for
functional or multivariate data that, depending on each “idea” of order, originates a
different measure.
As we can see, τ̂ depends on the sample size n and on the number of points d
used to discretize the functions. In order to assess the stability of the functional τ̂ with
respect to (n, d), we perform two sensitivity analysis, using the following two pairs
of stochastic processes:
The first analysis has as its objective to evaluate the sensibility with respect to the
sample size n. In this case, we move n = 25, 50, 100, 150 and 1000 with d = 50
(the number of points to discretize the functions) fixed. This procedure is repeated
100 times and we report their average and standard deviation. Table 3 shows that the
changes in τ̂¯1 , τ̂¯2 are negligible and they are quite stable with respect to the sample
size.
Now, the same scheme is applied to d. Fix n = 50, and consider d =
25, 50, 100, 150 and 1000 points. This procedure is repeated 100 times. Table 4 illus-
trates the sensitivity with respect to d. It is noteworthy that the coefficients present
good stability with respect to the number of points taken to discretize the functions.
We also carry out the sensitivity analysis for other models, but we do not report them
in this work, since we obtain the same conclusions.
We have also smoothed the curves of Table 2 with cubic spline and take the smooth-
ing parameter λ = 0.1, 0.5, and 1. The results are given in Table 5. In general, the
coefficient τ̂2 is more stable than τ̂1 when we vary the smoothing parameter as also
happened when the data are not smoothed.
123
A Kendall correlation coefficient between functional data 1095
λ τ̂1 τ̂2
0.1 0.5 1 0.1 0.5 1
123
1096 D. Valencia et al.
5 Gene data
Existing relationships among genes contain broad information on the structure and
functioning of living beings. Accordingly, the interaction between genes allows us
to understand many life phenomena. These interactions give rise to the construction
of genetic networks. By studying the structural properties of such networks, much
more information may be extracted in order to understand the complex functioning
of living organisms. Different statistical methodologies have been used to estimate
genetic networks, such as graphical models which represent stochastic conditional
dependence between the investigated variables. Graphical Gaussian models and the
Bayesian network are examples of simple graphical models (see, e.g., Whittaker 1990),
but their drawback is that these methods are based on the assumption of identically
and independently distributed variables. Opgen-Rhein and Strimmer (2006) studied
the graphical Gaussian models from the perspective of functional data, where these
two assumptions are not necessary.
Opgen-Rhein and Strimmer (2006) considered the gene expression as a functional
observation, rather than describing the individual time points separately. They built the
networks in the following way: the network nodes are the genes and the correlations
are the connectivity strengths assigned to the edges of the network. They consider the
dynamical correlation introduced in Sect. 4. However, they do not use the dynamical
correlation itself because it represents only marginal dependencies, including indirect
interactions between two variables, since it contains information on the relationships
of each variable with the rest. They use the concept of partial correlation, which
describes the correlation between any two variables i and j, conditioned on all the
other variables, which is the correlation between two variables when the effect of the
other is eliminated. Therefore, if the variables are linearly and conditionally associated,
the partial correlation coefficient is different from zero.
The partial correlation matrix is constructed as follows: Let P = (ρkl ) be the
correlation coefficients, and let Ω be the inverse relationships
Ω = P−1 = (wi j );
−wkl
ρ̃kl = √ and P̃ = (ρ̃kl ).
wkk wll
123
A Kendall correlation coefficient between functional data 1097
To test the significance of these correlations and decide which are significant edges,
they employ a large-scale simultaneous hypothesis testing, the “local fdr”, which is
an empirical Bayes estimator of the false discovery rate proposed by Efron (2004)
and (2005). This method computes the posterior probability for an edge to be present
or absent in the gene network. An important question in the use of this method is
whether we can identify a small percentage of interesting cases that deserve further
investigation. In this study, these cases will be the edges present in the network.
We propose a new form of finding connectivity strengths (edges) using the func-
tional τ̂2 and applying the “local fdr” to investigate valid relations.
In order to compare the results, we use the same pre-processed data as in the paper
by Opgen-Rhein and Strimmer (2006). The data set characterizes the response of a
human T-cell line (Jirkat) to a treatment with PMA and ioconomin. After preprocessing
the time course data, 58 genes measured across 10 time points with 44 replications are
provided, (see Rangel et al. 2004). Data were smoothed with linear spline, taking four
basis functions and a smoothing parameter λ = 0.00001. Table 7 shows the correlation
coefficients including the canonical correlation ρ̂c and dynamical correlation ρ̂d for
some pairs of genes. Note how the correlations vary depending on the coefficient used,
which was considered when we analyzed simulated data in Sect. 4.
In order to compare our results with those obtained by Opgen-Rhein and Strimmer
(2006), we calculate the partial correlation matrix from the correlation matrix found
with the functional τ̂2 and we use the “local fdr” algorithm in GeneNet packages,
available in library R-software, to find whether significant edges are present or absent
in our network, with the same cut-off = 0.2 used for calculating the network with
dynamical correlation.
Figures 3 and 4 show the network proposed by Opgen-Rhein and Strimmer (2006)
and our proposed network, respectively. The network calculated with partial dynamical
correlation contains 15 nodes and 9 edges, whereas the network calculated with partial
functional τ̂2 contains 22 nodes and 12 edges. The common nodes in both networks
are CASP8, SOD1, MAPK9, CDC2 and CCNA.
The advantage of using functional τ̂2 instead of the dynamical correlation studied
in Opgen-Rhein and Strimmer (2006) is that our coefficient identifies relationships
between the variables based on the relative ordering among realizations in each group.
And it is not only based on the shape of individual realizations; our coefficient also
takes into account the temporal evolution of each gene, so it is able to identify additional
and different relationships than those given by the dynamical correlation.
Tables 8 and 9 show the results of partial correlation with dynamical correlation
and partial correlation with functional τ̂2 , respectively, which were found through the
“local fdr” algorithm. In addition, we can see the p value for each of the correlations
as well as the nodes included in the networks.
Finally, to explore the relationship between the dynamical correlation and the func-
tional τ̂2 , we make a regression analysis between the partial dynamical correlation
and partial functional τ̂2 for T-cell data. We obtain a R 2 = 0.0634, which is low and
indicates a weak relationship.
123
1098 D. Valencia et al.
123
A Kendall correlation coefficient between functional data 1099
6 Robustness
123
1100 D. Valencia et al.
We generate 50 curves for the previously studied processes. (Recall that σ12 is the
correlation between the normal random variables Z 1 and Z 2 .)
123
A Kendall correlation coefficient between functional data 1101
Contaminated groups Type of outliers No. outl τ̂1 τ̂2 ρ̂d ρ̂c
Contaminated groups Type of outliers No. outl τ̂1 τ̂2 ρ̂d ρ̂c
123
1102 D. Valencia et al.
Contaminated groups Type of outliers No. outl τ̂1 τ̂2 ρ̂d ρ̂c
of the association measure with respect to its value in the uncontaminated data set. We
compare our results with those obtained by the dynamical correlation and canonical
correlation. We can see that functional τ̂1 and τ̂2 coefficients are not affected by the
presence of shape outliers, while the dynamical correlation and canonical correla-
tion coefficients are sensitive to them. For magnitude outliers and shape–magnitude
outliers, our coefficients present small variations unlike the other coefficients, which
present variations up to 40 percent of the original value. The results are given in
Tables 10, 11 and 12, where the values in bold are those that provide the largest vari-
ation in each of the cases. We can see that the functional τ̂1 as well as the functional
τ̂2 do not present a significant variation, while ρ̂d and ρ̂c give the largest variations in
almost all cases.
7 Conclusions
We have introduced a new numerical dependence measure between two sets of func-
tional data. Our technique is a natural extension of the Kendall τ coefficient when the
data are curves. In order to build this new coefficient, we have also introduced the
concordance concept between pairs of functional data. We have presented examples
of applications showing the usefulness of the new coefficients introduced for both
simulated and real data.
We have compared the performance of our measure with other coefficients, such
as dynamical correlations and canonical correlations. The coefficients presented here
allow us to identify the global dependence between two groups of functional data
regardless of the shape of their realizations. These coefficients are presented as a new
alternative to find dependence between functional data. In addition, this coefficient’s
implementation is straightforward.
123
A Kendall correlation coefficient between functional data 1103
An interesting example with real data is studied. The data set corresponds to a
microarray time series from a human T-cell experiment. We obtain the partial func-
tional τ̂2 for each pair of genes and construct a gene network.
We also study the sensitivity of our coefficients and conclude that these coefficients
present good stability with respect to sample size and the number of points taken
to discretize the functions. However the coefficient defined with the preorder of the
maximun presents less stability when the data are smoothed. In terms of robustness,
our coefficients can be considered quite stable in the presence of functional outliers
in comparison with the measures used as a benchmark.
As one of the referees suggested, it would be very interesting to carry out a deeper
inference study on these new coefficients. In this work, a first approximation with the
boostrap intervals is presented, but an analysis based on hypothesis testing will be the
subject of future work.
References
Borovskikh Y (1996) U-statistics in Banach space. VSP BV, Oud-Beijerland
Cardot H, Ferraty F, Sarda P (1999) Functional linear model. Stat Probab Lett 45:11–22
Cuevas A, Febrero M, Fraiman R (2004) An ANOVA test for functional data. Comput Stat Data Anal
47:111–122
Delicado P (2007) Functional k-sample problem when data are density functions. Comput Stat 22:391–410
Dubin JA, Müller HG (2005) Dynamical correlation for multivariate longitudinal data. J Am Stat Assoc
100:872–881
Efron B (2004) Large-scale simultaneous hypothesis testing: the choice of a null hypothesis. J Am Stat
Assoc 99:96–104
Efron B (2005) Local false discovery rates. Technical report, Department of Statistics, Stanford University
Escabias M, Aguilera A, Valderrama M (2004) Principal components estimation of functional logistic
regression: discussion of two different approaches. J Non Parametr Stat 16(3–4):365–384
Febrero M, Galeano P, González-Manteiga W (2008) Outlier detection in functional data by depth measures,
with application to identify abnormal N O x levels. Envirometrics 19:331–345
He G, Müller HG, Wang JL (2000) Extending correlation and regression from multivariate to functional
data. In: Puri ML (ed) Asymptotics in statistics and probability. VSP, Leiden, pp 197–210
Kendall M (1938) A new measure of rank correlation. Biometrika Trust 30(1/2):81–93
Leurgans SE, Moyeed RA, Silverman BW (1993) Canonical correlation analysis when data are curves. J R
Stat Soc B 55:725–740
López-Pintado S, Romo J (2007) Depth-based inference for functional data. Comput Stat Data Anal
51:4957–4968
López-Pintado S, Romo J (2009) On the concept of depth for functional data. J Am Stat Assoc 104:718–734
Opgen-Rhein R, Strimmer K (2006) Inferring gene dependency networks from genomic longitudinal data:
a functional data approach. REVSTAT 4(1):53–65
Pezulli S, Silverman B (1993) Some properties of smoothed components analysis for functional data.
Comput Stat 8:1–16
Ramsay JO, Silverman BW (2005) Functional data analysis, 2nd edn. Springer, New York
Rangel C, Angus J, Ghahramani Z et al (2004) Modelling T-cell activation using gene expression profiling
and state-space models. Bioinformatics 20:1361–1372
Scarsini M (1984) On measure of concordance. Stochastica 8(3):201–218
Schwabik S, Guoju Y (2005) Topics in Banach space integration. World Scientific Publishing, Singapore
Taylor MD (2007) Multivariate measures of concordance. Ann Inst Stat Math 59:789–806
Taylor MD (2008) Some properties of multivariate measures of concordance. arXiv:0808.3105 [math.PR]
Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, New York
Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps
and institutional affiliations.
123