Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A graphical exploration of the relationship between parasite aggregation indices

R. McVinish School of Mathematics and Physics, University of Queensland r.mcvinish@uq.edu.au Β andΒ  R.J.G. Lester School of Biological Sciences, University of Queensland
Abstract.

The level of aggregation in parasite populations is frequently incorporated into ecological studies. It is measured in various ways including variance-to-mean ratio, mean crowding, the kπ‘˜kitalic_k parameter of the negative binomial distribution and indices based on the Lorenz curve such as the Gini index (Poulin’s D) and the Hoover index. Assuming the frequency distributions follow a negative binomial, we use contour plots to clarify the relationships between aggregation indices, mean abundance and prevalence. The contour plots highlight the nonlinear nature of the relationships between these measures and suggest that correlations are not a suitable summary of these relationships.

Key words and phrases:
Aggregation, Gini index, Hoover index, Lorenz order, Negative binomial distribution, Prevalence.

1. Introduction

Investigations into parasite population dynamics frequently require an indicator of the level of aggregation in the parasite population (Tinsley et al, 2020, Kura et al, 2022). As the concept of aggregation in parasites is poorly defined (Pielou, 1977, McVinish and Lester, 2020), aggregation has been measured in various ways. Commonly used indices include prevalence, the Variance-to-Mean Ratio (VMR), and the kπ‘˜kitalic_k parameter of the negative binomial distribution. Closely related to VMR and kπ‘˜kitalic_k are mean crowding and patchiness (Lloyd, 1967) which can be seen as more direct measures of the competitive experience of parasites within a host (Wade et al, 2018). Two other indices are derived from the Lorenz curve (Lorenz, 1905), the most widely accepted quantification of inequality. Poulin (1993) proposed using the Gini index (Gini, 1914), which has since become widely used in parasitology (RodrΓ­guez-HernΓ‘ndez et al, 2021, Bezerra and Bocchiglieri, 2023, Matos et al, 2023). The Hoover index (aka Pietra index) has more recently been proposed to measure parasite aggregation (McVinish and Lester, 2020, Lester and Blomberg, 2021).

This paper clarifies and extends our previous work on aggregation. It was stimulated by a recent paper by Morrill et al (2023) which correlated aggregation indices with mean abundance and prevalence using simulated data. We present a more accurate representation using β€˜contour plots’, calculated directly from the parameters of the negative binomial distributions. The plots provide a simple and more insightful way to comprehend the relationships.

2. Contour plots

The contours show combinations of two indices, specified on the vertical and horizontal axes, that give rise to similar values of the third index. Contour plots, developed in the 16th century (Morato-Moreno, 2017), are widely used in other disciplines but rarely in parasitology (e.g. Kura et al (2022)).

Our analysis assumed that parasite burden is adequately modelled by a negative binomial distribution (Crofton, 1971, Shaw et al, 1998, Poulin, 2011, Morrill et al, 2023). Following the typical practice in parasitology, we parameterised the negative binomial distribution in terms of mean abundance, mπ‘šmitalic_m, and the parameter kπ‘˜kitalic_k which controls the shape of the distribution. We did not make any assumption on the distribution of mπ‘šmitalic_m and kπ‘˜kitalic_k. We used the range of values for mπ‘šmitalic_m and kπ‘˜kitalic_k suggested by the extensive data of Shaw and Dobson (1995). Their values for mπ‘šmitalic_m, kπ‘˜kitalic_k and prevalence are superimposed on several of the contour plots as dot points.

To construct a contour plot of an aggregation index against mπ‘šmitalic_m and kπ‘˜kitalic_k, we expressed the aggregation index as a function of mπ‘šmitalic_m and kπ‘˜kitalic_k. The population values of several indices can be expressed simply in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k:

prevalence=1βˆ’(k/(k+m))k,prevalence1superscriptπ‘˜π‘˜π‘šπ‘˜\text{prevalence}=1-(k/(k+m))^{k},prevalence = 1 - ( italic_k / ( italic_k + italic_m ) ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ,

VMR=1+m/kVMR1π‘šπ‘˜\text{VMR}=1+m/kVMR = 1 + italic_m / italic_k , mean crowding=m+m/kmean crowdingπ‘šπ‘šπ‘˜\text{mean crowding}=m+m/kmean crowding = italic_m + italic_m / italic_k, and patchiness=1+1/kpatchiness11π‘˜\text{patchiness}=1+1/kpatchiness = 1 + 1 / italic_k. The Gini and Hoover indices lack simple expressions in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k however, they can still be evaluated numerically. The Hoover index can be expressed in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k by applying Arnold and Sarabia (2018, Lemma 5.3.3),

H=F⁒(m;k,m)βˆ’F⁒(mβˆ’1;k+1,m+m/k),π»πΉπ‘šπ‘˜π‘šπΉπ‘š1π‘˜1π‘šπ‘šπ‘˜H=F(m;k,m)-F(m-1;k+1,m+m/k),italic_H = italic_F ( italic_m ; italic_k , italic_m ) - italic_F ( italic_m - 1 ; italic_k + 1 , italic_m + italic_m / italic_k ) ,

where F⁒(x;m,k)𝐹π‘₯π‘šπ‘˜F(x;m,k)italic_F ( italic_x ; italic_m , italic_k ) is the cumulative distribution function of the negative binomial distribution with kπ‘˜kitalic_k and mean mπ‘šmitalic_m evaluated at xπ‘₯xitalic_x. Further details are given in the Appendix. The cumulative distribution function of the negative binomial distribution, F𝐹Fitalic_F, is available in statistical packages such as R (R Core Team, 2023). The Gini index can be expressed as

G=(1+mk)2⁒F1⁒(k+1,12,2;βˆ’4⁒mk⁒(1+mk)),𝐺subscript1π‘šπ‘˜2subscript𝐹1π‘˜11224π‘šπ‘˜1π‘šπ‘˜G=\left(1+\frac{m}{k}\right)\,_{2}F_{1}\left(k+1,\frac{1}{2},2;-4\frac{m}{k}% \left(1+\frac{m}{k}\right)\right),italic_G = ( 1 + divide start_ARG italic_m end_ARG start_ARG italic_k end_ARG ) start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k + 1 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG , 2 ; - 4 divide start_ARG italic_m end_ARG start_ARG italic_k end_ARG ( 1 + divide start_ARG italic_m end_ARG start_ARG italic_k end_ARG ) ) ,

where F12subscriptsubscript𝐹12{}_{2}F_{1}start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the Gaussian hypergeometric function (Ramasubban, 1958, equation 2.12). This can be evaluated in R using the hypergeo package (Hankin, 2015). Calculating indices directly from the parameters of the negative binomial distribution rather than using simulated data obviates the need to consider the uncertainty of estimates and the effects of different sample sizes.

We also employed contour plots to examine the relationship between aggregation indices, kπ‘˜kitalic_k, and prevalence. This required first solving the equation

prevalence=1⁒–⁒(k/(k+m))kprevalence1–superscriptπ‘˜π‘˜π‘šπ‘˜\text{prevalence}=1–(k/(k+m))^{k}prevalence = 1 – ( italic_k / ( italic_k + italic_m ) ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT

in terms of kπ‘˜kitalic_k for each pair of mπ‘šmitalic_m and prevalence in the contour plot. This equation has a unique solution if m+ln⁑(1βˆ’prevalence)>0π‘š1prevalence0m+\ln(1-\text{prevalence})>0italic_m + roman_ln ( 1 - prevalence ) > 0. On the other hand, if m+ln⁑(1βˆ’prevalence)<0π‘š1prevalence0m+\ln(1-\text{prevalence})<0italic_m + roman_ln ( 1 - prevalence ) < 0, there is no solution to the equation. The solution was found numerically using the uniroot function in R. The expressions for the aggregation indices in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k is then used to construct the contour plot. Regions of mπ‘šmitalic_m and prevalence that are inconsistent with a negative binomial distribution are represented as white in the contour plot.

All contour plots were produced in R using the ggplot2 package (Wickham, 2016). The values for mπ‘šmitalic_m and kπ‘˜kitalic_k reported in Shaw and Dobson (1995) were heavily skewed and spanned several orders of magnitude with mπ‘šmitalic_m ranging between 0.1 and 5200 and kπ‘˜kitalic_k ranging between 0.001 and 16.5. To make the plots clearer, log scaling has been applied to these variables.

3. Relationship between mean abundance, kπ‘˜kitalic_k, and prevalence

The relationship between mπ‘šmitalic_m, kπ‘˜kitalic_k, and prevalence in wild parasite populations has been examined by several authors with conflicting results (Pennycuick, 1971, Scott, 1987, Poulin, 1993, Shaw and Dobson, 1995, Kura et al, 2022). While the expression of prevalence in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k is sufficiently simple to analyse, it is still instructive to construct the contour plot (Fig. 1 left). In it, each colour represents a region of values of mπ‘šmitalic_m and kπ‘˜kitalic_k that give rise to similar values of prevalence.

We see that prevalence is increasing in both mπ‘šmitalic_m and kπ‘˜kitalic_k leading to contours that are roughly L-shaped on the range of mπ‘šmitalic_m and kπ‘˜kitalic_k plotted so prevalence is small when either mπ‘šmitalic_m or kπ‘˜kitalic_k are small, and prevalence is large when both mπ‘šmitalic_m and kπ‘˜kitalic_k are large. The contours also show that there is a non-linear relationship between mπ‘šmitalic_m and kπ‘˜kitalic_k when prevalence is considered fixed. The contours become almost parallel to the horizontal axis as kπ‘˜kitalic_k increases, a consequence of limkβ†’βˆžprevalence=1βˆ’eβˆ’msubscriptβ†’π‘˜prevalence1superscriptπ‘’π‘š\lim_{k\to\infty}\text{prevalence}=1-e^{-m}roman_lim start_POSTSUBSCRIPT italic_k β†’ ∞ end_POSTSUBSCRIPT prevalence = 1 - italic_e start_POSTSUPERSCRIPT - italic_m end_POSTSUPERSCRIPT. On the other hand, the contours continue to move left as mπ‘šmitalic_m increases, a consequence of limmβ†’βˆžprevalence=1subscriptβ†’π‘šprevalence1\lim_{m\to\infty}\text{prevalence}=1roman_lim start_POSTSUBSCRIPT italic_m β†’ ∞ end_POSTSUBSCRIPT prevalence = 1. The contour plot shows that the rate at which prevalence approaches one as mπ‘šmitalic_m increases is slow when kπ‘˜kitalic_k is small.

If we restrict our attention to a single-coloured band, i.e. those values of mπ‘šmitalic_m and kπ‘˜kitalic_k giving rise to similar values of prevalence, we see that, after controlling for prevalence, there is a negative relationship between mπ‘šmitalic_m and kπ‘˜kitalic_k. This relationship is forced by the negative binomial distribution, so it will hold true in natural systems to the extent that those systems are well modelled by the negative binomial distribution. The different widths of the contour lines show the non-linearity of the relationship between mπ‘šmitalic_m, kπ‘˜kitalic_k and prevalence.

The dot points represent estimates of mπ‘šmitalic_m and kπ‘˜kitalic_k from the 269 parasite-host systems reported in Shaw and Dobson (1995). Although several parasite-host systems lie in a region of very high prevalence (both mπ‘šmitalic_m and kπ‘˜kitalic_k large) or very small prevalence (either mπ‘šmitalic_m or kπ‘˜kitalic_k small), many others occupy a region of the parameter space where a moderate change in the parameter values would result in a significant change in prevalence assuming a negative binomial distribution.

As Shaw & Dobson reported prevalences in their review, it is possible to compare these with the prevalence values implied by the negative binomial distribution (Fig. 1 right). In general, there is good agreement; most points within a given contour having the same colour. This demonstrates the accuracy of the contour plots to interpret relationships in real life situations. The few points where the observed prevalences don’t agree with that determined by the negative binomial could be because these distributions did not conform to a negative binomial.

Refer to caption
Figure 1. (Left) Contour plot showing prevalence levels from zero to 1 for values of mπ‘šmitalic_m and kπ‘˜kitalic_k. Each colour band represents a region of values of mπ‘šmitalic_m and kπ‘˜kitalic_k that give rise to similar values of prevalence. The axes for both the mπ‘šmitalic_m and kπ‘˜kitalic_k are on the log scale. Dot points are actual data from Shaw and Dobson (1995). (Right) Scatter plot of mean against kπ‘˜kitalic_k with prevalence data from Shaw & Dobson. Lighter colours indicate higher prevalence level. The lines are from the contour plot (left). In general, there is good agreement between the observed prevalences and the contours. Exceptions may be from samples that did not conform to a negative binomial.

4. Relationship of Hoover & Gini indices with mean abundance, kπ‘˜kitalic_k, and prevalence

Contour plots of the Hoover index and Gini index as functions of mπ‘šmitalic_m and kπ‘˜kitalic_k are shown in Fig 2 left and right. The contour plots are qualitatively very similar and share some similarities with the contour plot of prevalence (Fig. 1). Both Hoover and Gini indices decrease in both mπ‘šmitalic_m and kπ‘˜kitalic_k, taking values close to one when either mπ‘šmitalic_m or kπ‘˜kitalic_k were small, and taking values close to zero when both mπ‘šmitalic_m and kπ‘˜kitalic_k were large. The contours are L-shaped becoming almost parallel to the horizontal axis as mπ‘šmitalic_m increases and almost parallel to the vertical axis as kπ‘˜kitalic_k increases.

Refer to caption
Figure 2. Contour plots of Hoover index (left) and Gini index (right) as functions of mπ‘šmitalic_m and kπ‘˜kitalic_k. Contour lines are shown from 0 to 1, i.e. from least aggregated (darkest band) to most aggregated (yellow band). Comparing the graphs, the Gini index is always larger than the Hoover index and has a smaller range over the region of values for mπ‘šmitalic_m and kπ‘˜kitalic_k observed in wild populations (dot points)

The plots show both indices display some stability over a wide range of mπ‘šmitalic_m and kπ‘˜kitalic_k. Restricting our attention to the Hoover index (Figure 2 left), we see that for m>5π‘š5m>5italic_m > 5 the value of the index is largely determined by the size of kπ‘˜kitalic_k. For m<5π‘š5m<5italic_m < 5 the value is less affected by kπ‘˜kitalic_k but more affected by mπ‘šmitalic_m, as indicated by the number of contours crossed as mπ‘šmitalic_m decreases. For example, starting from k=1π‘˜1k=1italic_k = 1 and m=6π‘š6m=6italic_m = 6, as mπ‘šmitalic_m decreases the value of the index increases quickly crossing several contours from 0.4 to 1. On the other hand, when mπ‘šmitalic_m increases from the same point (1,6) the index stays in the same colour band and there is little change in the Hoover value (0.4 to 0.5). For many of the parasite-host systems reported in Shaw and Dobson (1995), shown on the figure as dot points, an increase in mπ‘šmitalic_m, that is moving the points vertically on the contour plot, does not appear to impact the Hoover index since the point would remain in the same-coloured region. On the other hand, in many of the samples, a moderate change in kπ‘˜kitalic_k, that is moving the point horizontally, has a large impact on the Hoover index. Similar behaviour is observed in the contour plot of the Gini index (Fig. 2 right), with the Gini index appearing to be even less affected by changes in mπ‘šmitalic_m.

There are two noticeable differences between the contour plots for the Hoover and Gini indices (Fig. 2). Firstly, the Gini index is always larger than the Hoover index (Taguchi, 1968) (Arnold and Sarabia, 2018, Section 5.7). This causes the Gini index to have a smaller range over the region of values for mπ‘šmitalic_m and kπ‘˜kitalic_k observed in wild populations. Specifically, for the values of mπ‘šmitalic_m and kπ‘˜kitalic_k reported in Shaw and Dobson (1995), the Gini index exceeds 0.9 in 42% (113/269) of cases compared to 20% (54/269) of cases exceeding 0.9 Hoover index. Second, the contours of the Hoover index are not smooth, unlike those of the Gini index. The bumps that occur on the contours of the Hoover index occur at integer values of the mean, the most prominent occurring when the mean is 1. These bumps quickly become much less noticeable as the mean increases.

The contour plots of the Gini and Hoover indices exhibit greater differences when considered as functions of mπ‘šmitalic_m and prevalence (Fig. 3). First, unlike the Gini index, the contour lines of the Hoover index are parallel to the vertical axis when mπ‘šmitalic_m is less than one. As noted by Morrill et al (2023), when all infected hosts harbour infrapopulations larger than or equal to the overall mean, the Hoover index is equal to one minus prevalence. For the negative binomial distribution, this implies the Hoover index is equal to one minus prevalence when the mean is less than or equal to one. Second, there is less variability in the widths of the contours for the Hoover index compared to the Gini index. This suggests the dependence of the Hoover index on prevalence is more regular. At a given mπ‘šmitalic_m, a change of 0.1 in the prevalence will have roughly the same effect on the value of the Hoover index, regardless of the initial value of prevalence. In contrast, much of the contour plot of the Gini index is coloured yellow, corresponding to values greater than 0.9. Values of the Gini index less than 0.6 are restricted to small region of the plot, indicating that small changes in prevalence in that region will result in a large change in the Gini index.

Refer to caption
Figure 3. Contour plot of Hoover index (left) and Gini index (right) as functions of mπ‘šmitalic_m and prevalence. The contour lines of the Hoover index are parallel to the vertical axis when mπ‘šmitalic_m is less than one. The Gini index is less constrained. However, with means above one, the Hoover index is more evenly spread than the Gini index.

The parasite data from Shaw and Dobson were taken from five taxonomic groups. The data, divided into taxa, were superimposed on the plots of mπ‘šmitalic_m vs kπ‘˜kitalic_k with contour lines of prevalence and Hoover index. They did not show any obvious grouping.

5. Lorenz order and the negative binomial distribution

Both the Hoover and Gini indices are seen in Figure 2 to be decreasing functions of mπ‘šmitalic_m and kπ‘˜kitalic_k, as is 1 - prevalence (Figure 1). This behaviour is due to how these indices relate to the Lorenz curve and how the parameters mπ‘šmitalic_m and kπ‘˜kitalic_k affect the Lorenz curve of the negative binomial distribution.

The Lorenz curve of a distribution with cumulative distribution function F𝐹Fitalic_F is given by

L⁒(u)=∫0uFβˆ’1⁒(y)⁒𝑑ym,u∈[0,1],formulae-sequence𝐿𝑒subscriptsuperscript𝑒0superscript𝐹1𝑦differential-dπ‘¦π‘šπ‘’01L(u)=\frac{\int^{u}_{0}F^{-1}(y)\,dy}{m},\quad u\in[0,1],italic_L ( italic_u ) = divide start_ARG ∫ start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_y ) italic_d italic_y end_ARG start_ARG italic_m end_ARG , italic_u ∈ [ 0 , 1 ] ,

where mπ‘šmitalic_m is the mean of the distribution and Fβˆ’1⁒(x)=supx{x:F⁒(x)≀y}superscript𝐹1π‘₯subscriptsupremumπ‘₯conditional-setπ‘₯𝐹π‘₯𝑦F^{-1}(x)=\sup_{x}\{x:F(x)\leq y\}italic_F start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x ) = roman_sup start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT { italic_x : italic_F ( italic_x ) ≀ italic_y } for y∈(0,1)𝑦01y\in(0,1)italic_y ∈ ( 0 , 1 ) (Gastwirth, 1971). In our context, the Lorenz curve describes the proportion u𝑒uitalic_u of the host population that is infected with a proportion L⁒(u)𝐿𝑒L(u)italic_L ( italic_u ) of the parasite population. When all hosts have the same parasite burden, the Lorenz curve is given by L⁒(u)=u𝐿𝑒𝑒L(u)=uitalic_L ( italic_u ) = italic_u for all u𝑒uitalic_u in [0,1]. This is called the egalitarian line. Several indices can be defined in terms of the Lorenz curve. Specifically, the Gini index is twice the area between the Lorenz curve and the egalitarian line, and the Hoover index is the greatest vertical distance between the Lorenz curve and the egalitarian line. Even 1βˆ’prevalence1prevalence1-\text{prevalence}1 - prevalence can be viewed as the largest value of u𝑒uitalic_u such that L⁒(u)=0𝐿𝑒0L(u)=0italic_L ( italic_u ) = 0. The Lorenz curve induces a partial ordering of distributions. Assume FAsubscript𝐹𝐴F_{A}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT and FBsubscript𝐹𝐡F_{B}italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT are two distribution functions with finite means. If the Lorenz curve of FAsubscript𝐹𝐴F_{A}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is greater than the Lorenz curve of FBsubscript𝐹𝐡F_{B}italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT for all u𝑒uitalic_u, then we say that FAsubscript𝐹𝐴F_{A}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is smaller than FBsubscript𝐹𝐡F_{B}italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT in the Lorenz order and write FA≀FBsubscript𝐹𝐴subscript𝐹𝐡F_{A}\leq F_{B}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≀ italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT. This ordering corresponds to the notion of aggregation put forward by Poulin (1993), McVinish and Lester (2020). From their connections with the Lorenz curve, we see that if FA≀FBsubscript𝐹𝐴subscript𝐹𝐡F_{A}\leq F_{B}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ≀ italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT, then the Gini and Hoover indices as well as 1βˆ’prevalence1prevalence1-\text{prevalence}1 - prevalence will be smaller for FAsubscript𝐹𝐴F_{A}italic_F start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT than for FBsubscript𝐹𝐡F_{B}italic_F start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT.

The following result shows that the negative binomial distribution decreases in the Lorenz order as mπ‘šmitalic_m increases and as kπ‘˜kitalic_k increases.

Theorem 1.

Let 𝖭𝖑⁒(m,k)π–­π–‘π‘šπ‘˜\mathsf{NB}(m,k)sansserif_NB ( italic_m , italic_k ) denote the negative binomial distribution with parameters mπ‘šmitalic_m and kπ‘˜kitalic_k. If m1<m2subscriptπ‘š1subscriptπ‘š2m_{1}<m_{2}italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, then

𝖭𝖑⁒(k,m2)≀L𝖭𝖑⁒(k,m1).subscriptπΏπ–­π–‘π‘˜subscriptπ‘š2π–­π–‘π‘˜subscriptπ‘š1\mathsf{NB}(k,m_{2})\leq_{L}\mathsf{NB}(k,m_{1}).sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≀ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

If k1<k2subscriptπ‘˜1subscriptπ‘˜2k_{1}<k_{2}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, then

𝖭𝖑⁒(k2,m)≀L𝖭𝖑⁒(k1,m).subscript𝐿𝖭𝖑subscriptπ‘˜2π‘šπ–­π–‘subscriptπ‘˜1π‘š\mathsf{NB}(k_{2},m)\leq_{L}\mathsf{NB}(k_{1},m).sansserif_NB ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_m ) ≀ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT sansserif_NB ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_m ) .

The proof is provided in the Appendix.

The above result explains why Gini and Hoover indices and 1 – prevalence are all decreasing functions of mπ‘šmitalic_m and kπ‘˜kitalic_k. Figure 2 also shows that that the contours of both the Gini and Hoover indices become parallel with the axes. This is due to the limiting behaviour of the negative binomial distribution. Depending on how the parameters are allowed to vary, it is known that the negative binomial distribution will converge to either a Poisson distribution or a gamma distribution (Adell and De La Cal, 1994). Fixing mπ‘šmitalic_m and letting kπ‘˜kitalic_k increase, the negative binomial distribution converges to a Poisson distribution with mean mπ‘šmitalic_m. This causes the contour lines to become parallel with the horizontal axis as kπ‘˜kitalic_k increases. Similarly, fixing kπ‘˜kitalic_k and letting mπ‘šmitalic_m increase, an appropriately scaled negative binomial distribution converges to a Gamma distribution with shape and rate parameters both equal to kπ‘˜kitalic_k. Since the Gini and Hoover indices are scale invariant (Arnold and Sarabia, 2018, Section 3.1), these indices approach their respective values for a Gamma (kπ‘˜kitalic_k, kπ‘˜kitalic_k) distribution as mπ‘šmitalic_m increases. This causes the contour lines to become parallel with the vertical axis as mπ‘šmitalic_m increases.

6. Discussion

In choosing the index to use to measure aggregation, those based on Lorenz curves seem to be the favoured, such as the Hoover and Gini. The Gini returns closer values over a wider range of means, kπ‘˜kitalic_k and prevalence compared to the Hoover, making differences less discernible. The Hoover has a biological interpretation and may be easier to calculate. When mean abundances are below one, the Hoover index has restricted values whereas the Gini has no such restriction, suggesting that Gini may be preferred in such a situation. Nevertheless, both indices provide a figure that seems to measure the same phenomenon, a phenomenon that is still undefined.

The contour graphs provide an easily interpreted demonstration of the effects of the various parameters on the Hoover and Gini indices. These could be deduced by an analysis of the formulae used to calculate the indices but this is not straightforward; indices do not correlate with a particular parameter. When applying an index to compare aggregation between samples or species, it is useful to know which parameter is having the greatest effect on the index. The contour graphs provide the answer.

In producing the graphs we calculated indices directly from the parameters of the negative binomial distribution rather than using simulated data as done by Morrill et al (2023). This obviated the need to consider the uncertainty of estimates and the effects of different sample sizes. Our results demonstrated the deterministic functional relationships between the aggregation indices, and the parameters, mean abundance and prevalence. The relationships were not linear indicating that correlation and principal components analysis may not be the best methods to analyse the relationships (Morrill et al, 2023).

Listing the advantages and disadvantages of Hoover and Gini indices, Morrill et al (2023, Table 2) describe them as having the disadvantages of being β€œstrongly negatively correlated with prevalence” and β€œweakly negatively correlated with mean abundance.” In contrast, the kπ‘˜kitalic_k parameter of the negative binomial distribution and patchiness are described as having the advantages of being β€œnot necessarily correlated with mean abundance” and β€œonly weakly correlated with prevalence.” These comments ignore the fact that the negative binomial distribution, and hence any index computed on that distribution, is completely specified by the mean and prevalence. In other words, the dependence of any index on mean and prevalence is perfectly deterministic. In fact, the dependence on any pair of quantities that can be used to parameterise the negative binomial distribution, like mπ‘šmitalic_m and kπ‘˜kitalic_k is perfectly deterministic.

Morrill et al (2023) argue that the Gini index is to be preferred over the Hoover index on the basis that Hoover index equals one minus prevalence when the mean is less than or equal to one whereas the Gini index has no such restriction. To decide between the Hoover and Gini indices, if one must choose, then the relationship between these indices and mπ‘šmitalic_m, kπ‘˜kitalic_k and prevalence need to be considered more closely. Our contour plots (Fig. 2 & 3) have shown other differences in the behaviour of the Gini and Hoover indices. Compared to the Gini index, the Hoover index has a greater range over the region of values for mπ‘šmitalic_m and kπ‘˜kitalic_k (or prevalence) observed in wild populations and has more regular dependence on prevalence. Given these properties and the Hoover index’s clear biological interpretation, we argue that the Hoover index should be preferred over the Gini index, at least when mπ‘šmitalic_m is greater than one.

Our analysis has used contour plots to examine how the Gini and Hoover indices are affected by changes in mπ‘šmitalic_m, kπ‘˜kitalic_k, and prevalence. This approach could, in principle, be applied to construct contour plots from any three indices, provided two of these can be used to parameterise the negative binomial distribution. For example, one could construct a contour plot of the Gini index as a function of VMR and mean crowding as both mπ‘šmitalic_m and kπ‘˜kitalic_k can be expressed in terms of VMR and mean crowding:

m=mean crowdingβˆ’VMR+1π‘šmean crowdingVMR1m=\text{mean crowding}-\text{VMR}+1italic_m = mean crowding - VMR + 1

and

k=mean crowdingVMRβˆ’1βˆ’1.π‘˜mean crowdingVMR11k=\frac{\text{mean crowding}}{\text{VMR}-1}-1.italic_k = divide start_ARG mean crowding end_ARG start_ARG VMR - 1 end_ARG - 1 .

The contour plot could then be constructed using the expression for the Gini index in terms of mπ‘šmitalic_m and kπ‘˜kitalic_k given in Section 2. Further application of contour plots may unravel other complex relationships in ecological parasitology.

Appendix A Hoover index of the negative binomial distribution

In parasitology the negative binomial distribution is usually parameterised in terms of the the mean mπ‘šmitalic_m and kπ‘˜kitalic_k. The probability mass function is then

f⁒(x;k,m)=(k+xβˆ’1kβˆ’1)⁒(kk+m)k⁒(mk+m)x,xβˆˆβ„•0,formulae-sequence𝑓π‘₯π‘˜π‘šbinomialπ‘˜π‘₯1π‘˜1superscriptπ‘˜π‘˜π‘šπ‘˜superscriptπ‘šπ‘˜π‘šπ‘₯π‘₯subscriptβ„•0f(x;k,m)={k+x-1\choose k-1}\left(\frac{k}{k+m}\right)^{k}\left(\frac{m}{k+m}% \right)^{x},\quad x\in\mathbb{N}_{0},italic_f ( italic_x ; italic_k , italic_m ) = ( binomial start_ARG italic_k + italic_x - 1 end_ARG start_ARG italic_k - 1 end_ARG ) ( divide start_ARG italic_k end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( divide start_ARG italic_m end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT , italic_x ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,

and we write 𝖭𝖑⁒(k,m)π–­π–‘π‘˜π‘š\mathsf{NB}(k,m)sansserif_NB ( italic_k , italic_m ). Let F⁒(β‹…;k,m)πΉβ‹…π‘˜π‘šF(\cdot;k,m)italic_F ( β‹… ; italic_k , italic_m ) denote the cumulative distribution function of the 𝖭𝖑⁒(k,m)π–­π–‘π‘˜π‘š\mathsf{NB}(k,m)sansserif_NB ( italic_k , italic_m ) distribution. The first moment distribution of the 𝖭𝖑⁒(k,m)π–­π–‘π‘˜π‘š\mathsf{NB}(k,m)sansserif_NB ( italic_k , italic_m ) distribution, F(1)⁒(β‹…;k,m)superscript𝐹1β‹…π‘˜π‘šF^{(1)}(\cdot;k,m)italic_F start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( β‹… ; italic_k , italic_m ), is

F(1)⁒(x;k,m)=βˆ‘y≀xy⁒f⁒(y;k,m)m.superscript𝐹1π‘₯π‘˜π‘šsubscript𝑦π‘₯π‘¦π‘“π‘¦π‘˜π‘šπ‘šF^{(1)}(x;k,m)=\frac{\sum_{y\leq x}y\,f(y;k,m)}{m}.italic_F start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( italic_x ; italic_k , italic_m ) = divide start_ARG βˆ‘ start_POSTSUBSCRIPT italic_y ≀ italic_x end_POSTSUBSCRIPT italic_y italic_f ( italic_y ; italic_k , italic_m ) end_ARG start_ARG italic_m end_ARG .

For any non-negative integer xπ‘₯xitalic_x

x⁒f⁒(x)mπ‘₯𝑓π‘₯π‘š\displaystyle\frac{x\,f(x)}{m}divide start_ARG italic_x italic_f ( italic_x ) end_ARG start_ARG italic_m end_ARG =xm⁒(k+xβˆ’1kβˆ’1)⁒(kk+m)k⁒(mk+m)xabsentπ‘₯π‘šbinomialπ‘˜π‘₯1π‘˜1superscriptπ‘˜π‘˜π‘šπ‘˜superscriptπ‘šπ‘˜π‘šπ‘₯\displaystyle=\frac{x}{m}{k+x-1\choose k-1}\left(\frac{k}{k+m}\right)^{k}\left% (\frac{m}{k+m}\right)^{x}= divide start_ARG italic_x end_ARG start_ARG italic_m end_ARG ( binomial start_ARG italic_k + italic_x - 1 end_ARG start_ARG italic_k - 1 end_ARG ) ( divide start_ARG italic_k end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( divide start_ARG italic_m end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT
=xm⁒(k+xβˆ’1)!(kβˆ’1)!⁒x!⁒(kk+m)k⁒(mk+m)xabsentπ‘₯π‘šπ‘˜π‘₯1π‘˜1π‘₯superscriptπ‘˜π‘˜π‘šπ‘˜superscriptπ‘šπ‘˜π‘šπ‘₯\displaystyle=\frac{x}{m}\frac{(k+x-1)!}{(k-1)!x!}\left(\frac{k}{k+m}\right)^{% k}\left(\frac{m}{k+m}\right)^{x}= divide start_ARG italic_x end_ARG start_ARG italic_m end_ARG divide start_ARG ( italic_k + italic_x - 1 ) ! end_ARG start_ARG ( italic_k - 1 ) ! italic_x ! end_ARG ( divide start_ARG italic_k end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( divide start_ARG italic_m end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT
=(k+xβˆ’1)!k!⁒(xβˆ’1)!⁒(kk+m)k+1⁒(mk+m)xβˆ’1absentπ‘˜π‘₯1π‘˜π‘₯1superscriptπ‘˜π‘˜π‘šπ‘˜1superscriptπ‘šπ‘˜π‘šπ‘₯1\displaystyle=\frac{(k+x-1)!}{k!(x-1)!}\left(\frac{k}{k+m}\right)^{k+1}\left(% \frac{m}{k+m}\right)^{x-1}= divide start_ARG ( italic_k + italic_x - 1 ) ! end_ARG start_ARG italic_k ! ( italic_x - 1 ) ! end_ARG ( divide start_ARG italic_k end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_m end_ARG start_ARG italic_k + italic_m end_ARG ) start_POSTSUPERSCRIPT italic_x - 1 end_POSTSUPERSCRIPT
=(k+xβˆ’1)!k!⁒(xβˆ’1)!⁒(k⁒(1+1/k)(k+m)⁒(1+1/k))k+1⁒(m⁒(1+1/k)(k+m)⁒(1+1/k))xβˆ’1absentπ‘˜π‘₯1π‘˜π‘₯1superscriptπ‘˜11π‘˜π‘˜π‘š11π‘˜π‘˜1superscriptπ‘š11π‘˜π‘˜π‘š11π‘˜π‘₯1\displaystyle=\frac{(k+x-1)!}{k!(x-1)!}\left(\frac{k(1+1/k)}{(k+m)(1+1/k)}% \right)^{k+1}\left(\frac{m(1+1/k)}{(k+m)(1+1/k)}\right)^{x-1}= divide start_ARG ( italic_k + italic_x - 1 ) ! end_ARG start_ARG italic_k ! ( italic_x - 1 ) ! end_ARG ( divide start_ARG italic_k ( 1 + 1 / italic_k ) end_ARG start_ARG ( italic_k + italic_m ) ( 1 + 1 / italic_k ) end_ARG ) start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_m ( 1 + 1 / italic_k ) end_ARG start_ARG ( italic_k + italic_m ) ( 1 + 1 / italic_k ) end_ARG ) start_POSTSUPERSCRIPT italic_x - 1 end_POSTSUPERSCRIPT
=((k+1)+(xβˆ’1)+1(k+1)βˆ’1)⁒(k+1k+1+(m+m/k))k+1⁒(m+m/kk+1+(m+m/k))xβˆ’1,absentbinomialπ‘˜1π‘₯11π‘˜11superscriptπ‘˜1π‘˜1π‘šπ‘šπ‘˜π‘˜1superscriptπ‘šπ‘šπ‘˜π‘˜1π‘šπ‘šπ‘˜π‘₯1\displaystyle={(k+1)+(x-1)+1\choose(k+1)-1}\left(\frac{k+1}{k+1+(m+m/k)}\right% )^{k+1}\left(\frac{m+m/k}{k+1+(m+m/k)}\right)^{x-1},= ( binomial start_ARG ( italic_k + 1 ) + ( italic_x - 1 ) + 1 end_ARG start_ARG ( italic_k + 1 ) - 1 end_ARG ) ( divide start_ARG italic_k + 1 end_ARG start_ARG italic_k + 1 + ( italic_m + italic_m / italic_k ) end_ARG ) start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_m + italic_m / italic_k end_ARG start_ARG italic_k + 1 + ( italic_m + italic_m / italic_k ) end_ARG ) start_POSTSUPERSCRIPT italic_x - 1 end_POSTSUPERSCRIPT ,

which is the probability mass function of the 𝖭𝖑⁒(k+1,m+m/k)π–­π–‘π‘˜1π‘šπ‘šπ‘˜\mathsf{NB}(k+1,m+m/k)sansserif_NB ( italic_k + 1 , italic_m + italic_m / italic_k ) distribution evaluated at xβˆ’1π‘₯1x-1italic_x - 1. Hence,

F(1)⁒(x;k,m)=F⁒(xβˆ’1;k+1,m+m/k).superscript𝐹1π‘₯π‘˜π‘šπΉπ‘₯1π‘˜1π‘šπ‘šπ‘˜F^{(1)}(x;k,m)=F(x-1;k+1,m+m/k).italic_F start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( italic_x ; italic_k , italic_m ) = italic_F ( italic_x - 1 ; italic_k + 1 , italic_m + italic_m / italic_k ) .

Arnold and Sarabia (2018, Lemma 5.3.3) states that the Hoover index can be expressed as

H=F⁒(m;k,m)βˆ’F(1)⁒(m;k,m).π»πΉπ‘šπ‘˜π‘šsuperscript𝐹1π‘šπ‘˜π‘šH=F(m;k,m)-F^{(1)}(m;k,m).italic_H = italic_F ( italic_m ; italic_k , italic_m ) - italic_F start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ( italic_m ; italic_k , italic_m ) .

Hence,

H=F⁒(m;k,m)βˆ’F⁒(mβˆ’1;k+1,m+m/k).π»πΉπ‘šπ‘˜π‘šπΉπ‘š1π‘˜1π‘šπ‘šπ‘˜H=F(m;k,m)-F(m-1;k+1,m+m/k).italic_H = italic_F ( italic_m ; italic_k , italic_m ) - italic_F ( italic_m - 1 ; italic_k + 1 , italic_m + italic_m / italic_k ) .

Appendix B Proof of Theorem 1

We first recall the definition of convex order, which is closely related to the Lorenz order (Shaked and Shanthikumar, 2007, subsection 3.A.1).

Definition: For random variables X𝑋Xitalic_X and Yπ‘ŒYitalic_Y such that 𝔼⁒ϕ⁒(X)≀𝔼⁒ϕ⁒(Y)𝔼italic-ϕ𝑋𝔼italic-Ο•π‘Œ\mathbb{E}\,\phi(X)\leq\mathbb{E}\,\phi(Y)blackboard_E italic_Ο• ( italic_X ) ≀ blackboard_E italic_Ο• ( italic_Y ) for all convex functions Ο•:ℝ→ℝ:italic-ϕ→ℝℝ\phi:\mathbb{R}\to\mathbb{R}italic_Ο• : blackboard_R β†’ blackboard_R for which the expectations exist. Then we say that X𝑋Xitalic_X is smaller than Yπ‘ŒYitalic_Y in the convex order, denoted X≀cxYsubscriptcxπ‘‹π‘ŒX\leq_{\rm{cx}}Yitalic_X ≀ start_POSTSUBSCRIPT roman_cx end_POSTSUBSCRIPT italic_Y

The convex order relates to the Lorenz order in the sense that

X𝔼⁒X≀cxY𝔼⁒Ysubscriptcxπ‘‹π”Όπ‘‹π‘Œπ”Όπ‘Œ\frac{X}{\mathbb{E}X}\leq_{\rm{cx}}\frac{Y}{\mathbb{E}Y}divide start_ARG italic_X end_ARG start_ARG blackboard_E italic_X end_ARG ≀ start_POSTSUBSCRIPT roman_cx end_POSTSUBSCRIPT divide start_ARG italic_Y end_ARG start_ARG blackboard_E italic_Y end_ARG

if and only if X≀LYsubscriptπΏπ‘‹π‘ŒX\leq_{L}Yitalic_X ≀ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_Y, provided the expectations exist (Shaked and Shanthikumar, 2007, equation 3.A.33) or (Arnold and Sarabia, 2018, Corollary 3.2.1).

Proof of Theorem 1.

For part (a), let X2βˆΌπ–­π–‘β’(k,m2)similar-tosubscript𝑋2π–­π–‘π‘˜subscriptπ‘š2X_{2}\sim\mathsf{NB}(k,m_{2})italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∼ sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Conditional on X2subscript𝑋2X_{2}italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, let X1βˆΌπ–‘π—‚π—‡π—ˆπ—†π—‚π–Ίπ—…β’(X2,m1/m2)similar-tosubscript𝑋1π–‘π—‚π—‡π—ˆπ—†π—‚π–Ίπ—…subscript𝑋2subscriptπ‘š1subscriptπ‘š2X_{1}\sim\mathsf{Binomial}(X_{2},m_{1}/m_{2})italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ sansserif_Binomial ( italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Then X1βˆΌπ–­π–‘β’(k,m1)similar-tosubscript𝑋1π–­π–‘π‘˜subscriptπ‘š1X_{1}\sim\mathsf{NB}(k,m_{1})italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ). As 𝔼⁒(X1|X2)=(m1/m2)⁒X2𝔼conditionalsubscript𝑋1subscript𝑋2subscriptπ‘š1subscriptπ‘š2subscript𝑋2\mathbb{E}(X_{1}|X_{2})=(m_{1}/m_{2})X_{2}blackboard_E ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = ( italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and 𝔼⁒X1=(m1/m2)⁒𝔼⁒X2𝔼subscript𝑋1subscriptπ‘š1subscriptπ‘š2𝔼subscript𝑋2\mathbb{E}X_{1}=(m_{1}/m_{2})\,\mathbb{E}X_{2}blackboard_E italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) blackboard_E italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, (Arnold and Sarabia, 2018, Theorem 3.4) implies (m1/m2)⁒X2≀LX1subscript𝐿subscriptπ‘š1subscriptπ‘š2subscript𝑋2subscript𝑋1(m_{1}/m_{2})\,X_{2}\leq_{L}X_{1}( italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≀ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Since the Lorenz order is invariant under a change of scale, 𝖭𝖑⁒(k,m2)≀L𝖭𝖑⁒(k,m1)subscriptπΏπ–­π–‘π‘˜subscriptπ‘š2π–­π–‘π‘˜subscriptπ‘š1\mathsf{NB}(k,m_{2})\leq_{L}\mathsf{NB}(k,m_{1})sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≀ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT sansserif_NB ( italic_k , italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ).

For part (b), standard conditioning arguments show that if (N,tβ‰₯0)𝑁𝑑0(N,\ t\geq 0)( italic_N , italic_t β‰₯ 0 ) is a standard Poisson process and Ξ›βˆΌπ–¦π–Ίπ—†π—†π–Ίβ’(Ξ±,Ξ²)similar-toΛ𝖦𝖺𝗆𝗆𝖺𝛼𝛽\Lambda\sim\mathsf{Gamma}(\alpha,\beta)roman_Ξ› ∼ sansserif_Gamma ( italic_Ξ± , italic_Ξ² ) (Gamma distribution with shape parameter α𝛼\alphaitalic_Ξ± and rate parameter β𝛽\betaitalic_Ξ²), then NΞ›βˆΌπ–­π–‘β’(Ξ±,Ξ±/Ξ²)similar-tosubscript𝑁Λ𝖭𝖑𝛼𝛼𝛽N_{\Lambda}\sim\mathsf{NB}(\alpha,\alpha/\beta)italic_N start_POSTSUBSCRIPT roman_Ξ› end_POSTSUBSCRIPT ∼ sansserif_NB ( italic_Ξ± , italic_Ξ± / italic_Ξ² ). Let TiβˆΌπ–¦π–Ίπ—†π—†π–Ίβ’(ki,ki/m)similar-tosubscript𝑇𝑖𝖦𝖺𝗆𝗆𝖺subscriptπ‘˜π‘–subscriptπ‘˜π‘–π‘šT_{i}\sim\mathsf{Gamma}(k_{i},k_{i}/m)italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ sansserif_Gamma ( italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_m ).

It is known that for every convex function Ο•italic-Ο•\phiitalic_Ο•, 𝔼⁒ϕ⁒(Nt)𝔼italic-Ο•subscript𝑁𝑑\mathbb{E}\phi(N_{t})blackboard_E italic_Ο• ( italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) is a convex it t𝑑titalic_t (Schweder, 1982, Proposition 2). If we can show that T1≀c⁒xT2subscript𝑐π‘₯subscript𝑇1subscript𝑇2T_{1}\leq_{cx}T_{2}italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≀ start_POSTSUBSCRIPT italic_c italic_x end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, then the result will follow from Shaked and Shanthikumar (2007, Theorem 3.A.21).

By construction 𝔼⁒T1=𝔼⁒T2𝔼subscript𝑇1𝔼subscript𝑇2\mathbb{E}T_{1}=\mathbb{E}T_{2}blackboard_E italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = blackboard_E italic_T start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Let gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the probability density function of 𝖦𝖺𝗆𝗆𝖺⁒(ki,Ξ²i)𝖦𝖺𝗆𝗆𝖺subscriptπ‘˜π‘–subscript𝛽𝑖\mathsf{Gamma}(k_{i},\beta_{i})sansserif_Gamma ( italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Ξ² start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Then T1≀c⁒xT2subscript𝑐π‘₯subscript𝑇1subscript𝑇2T_{1}\leq_{cx}T_{2}italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≀ start_POSTSUBSCRIPT italic_c italic_x end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT if g2βˆ’g1subscript𝑔2subscript𝑔1g_{2}-g_{1}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT exhibits exactly two sign changes in the sequence +, -, + (Shaked and Shanthikumar, 2007, Theorem 3.A.44). As the log\logroman_log function is increasing, log⁑g2βˆ’log⁑g1subscript𝑔2subscript𝑔1\log g_{2}-\log g_{1}roman_log italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - roman_log italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT has the same sequence of sign changes as g2βˆ’g1subscript𝑔2subscript𝑔1g_{2}-g_{1}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then

log⁑g2⁒(x)βˆ’log⁑g1⁒(x)subscript𝑔2π‘₯subscript𝑔1π‘₯\displaystyle\log g_{2}(x)-\log g_{1}(x)roman_log italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) - roman_log italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x )
=(k2βˆ’1)⁒log⁑xβˆ’k2m⁒xβˆ’((k1βˆ’1)⁒log⁑xβˆ’k1m⁒x)+Cabsentsubscriptπ‘˜21π‘₯subscriptπ‘˜2π‘šπ‘₯subscriptπ‘˜11π‘₯subscriptπ‘˜1π‘šπ‘₯𝐢\displaystyle=(k_{2}-1)\log x-\frac{k_{2}}{m}x-\left((k_{1}-1)\log x-\frac{k_{% 1}}{m}x\right)+C= ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) roman_log italic_x - divide start_ARG italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG italic_x - ( ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - 1 ) roman_log italic_x - divide start_ARG italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_m end_ARG italic_x ) + italic_C
=(k2βˆ’k1)⁒log⁑xβˆ’(k1βˆ’k2)m⁒x+C,absentsubscriptπ‘˜2subscriptπ‘˜1π‘₯subscriptπ‘˜1subscriptπ‘˜2π‘šπ‘₯𝐢\displaystyle=(k_{2}-k_{1})\log x-\frac{(k_{1}-k_{2})}{m}x+C,= ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_log italic_x - divide start_ARG ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_m end_ARG italic_x + italic_C ,

where C𝐢Citalic_C is depends on k1,k2subscriptπ‘˜1subscriptπ‘˜2k_{1},k_{2}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and mπ‘šmitalic_m but not xπ‘₯xitalic_x. There must be at least one sign change since both g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT integrate to one. For k2>k1subscriptπ‘˜2subscriptπ‘˜1k_{2}>k_{1}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT this function is concave so there must be two sign changes. As this function is positive for xβ†’0β†’π‘₯0x\to 0italic_x β†’ 0 and xβ†’βˆžβ†’π‘₯x\to\inftyitalic_x β†’ ∞ when k2>k1subscriptπ‘˜2subscriptπ‘˜1k_{2}>k_{1}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT we have shown T1≀c⁒xT2subscript𝑐π‘₯subscript𝑇1subscript𝑇2T_{1}\leq_{cx}T_{2}italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≀ start_POSTSUBSCRIPT italic_c italic_x end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. This completes the proof. ∎

References

  • Adell and De La Cal (1994) J.A. Adell and J. De La Cal. Approximating gamma distributions by normalized negative binomial distributions. Journal of Applied Probability, 31:391-400, 1994.
  • Arnold and Sarabia (2018) B.C. Arnold and J.M. Sarabia. Majorization and the Lorenz order with applications in applied mathematics and economics. Springer, New York, 2018.
  • Bezerra and Bocchiglieri (2023) R.H.S. Bezerra and A. Bocchiglieri. Ectoparasitic flies of bats (Mammalia: Chiroptera) in urban green areas of northeastern Brazil. Parasitol. Res., 122:117–126, 2023.
  • Crofton (1971) H.D. Crofton. A quantitative approach to parasitism. Parasitology, 62:179–193, 1971.
  • Gastwirth (1971) J.L. Gastwirth. A general definition of the Lorenz curve. Econometrica, 39:1037–1039, 1971.
  • Gini (1914) C. Gini Sulla misura della concentrazione e della variabilita dei caratteri. Atti del R. Instituto Veneto di Scienze, Lettere ed Arti, 73:1203–1248, 1914.
  • Hankin (2015) R.K.S. Hankin. Numerical Evaluation of the Gauss Hypergeometric Function with the hypergeo Package. The R Journal, 7:81-88, 2015.
  • Kura et al (2022) K. Kura, J.E. Truscott, B.S. Collyera, A. Phillips, A. Garbae and R.M Anderson. The observed relationship between the degree of parasite aggregation and the prevalence of infection within human host populations for soil-transmitted helminth and schistosome infections. Trans. R. Soc. Trop. Med. Hyg., 116:1226–1229, 2022.
  • Lester and Blomberg (2021) R.J.G. Lester and S.P. Blomberg. Three methods to measure parasite aggregation using examples from Australian fish parasites. Methods Ecol. Evol., 12:1999–2007, 2021.
  • Lloyd (1967) M. Lloyd, Mean crowding. J. Anim. Ecol., 36:1–30, (1967).
  • Lorenz (1905) M.O. Lorenz, Methods of measuring the concentration of wealth. Publication of the American Statistical Association, 9:209–219, 1905.
  • Matos et al (2023) I. Matos, D. Silva, J. Oliveira, C. GonΓ§alves, R. Alves, N. Pereira, F. Catarino, O.M.C.C. Ameixa, J.A. Sousa, L.F. Rangel, M.J. Santos and C. Ayra-Pardo. Body size-dependent effects on the distribution patterns of phoretic mite species assemblages on Rhynchophorus ferrugineus (Olivier, 1790). Ecology and Evolution, 13:e10338, 2023.
  • McVinish and Lester (2020) R. McVinish and R.J.G Lester. Measuring aggregation in parasite populations. J. R. Soc. Interface, 17:20190886, 2020.
  • Morato-Moreno (2017) Morato-Moreno, Manuel. Origins of the two-dimensional relief representation on some spanish american maps in the sixteenth century. BoletΓ­n de la AsociaciΓ³n de GeΓ³grafos EspaΓ±oles, 73:493-499, 2017.
  • Morrill et al (2023) A. Morrill, R. Poulin and M.R. Forbes. Interrelationships and properties of parasite aggregation measures: a user’s guide. Int. J. Parasitol., 53:763-776, 2023.
  • Pennycuick (1971) P. Pennycuick. Frequency distributions of parasites in a population of three-spined sticklebacks, Gasterosteus aculeatus L., with particular reference to the negative binomial distribution. Parasitology, 63:389-406, 1971.
  • Pielou (1977) E.C. Pielou. The measurement of aggregation. In: Mathematical Ecology. Wiley Interscience, New York, 1977.
  • Poulin (1993) R. Poulin. The disparity between observed and uniform distributions: A new look at parasite aggregation. Int. J. Parasitol., 23:937–944, 1993.
  • Poulin (2011) R. Poulin. Evolutionary Ecology of Parasites: (Second Edition). Princeton University Press, 2011.
  • R Core Team (2023) R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2023.
  • Ramasubban (1958) T.A. Ramasubban. The mean difference and the mean deviation of some discontinuous distributions. Biometrika, 45:549-556, 1958.
  • RodrΓ­guez-HernΓ‘ndez et al (2021) K. RodrΓ­guez-HernΓ‘ndez, P. Álvarez-MendizΓ‘bal, P. Chapa-Vargas, F. Escobar, F. GonzΓ‘lez-GarcΓ­a and D. Santiago-Alarcon. Haemosporidian prevalence, parasitaemia and aggregation in relation to avian assemblage life history traits at different elevations. Int. J. Parasitol., 51:365-378, 2021.
  • Schweder (1982) T.Β Schweder. On the dispersion of mixtures. Scandinavian Journal of Statistics, 9:165–169, 1982.
  • Scott (1987) M.E. Scott. Temporal changes in aggregation: a laboratory study. Parasitology, 94:583-595, 1987.
  • Shaked and Shanthikumar (2007) M.Β Shaked and J.G. Shanthikumar. Stochastic orders. Springer, New York, 2007.
  • Shaw and Dobson (1995) D.J Shaw and A.P. Dobson. Patterns of macroparasite abundance and aggregation in wildlife populations: A quantitative review. Parasitology, 111:S111–S133, 1995.
  • Shaw et al (1998) D.J. Shaw, B.T. Grenfell and A.P. Dobson. Patterns of macroparasite aggregation in wildlife host populations. Parasitology, 117:597-610, 1998.
  • Taguchi (1968) T. Taguchi. Concentration-curve methods and structures of skew populations. Annals of Institute of Statistical Mathematics, 20:107–141, 1968.
  • Tinsley et al (2020) R.C. Tinsley, H.R. Vineer, R. Grainger-Wood and E.R. Morgan, ER. Heterogeneity in helminth infections: factors influencing aggregation in a simple host-parasite system. Parasitology, 147:65-77, 2020.
  • Wade et al (2018) M.J. Wade, C.L. Fitzpatrick and C.M. Lively. 50-year anniversary of Lloyd’s β€œmean crowding”: Ideas on patchy distributions. J. Anim. Ecol., 87:1221–1226, 2018.
  • Wickham (2016) H. Wickham. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York, 2016.