Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
\addbibresource

references.bib

Surprisingly Popular Voting for Concentric Rank-Order Models

Hadi Hosseini
Penn State University, USA
hadi@psu.edu
   Debmalya Mandal
University of Warwick, UK
Debmalya.Mandal@warwick.ac.uk
   Amrit Puhan111Authors are ordered alphabetically.
Penn State University, USA
avp6267@psu.edu
Abstract

An important problem on social information sites is the recovery of ground truth from individual reports when the experts are in the minority. The wisdom of the crowd, i.e. the collective opinion of a group of individuals fails in such a scenario. However, the surprisingly popular (SP) algorithm [prelec2017solution] can recover the ground truth even when the experts are in the minority, by asking the individuals to report additional prediction reports–their beliefs about the reports of others. Several recent works have extended the surprisingly popular algorithm to an equivalent voting rule (SP-voting) to recover the ground truth ranking over a set of m𝑚mitalic_m alternatives. However, we are yet to fully understand when SP-voting can recover the ground truth ranking, and if so, how many samples (votes and predictions) it needs. We answer this question by proposing two rank-order models and analyzing the sample complexity of SP-voting under these models. In particular, we propose concentric mixtures of Mallows and Plackett-Luce models with G(2)annotated𝐺absent2G(\geq 2)italic_G ( ≥ 2 ) groups. Our models generalize previously proposed concentric mixtures of Mallows models with 2222 groups, and we highlight the importance of G>2𝐺2G>2italic_G > 2 groups by identifying three distinct groups (expert, intermediate, and non-expert) from existing datasets. Next, we provide conditions on the parameters of the underlying models so that SP-voting can recover ground-truth rankings with high probability, and also derive sample complexities under the same. We complement the theoretical results by evaluating SP-voting on simulated and real datasets.

1 Introduction

The recovery of ground truth from individual reports is one of the most vital aspects of social information sharing and online discourse. The wisdom of the crowds phenomenon refers to the observation that the collective value of a group of noisy individual opinions can be used to recover the ground truth [galton1949vox]. Such a collective value cancels out the biases of individual opinions when the number of participants is large and is often deployed to recover the ground truth on online polling and Q&A platforms (e.g. Reddit).

However, when the experts are in the minority, approaches that rely on the collective opinion of a group of individuals fail to recover the ground truth. The Surprisingly Popular (SP) algorithm [prelec2017solution] is a promising technique capable of recovering the ground truth even when experts are in the minority. In addition to asking individuals’ opinion (aka vote), it asks them to predict how they believe the majority’s answer is (aka prediction). The SP algorithm then picks the outcome which is surprisingly popular i.e. whose actual frequency in the votes is greater than its average predicted frequency. It provably recovers the ground truth as the number of individuals grows, even with a minority of experts.

This approach has been extended to voting rules, called SP-voting, in order to recover the ground truth rankings over a set of m𝑚mitalic_m alternatives. The naive application of SP-algorithm to voting requires that individuals submit their prediction as a distribution over m!𝑚m!italic_m ! possible permutation of alternatives, which implies that the amount of information elicited from each voter is exponential in m𝑚mitalic_m. Surprisingly, SP-voting has been shown to effectively recover the ground truth in practice even when predictions are limited to a set of size m𝑚mitalic_m, providing a substantial improvement over classical voting rules by focusing on eliciting the most likely top alternative or ranking [hosseini2021surprisingly]. Furthermore, SP-voting has been extended to partial ranks where the voters provide reports (votes and predictions) over subsets of size k𝑘kitalic_k with kmmuch-less-than𝑘𝑚k\ll mitalic_k ≪ italic_m [hosseini2024surprising].

While SP-voting has been shown to be effective in full or partial rankings, we are yet to fully understand when SP-voting can recover the ground truth ranking, and if so, how many samples (votes and predictions) it needs. To the best of our knowledge, this question is unexplored even for the basic SP algorithm. The main difficulty of analyzing such algorithms is that they are non-parametric i.e. they don’t make any assumptions about the underlying distribution of votes and predictions, and it’s not immediately clear what type of parametric models would be a good fit for real-world datasets and are also amenable to analysis under the surprisingly popular framework. For the setting of partial rankings, \citeauthor*hosseini2024surprising [hosseini2024surprising] performed a preliminary analysis of SP-voting under a mixture of Mallows model with two groups. However, we observe that the real datasets need more than two groups and more general rank-order models. Thus, we ask the following questions:

What general rank-order models can explain ranking datasets (both votes and predictions) with a ground truth ranking? Furthermore, can we analyze SP-voting under such rank-order models, and determine its sample complexity, and conditions for identifying the ground truth ranking?

1.1 Our Contributions

We propose various rank-order models with a ground truth ranking, and analyse the SP-voting rule under these models. In particular, our contributions are the following.

  • We propose two rank-order models, the Concentric Mixture of Mallows and the Concentric Mixture of Plackett-Luce, and generalize them to accommodate populations of G2𝐺2G\geq 2italic_G ≥ 2 groups.

  • We derive the conditions required for the identification of ground truth ranking under the SP-voting and the proposed concentric rank-order models. The derived conditions highlight a tension between the fraction of different groups and the "expertise" (i.e. noise levels) of different groups.

  • To evaluate practical viability, we fit these models to real-world datasets for populations with G=2𝐺2G=2italic_G = 2 and G=3𝐺3G=3italic_G = 3 groups. When G=3𝐺3G=3italic_G = 3, besides the expert and non-expert groups, we identify an intermediate group of voters of large fraction that explains the observed datasets better than prior approaches with two groups.

  • Furthermore, we generate synthetic data based on these models and provide empirical results on the sample complexity of SP-Voting, comparing it against the Copeland rule. Finally, experiments on real-world datasets show that SP-voting performs significantly better than the Copeland voting rule even when the dataset size is small.

1.2 Related Work

The challenge of ground truth recovery using the wisdom of the crowd has been extensively explored in social choice theory \parencitegalton1949vox, de2014essai, surowiecki2005wisdom. Several vote aggregation rules \parencitede2014essai, borda1781m, copeland1951reasonable, young1977extending have been proposed based on this concept to aggregate voters’ preferences and recover the underlying ground truth. However, this approach falters when the majority of participants are misinformed \parencitesimmons2011intuitive, biased \parencitechen2004eliminating, or when expert opinions are underrepresented within the population \parenciteprelec2017solution. To address this limitation, \citeauthor*prelec2017solution [prelec2017solution] introduced the Surprisingly Popular (SP) algorithm, which requires voters to provide two types of information: their individual vote and their prediction of the consensus vote. This framework has since been used to incentivize truthful behaviour in agents [schoenebeck2021wisdom, schoenebeck2023two], mitigate biases in academic peer review  [lu2024calibrating], elicit expert knowledge [kong2018eliciting], forecast geopolitical events [debmalya2020effectiveness], and aggregate information [chen2023wisdom]. However, \citeauthor*prelec2017solution [prelec2017solution]’s SP algorithm becomes impractical when the objective is to recover true ordinal ranking, since it necessitates information across all m!𝑚m!italic_m ! possible vote configurations. The surprisingly popular algorithm was extended to recover full rankings while reducing its complexity to (m2)binomial𝑚2\binom{m}{2}( FRACOP start_ARG italic_m end_ARG start_ARG 2 end_ARG ) votes, making it more practical for smaller values of m𝑚mitalic_m \parencitehosseini2021surprisingly. Further extending this line of work, SP-Voting has been generalized to handle any number of alternatives, while also introducing mechanisms for partial preference elicitation to improve the efficiency of ground truth recovery \parencitehosseini2024surprising. However, it is still unclear under what conditions SP-Voting is effective for a large number of alternatives when eliciting rankings. Specifically, the structure of the voting population and whether their voting behavior can be mathematically modeled need to be studied in detail.

The modeling of ranked data can be approached from two perspectives: modeling the population of voters and modeling the ranking process itself \parencitemarden1996analyzing. To date, the SP-Voting framework has been examined primarily by classifying voters into two distinct groups. Our work extends this analysis by generalizing it to account for any number of groups, denoted as G𝐺Gitalic_G. In terms of modeling the ranking process, several probabilistic models have been developed to represent voter preference generation. These include Order Statistic models, such as the Thurstonian model \parencitethurstone2017law; Pairwise Comparison models, like the Bradley-Terry model \parencitebradley1952rank; Multistage models, such as the Plackett-Luce model \parenciteluce1959possible, plackett1954reduction; and Distance-based models, like the Mallows’ model \parencitemallows1957non, among others. \citeauthor*marden1996analyzing [marden1996analyzing] provides a more comprehensive review of these models.

The SP-Voting framework was recently studied under the assumption that voters’ preferences are drawn from an underlying probability distribution known as the Concentric Mixture of Mallows model, a variant of Mallows’ model \parencitehosseini2024surprising. In this work, we extend the SP-Voting framework by investigating two different vote distribution assumptions: the distance-based Mallows’ model and the multistage Plackett-Luce model. Specifically, we build on prior work by extending the Mallows’ model to account for G𝐺Gitalic_G groups, allowing for a more general analysis of voter populations. Additionally, we propose a novel Concentric Plackett-Luce Mixture model, a variant of the multistage Plackett-Luce model, which similarly incorporates G𝐺Gitalic_G groups.

2 Model

Here we formally introduce the setting and the necessary notations. We will first introduce surprisingly popular voting considering reports over full rankings, and then cover the setting with partial rankings. Let A𝐴Aitalic_A = {a1,a2,,amsubscript𝑎1subscript𝑎2subscript𝑎𝑚a_{1},a_{2},...,a_{m}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT} be the set of m𝑚mitalic_m possible alternatives. The set (A)𝐴\mathcal{L}(A)caligraphic_L ( italic_A ) represents all possible complete rankings over the alternatives. Let σ(A)𝜎𝐴\sigma\in\mathcal{L}(A)italic_σ ∈ caligraphic_L ( italic_A ) represent a complete ranking of the m𝑚mitalic_m possible alternatives. We assume that there is a true ranking by σ(A)superscript𝜎𝐴\sigma^{\star}\in\mathcal{L}(A)italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∈ caligraphic_L ( italic_A ); which is drawn from a prior P()𝑃P(\cdot)italic_P ( ⋅ ) over (A)𝐴\mathcal{L}(A)caligraphic_L ( italic_A ). Voter i𝑖iitalic_i observes a ranking σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT that is assumed to be a noisy version of the ground truth ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. We will write Prs(σiσ)subscriptPr𝑠conditionalsubscript𝜎𝑖superscript𝜎\mathrm{Pr}_{s}(\sigma_{i}\mid\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) to denote the probability that the voter i𝑖iitalic_i observes her ranking σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT given the ground truth ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

Given voter i𝑖iitalic_i’s ranking σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and the prior P()𝑃P(\cdot)italic_P ( ⋅ ), voter i𝑖iitalic_i can compute the posterior distribution over the ground truth using the Bayes rule.

Prg(σσi)=Prs(σiσ)P(σ)σ(A)Prs(σi|σ)P(σ)subscriptPr𝑔conditionalsuperscript𝜎subscript𝜎𝑖subscriptPr𝑠conditionalsubscript𝜎𝑖superscript𝜎𝑃superscript𝜎subscriptsuperscript𝜎𝐴subscriptPr𝑠conditionalsubscript𝜎𝑖superscript𝜎𝑃superscript𝜎\mathrm{Pr}_{g}(\sigma^{\star}\mid\sigma_{i})=\frac{\mathrm{Pr}_{s}(\sigma_{i}% \mid\sigma^{\star})\cdot P(\sigma^{\star})}{\sum_{\sigma^{\prime}\in\mathcal{L% }(A)}{\mathrm{Pr}_{s}(\sigma_{i}|\sigma^{\prime})\cdot P(\sigma^{\prime})}}roman_Pr start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = divide start_ARG roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ⋅ italic_P ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_L ( italic_A ) end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⋅ italic_P ( italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG (1)

Using the posterior over the ground truth, voter i𝑖iitalic_i can also compute a distribution over the rankings observed by another voter.

Pro(σjσi)=σ(A)Prs(σjσ)Prg(σσi)\mathrm{Pr}_{o}(\sigma_{j}\mid\sigma_{i})=\sum_{\sigma{{}^{\prime}}\in\mathcal% {L}(A)}{\mathrm{Pr}_{s}(\sigma_{j}\mid\sigma{{}^{\prime}})\cdot\mathrm{Pr}_{g}% (\sigma^{{}^{\prime}}\mid\sigma_{i})}roman_Pr start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_σ start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ∈ caligraphic_L ( italic_A ) end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∣ italic_σ start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ∣ italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (2)

The surprisingly popular algorithm asks voters to report their votes, and posterior over others’ votes. For each ranking σsuperscript𝜎\sigma^{\prime}italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, it then computes the frequency f(σ)=1ni𝟏[σ=σ]𝑓superscript𝜎1𝑛subscript𝑖1delimited-[]𝜎superscript𝜎f(\sigma^{\prime})=\frac{1}{n}\sum_{i}\mathbf{1}[\sigma=\sigma^{\prime}]italic_f ( italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_1 [ italic_σ = italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ], and posterior

h(σσ)=1|{i:σi=σ}|i:σi=σPro(σσi),conditional𝜎superscript𝜎1conditional-set𝑖subscript𝜎𝑖superscript𝜎subscript:𝑖subscript𝜎𝑖superscript𝜎subscriptPr𝑜conditional𝜎subscript𝜎𝑖h(\sigma\mid\sigma^{\prime})=\frac{1}{\left|\left\{i:\sigma_{i}=\sigma^{\prime% }\right\}\right|}\sum_{i:\sigma_{i}=\sigma^{\prime}}\mathrm{Pr}_{o}(\sigma\mid% \sigma_{i}),italic_h ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = divide start_ARG 1 end_ARG start_ARG | { italic_i : italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } | end_ARG ∑ start_POSTSUBSCRIPT italic_i : italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ,

and finally picks the ranking with highest prediction normalized votes.222This is the direct application of SP algorithm \parenciteprelec2017solution by considering m!𝑚m!italic_m ! possible ground truths.

σ^argmaxσV¯(σ)=f(σ)σ(A)h(σσ)h(σσ)^𝜎subscriptargmax𝜎¯𝑉𝜎𝑓𝜎subscriptsuperscript𝜎𝐴conditionalsuperscript𝜎𝜎conditional𝜎superscript𝜎\widehat{\sigma}\in\textrm{argmax}_{\sigma}\overline{V}(\sigma)=f(\sigma)\cdot% \sum_{\sigma^{\prime}\in\mathcal{L}(A)}\frac{h(\sigma^{\prime}\mid\sigma)}{h(% \sigma\mid\sigma^{\prime})}over^ start_ARG italic_σ end_ARG ∈ argmax start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT over¯ start_ARG italic_V end_ARG ( italic_σ ) = italic_f ( italic_σ ) ⋅ ∑ start_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_L ( italic_A ) end_POSTSUBSCRIPT divide start_ARG italic_h ( italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_σ ) end_ARG start_ARG italic_h ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG (3)
\citeauthor

*hosseini2021surprisingly [hosseini2021surprisingly] observed that asking for full posterior over m!𝑚m!italic_m ! rankings might be prohibitive and introduced surprisingly popular voting (SP-voting) that only asks voters about ranking according to the posterior.

We will also consider the setting when voters report partial rankings over subsets of size kmmuch-less-than𝑘𝑚k\ll mitalic_k ≪ italic_m. Let us fix a subset TA𝑇𝐴T\subseteq Aitalic_T ⊆ italic_A of size k𝑘kitalic_k. Then the probability of a partial ranking πisubscript𝜋𝑖\pi_{i}italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT given the ground truth ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is

Prs(πiσ)=σ:σπiPrs(σσ)subscriptPr𝑠conditionalsubscript𝜋𝑖superscript𝜎subscript:𝜎contains-as-subgroup𝜎subscript𝜋𝑖subscriptPr𝑠conditional𝜎superscript𝜎\mathrm{Pr}_{s}(\pi_{i}\mid\sigma^{\star})=\sum_{\sigma:\sigma\rhd\pi_{i}}% \mathrm{Pr}_{s}(\sigma\mid\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_σ : italic_σ ⊳ italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )

Here σπicontains-as-subgroup𝜎subscript𝜋𝑖\sigma\rhd\pi_{i}italic_σ ⊳ italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT means that the ranking σ𝜎\sigmaitalic_σ when restricted to the subset T𝑇Titalic_T is πisubscript𝜋𝑖\pi_{i}italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We can also naturally extend definition 1 to define the posterior distribution given a partial ranking.

Prg(σπi)=Prs(πiσ)P(σ)σ(A)Prs(πi|σ)P(σ)subscriptPr𝑔conditionalsuperscript𝜎subscript𝜋𝑖subscriptPr𝑠conditionalsubscript𝜋𝑖superscript𝜎𝑃superscript𝜎subscriptsuperscript𝜎𝐴subscriptPr𝑠conditionalsubscript𝜋𝑖superscript𝜎𝑃superscript𝜎\mathrm{Pr}_{g}(\sigma^{\star}\mid\pi_{i})=\frac{\mathrm{Pr}_{s}(\pi_{i}\mid% \sigma^{\star})\cdot P(\sigma^{\star})}{\sum_{\sigma^{\prime}\in\mathcal{L}(A)% }{\mathrm{Pr}_{s}(\pi_{i}|\sigma^{\prime})\cdot P(\sigma^{\prime})}}roman_Pr start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = divide start_ARG roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ⋅ italic_P ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_L ( italic_A ) end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⋅ italic_P ( italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG (4)

Using the posterior over the ground truth, voter i𝑖iitalic_i can also compute the distribution over partial rankings observed by another voter.

Pro(πjπi)=σ(A)Prs(πjσ)Prg(σπi)subscriptPr𝑜conditionalsubscript𝜋𝑗subscript𝜋𝑖subscriptsuperscript𝜎𝐴subscriptPr𝑠conditionalsubscript𝜋𝑗superscript𝜎subscriptPr𝑔conditionalsuperscript𝜎subscript𝜋𝑖\mathrm{Pr}_{o}(\pi_{j}\mid\pi_{i})=\sum_{\sigma^{\prime}\in\mathcal{L}(A)}% \mathrm{Pr}_{s}(\pi_{j}\mid\sigma^{\prime})\cdot\mathrm{Pr}_{g}(\sigma^{\prime% }\mid\pi_{i})roman_Pr start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∣ italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_L ( italic_A ) end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_π start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (5)

Finally, we can compute the prediction-normalized vote (as defined in eq. 3 but over partial rankings) and pick the partial ranking π^^𝜋\widehat{\pi}over^ start_ARG italic_π end_ARG over the subset T𝑇Titalic_T with the maximum value. We are interested in extension of SP-voting to partial rankings as proposed by \citeauthor*hosseini2024surprising [hosseini2024surprising]. Namely, the partial-SP algorithm first applies SP-voting to a collection of subsets to recover ground truth partial rankings over these subsets, and then aggregates them using a voting rule [hosseini2024surprising].

In the next section, we describe in detail the exact distribution that PrssubscriptPr𝑠\mathrm{Pr}_{s}roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT takes to accurately model the voter behavior and reason about our choices.

3 Concentric Mixtures Models

Concentric Mixture Models are a class of probabilistic models used to represent how different groups within a population rank a set of alternatives, all relative to a single underlying ground truth ranking. These models capture variations in group behavior by incorporating parameters that reflect the degree and nature of each group’s deviation from this central ranking. Our main goal in this section is to analyze the performance of SP-voting under different concentric mixture models, by first identifying the conditions required to identify the ground truth, and then providing upper bounds on the sample complexity of SP-voting. We begin with the Concentric Mixture of Mallows Model in Section 3.1 , followed by the Concentric Mixture of Plackett-Luce Model in Section 3.2, which is a new model proposed in this work.

3.1 The Concentric Mixture of Mallows Model

The Concentric Mixture of Mallows Model (CMM) \parenciteCI21 uses a distance-based approach to quantify deviations from the central ranking. Specifically, group g𝑔gitalic_g’s ranking is modeled as a Mallows model with a group-specific dispersion parameter ϕgsubscriptitalic-ϕ𝑔\phi_{g}italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT, which controls the degree of expertise of the group. The following equation describes the ranking observed by a voter where the voting population has G𝐺Gitalic_G distinct groups:

Prs(σσ)=g=1GpgPrs(σσ,ϕg)subscriptPr𝑠conditional𝜎superscript𝜎superscriptsubscript𝑔1𝐺subscript𝑝𝑔subscriptPr𝑠conditional𝜎superscript𝜎subscriptitalic-ϕ𝑔\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star})=\sum_{g=1}^{G}p_{g}\cdot\mathrm{Pr}_% {s}(\sigma\mid\sigma^{\star},\phi_{g})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_g = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) (6)

Here σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the underlying ground-truth ranking, and Prs(σσ,ϕg)subscriptPr𝑠conditional𝜎superscript𝜎subscriptitalic-ϕ𝑔\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\phi_{g})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) is the probability of a voter observing the ranking σ𝜎\sigmaitalic_σ given the ground-truth ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and the dispersion parameter ϕgsubscriptitalic-ϕ𝑔\phi_{g}italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT for group g𝑔gitalic_g. The parameter pgsubscript𝑝𝑔p_{g}italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT represents the probability of voter i𝑖iitalic_i belonging to group g𝑔gitalic_g, where g=1Gpg=1superscriptsubscript𝑔1𝐺subscript𝑝𝑔1\sum_{g=1}^{G}p_{g}=1∑ start_POSTSUBSCRIPT italic_g = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT = 1. In the Concentric Mixture of Mallows model, the probability Prs(σσ,ϕg)subscriptPr𝑠conditional𝜎superscript𝜎subscriptitalic-ϕ𝑔\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\phi_{g})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) is defined as:

Prs(σσ,ϕg)=ϕgd(σ,σ)Z(ϕg,m)subscriptPr𝑠conditional𝜎superscript𝜎subscriptitalic-ϕ𝑔superscriptsubscriptitalic-ϕ𝑔𝑑𝜎superscript𝜎𝑍subscriptitalic-ϕ𝑔𝑚\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\phi_{g})=\frac{\phi_{g}^{d(\sigma,% \sigma^{\star})}}{Z(\phi_{g},m)}roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) = divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( italic_σ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT , italic_m ) end_ARG (7)

where d(σ,σ)𝑑𝜎superscript𝜎d(\sigma,\sigma^{\star})italic_d ( italic_σ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) is the Kendall-Tau distance between the observed ranking σ𝜎\sigmaitalic_σ and the central ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT, and Z(ϕg,m)𝑍subscriptitalic-ϕ𝑔𝑚Z(\phi_{g},m)italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT , italic_m ) is the normalization constant that ensures that the probabilities sum to 1111 across all possible rankings. We will assume that ϕ1ϕ2ϕGsubscriptitalic-ϕ1subscriptitalic-ϕ2subscriptitalic-ϕ𝐺\phi_{1}\leq\phi_{2}\leq\ldots\leq\phi_{G}italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ … ≤ italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT. Note that, a smaller value of the dispersion parameter implies that the group is more expert i.e. likely to observe a ranking closer to the ground truth ranking.

For the case of two groups (i.e. G=2𝐺2G=2italic_G = 2), \citeauthor*CI21 [CI21] analyzed the identifiability and sample complexity of the concentric mixture model under the Borda voting rule. Our first goal is to analyze the same model under the SP-Voting rule and an arbitrary number of groups. There are two main steps in the analysis of SP-Voting

  1. 1.

    Identification: determine the condition needed to ensure

    V¯(σ)2maxτ:d(τ,σ)1V¯(τ),¯𝑉superscript𝜎2subscript:𝜏𝑑𝜏superscript𝜎1¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\cdot\max_{\tau:d(\tau,\sigma^{\star})\geq 1% }\overline{V}(\tau),over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 ⋅ roman_max start_POSTSUBSCRIPT italic_τ : italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1 end_POSTSUBSCRIPT over¯ start_ARG italic_V end_ARG ( italic_τ ) ,

    so that maximizing prediction-normalized-vote returns the ground truth.

  2. 2.

    Sample Complexity: when the identification condition holds, determine the number of samples necessary to ensure

    V¯^(σ)>maxτ:d(τ,σ)1V¯^(τ),^¯𝑉superscript𝜎subscript:𝜏𝑑𝜏superscript𝜎1^¯𝑉𝜏\widehat{\overline{V}}(\sigma^{\star})>\max_{\tau:d(\tau,\sigma^{\star})\geq 1% }\widehat{\overline{V}}(\tau),over^ start_ARG over¯ start_ARG italic_V end_ARG end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) > roman_max start_POSTSUBSCRIPT italic_τ : italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1 end_POSTSUBSCRIPT over^ start_ARG over¯ start_ARG italic_V end_ARG end_ARG ( italic_τ ) ,

    so that maximizing the prediction-normalized votes from samples returns the ground truth.

For the setting of G=2𝐺2G=2italic_G = 2, the following result regarding identifying the CMM model has already been proved \parencitehosseini2024surprising. 333The results were originally proved for partial rankings with G=2𝐺2G=2italic_G = 2 \parencitehosseini2024surprising but here we present a simplified version for full rankings.

Lemma 1 (\citeauthor*hosseini2024surprising [hosseini2024surprising]).

Suppose p11/2subscript𝑝112p_{1}\leq 1/2italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ 1 / 2 and the following condition holds.

(p11p1)22Z(ϕ2)3Z(ϕ1)2ϕ1m(m1)/2superscriptsubscript𝑝11subscript𝑝122𝑍superscriptsubscriptitalic-ϕ23𝑍superscriptsubscriptitalic-ϕ12superscriptsubscriptitalic-ϕ1𝑚𝑚12\left(\frac{p_{1}}{1-p_{1}}\right)^{2}\geq 2\cdot\frac{Z(\phi_{2})^{3}}{Z(\phi% _{1})^{2}}\phi_{1}^{m(m-1)/2}( divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 2 ⋅ divide start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m ( italic_m - 1 ) / 2 end_POSTSUPERSCRIPT

Then for any τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1 we have V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ).

The above result says that if the non-experts are too noisy (i.e. ϕ2ϕ1much-greater-thansubscriptitalic-ϕ2subscriptitalic-ϕ1\phi_{2}\gg\phi_{1}italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≫ italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) then the fraction of experts p1subscript𝑝1p_{1}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT cannot be too small. Next we generalize the lemma for the case of arbitrary number of groups.

Lemma 2.

Suppose the set G𝐺Gitalic_G can be partitioned into sets G1={1,2,,s}subscript𝐺112𝑠G_{1}=\left\{1,2,\ldots,s\right\}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 , 2 , … , italic_s } and G2={s+1,,G}subscript𝐺2𝑠1𝐺G_{2}=\left\{s+1,\ldots,G\right\}italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_s + 1 , … , italic_G }. Let α=jG1pj𝛼subscript𝑗subscript𝐺1subscript𝑝𝑗\alpha=\sum_{j\in G_{1}}p_{j}italic_α = ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and the following condition holds.

αZ(ϕs)+1αZ(ϕG)2(ϕsZ(ϕ1)α+ϕGZ(ϕs+1)(1α))𝛼𝑍subscriptitalic-ϕ𝑠1𝛼𝑍subscriptitalic-ϕ𝐺2subscriptitalic-ϕ𝑠𝑍subscriptitalic-ϕ1𝛼subscriptitalic-ϕ𝐺𝑍subscriptitalic-ϕ𝑠11𝛼\frac{\alpha}{Z(\phi_{s})}+\frac{1-\alpha}{Z(\phi_{G})}\geq 2\left(\frac{\phi_% {s}}{Z(\phi_{1})}\alpha+\frac{\phi_{G}}{Z(\phi_{s+1})}(1-\alpha)\right)divide start_ARG italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG + divide start_ARG 1 - italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) end_ARG ≥ 2 ( divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG italic_α + divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s + 1 end_POSTSUBSCRIPT ) end_ARG ( 1 - italic_α ) )

Then we are guaranteed that V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ) for any τ𝜏\tauitalic_τ such that d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1.

The proof is provided in the appendix where we generalize lemma 1 and also simplify the conditions required for identification. One way to interpret the result is that when the experts are in the minority i.e. α1/2much-less-than𝛼12\alpha\ll 1/2italic_α ≪ 1 / 2 then we need Z(ϕs+1)2ϕGZ(ϕG)𝑍subscriptitalic-ϕ𝑠12subscriptitalic-ϕ𝐺𝑍subscriptitalic-ϕ𝐺Z(\phi_{s+1})\geq 2\phi_{G}Z(\phi_{G})italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s + 1 end_POSTSUBSCRIPT ) ≥ 2 italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) i.e. the dispersion parameter of the best non-expert should be sufficiently large. In the next subsection, we derive identifiability results under a different concentric mixture model, and then later provide sample complexity of SP-Voting under different rank-order models.

3.2 The Concentric Mixture of Plackett-Luce Model

In this subsection, we introduce the Concentric Mixture of Plackett-Luce Model (CMPL), which uses an element-specific probabilistic framework to rank alternatives based on their relative probabilities within each group. Specifically, group g𝑔gitalic_g’s ranking is modelled as a Plackett-Luce model with a group-specific parameter vector θgR+msubscript𝜃𝑔subscriptsuperscriptR𝑚\theta_{g}\in\mathrm{R}^{m}_{+}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ∈ roman_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. As before, the following equation describes the ranking observed by a voter, where the voting population is divided into G𝐺Gitalic_G distinct groups:

Prs(σσ,𝜽)=g=1GpgPrs(σσ,θg)subscriptPr𝑠conditional𝜎superscript𝜎𝜽superscriptsubscript𝑔1𝐺subscript𝑝𝑔subscriptPr𝑠conditional𝜎superscript𝜎subscript𝜃𝑔\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\bm{\theta})=\sum_{g=1}^{G}p_{g}\cdot% \mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\theta_{g})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , bold_italic_θ ) = ∑ start_POSTSUBSCRIPT italic_g = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) (8)

Here σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the ground-truth ranking, and θgsubscript𝜃𝑔\theta_{g}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT is the vector of strength parameters for group g𝑔gitalic_g. The parameter pgsubscript𝑝𝑔p_{g}italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT represents the probability that voter i𝑖iitalic_i belongs to group g𝑔gitalic_g, where the mixture weights satisfy the constraint g=1Gpg=1superscriptsubscript𝑔1𝐺subscript𝑝𝑔1\sum_{g=1}^{G}p_{g}=1∑ start_POSTSUBSCRIPT italic_g = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT = 1. In the Concentric mixture of Plackett-Luce model, the probability Prs(σσ,θg)subscriptPr𝑠conditional𝜎superscript𝜎subscript𝜃𝑔\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\theta_{g})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) is defined as:

Prs(σσ,θg)=j=1mθg,σ1(σ(j))=jmθg,σ1(σ())subscriptPr𝑠conditional𝜎superscript𝜎subscript𝜃𝑔superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑔superscript𝜎superscript1𝜎𝑗superscriptsubscript𝑗𝑚subscript𝜃𝑔superscript𝜎superscript1𝜎\mathrm{Pr}_{s}(\sigma\mid\sigma^{\star},\theta_{g})=\prod_{j=1}^{m}\frac{% \theta_{g,\sigma^{\star^{-1}}(\sigma(j))}}{\sum_{\ell=j}^{m}\theta_{g,\sigma^{% \star^{-1}}(\sigma(\ell))}}roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) = ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_g , italic_σ start_POSTSUPERSCRIPT ⋆ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( italic_j ) ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_g , italic_σ start_POSTSUPERSCRIPT ⋆ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( roman_ℓ ) ) end_POSTSUBSCRIPT end_ARG (9)

Here, σ(j)𝜎𝑗\sigma(j)italic_σ ( italic_j ) denotes the alternative assigned to the j𝑗jitalic_j-th position in the ranking σ𝜎\sigmaitalic_σ, while σ1(σ(j))superscript𝜎superscript1𝜎𝑗\sigma^{\star^{-1}}(\sigma(j))italic_σ start_POSTSUPERSCRIPT ⋆ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( italic_j ) ) denotes the position of the alternative σ(j)𝜎𝑗\sigma(j)italic_σ ( italic_j ) in the ranking σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. Equation 9 describes a Plackett-Luce model with ground truth σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and strength parameter vector θgsubscript𝜃𝑔\theta_{g}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT, as θg,σ1(σ(j))subscript𝜃𝑔superscript𝜎superscript1𝜎𝑗\theta_{g,\sigma^{\star^{-1}}(\sigma(j))}italic_θ start_POSTSUBSCRIPT italic_g , italic_σ start_POSTSUPERSCRIPT ⋆ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( italic_j ) ) end_POSTSUBSCRIPT represents the strength parameter for that alternative within group g𝑔gitalic_g, and, the denominator, =jmθg,σ1(σ())superscriptsubscript𝑗𝑚subscript𝜃𝑔superscript𝜎superscript1𝜎\sum_{\ell=j}^{m}\theta_{g,\sigma^{\star^{-1}}(\sigma(\ell))}∑ start_POSTSUBSCRIPT roman_ℓ = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_g , italic_σ start_POSTSUPERSCRIPT ⋆ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( roman_ℓ ) ) end_POSTSUBSCRIPT, ensures that the probability of selecting each alternative is normalized, considering only the alternatives that remain to be ranked.

3.2.1 Constraints on Strength Parameters

Recall that in the concentric mixture of Mallows model the groups were ranked according to their dispersion parameters, i.e. ϕg1ϕg2subscriptitalic-ϕsubscript𝑔1subscriptitalic-ϕsubscript𝑔2\phi_{g_{1}}\leq\phi_{g_{2}}italic_ϕ start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ italic_ϕ start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT implies that group g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is more expert compared to the group g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. We now impose a similar condition on the parameters of the concentric mixture of Plackett-Luce model.

The strength parameters θgsubscript𝜃𝑔\theta_{g}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT for each group are subject to two key constraints:

  • Within-group constraint: For each group g𝑔gitalic_g, the sum of the strength parameters equals 1111444The constant 1111 can be arbitrary, but must be the same across the groups., ensuring that the sum of the parameters is identical across the G𝐺Gitalic_G groups.

    j=1mθg,j=1g{1,,G}.formulae-sequencesuperscriptsubscript𝑗1𝑚subscript𝜃𝑔𝑗1for-all𝑔1𝐺\sum_{j=1}^{m}\theta_{g,j}=1\quad\forall g\in\{1,\dots,G\}.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_g , italic_j end_POSTSUBSCRIPT = 1 ∀ italic_g ∈ { 1 , … , italic_G } .

    Additionally, the entries in θgsubscript𝜃𝑔\theta_{g}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT are non-increasing i.e. θg,iθg,jsubscript𝜃𝑔𝑖subscript𝜃𝑔𝑗\theta_{g,i}\geq\theta_{g,j}italic_θ start_POSTSUBSCRIPT italic_g , italic_i end_POSTSUBSCRIPT ≥ italic_θ start_POSTSUBSCRIPT italic_g , italic_j end_POSTSUBSCRIPT for ij𝑖𝑗i\geq jitalic_i ≥ italic_j.

  • Between-group constraint: The strength parameters for the higher-expertise group should stochastically dominate those of the lower-expertise groups. In particular, for any location \ellroman_ℓ the following condition must hold.

    j=1θ1,jj=1θ2,jj=1θG,j{1,,m}.formulae-sequencesuperscriptsubscript𝑗1subscript𝜃1𝑗superscriptsubscript𝑗1subscript𝜃2𝑗superscriptsubscript𝑗1subscript𝜃𝐺𝑗for-all1𝑚\sum_{j=1}^{\ell}\theta_{1,j}\geq\sum_{j=1}^{\ell}\theta_{2,j}\geq\dots\geq% \sum_{j=1}^{\ell}\theta_{G,j}\quad\forall\ell\in\{1,\dots,m\}.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT ≥ ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT ≥ ⋯ ≥ ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_G , italic_j end_POSTSUBSCRIPT ∀ roman_ℓ ∈ { 1 , … , italic_m } .

    This hierarchical constraint ensures that the behavior of the groups is ordered in a way that reflects their relative strengths, with group 1111 being closest to the ground-truth ranking, and subsequent groups deviating further from it.

We now turn to derive the identification condition to ensure that the ground truth ranking is the unique ranking to maximize the prediction-normalized vote. The next lemma gives a sufficient condition under the CMPL model and two groups.

Lemma 3.

Suppose p11/2subscript𝑝112p_{1}\leq 1/2italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ 1 / 2 and the following condition holds.

(p11p1)22(j=1mθ2,ji=jmθ2,i)(j=1mθ1,ji=jmθ1,i)1(j=1mθ1,mj+1i=jmθ1,mi+1)superscriptsubscript𝑝11subscript𝑝122superscriptsubscriptproduct𝑗1𝑚subscript𝜃2𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2𝑖superscriptsuperscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖1superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1\left(\frac{p_{1}}{1-p_{1}}\right)^{2}\geq 2\cdot\left(\prod_{j=1}^{m}\frac{% \theta_{2,j}}{\sum_{i=j}^{m}\theta_{2,i}}\right)\left(\prod_{j=1}^{m}\frac{% \theta_{1,j}}{\sum_{i=j}^{m}\theta_{1,i}}\right)^{-1}\left(\prod_{j=1}^{m}% \frac{\theta_{1,m-j+1}}{\sum_{i=j}^{m}\theta_{1,m-i+1}}\right)( divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 2 ⋅ ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_i end_POSTSUBSCRIPT end_ARG ) ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG )

Then for any ranking τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1 we are guaranteed that V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ).

In order to interpret the condition, let us choose a simple setting of strength parameters. Let θ1=(γ1,1,,1)/(γ1+m1)subscript𝜃1subscript𝛾111subscript𝛾1𝑚1\theta_{1}=(\gamma_{1},1,\ldots,1)/(\gamma_{1}+m-1)italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , 1 , … , 1 ) / ( italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_m - 1 ) and similarly θ2=(γ2,1,,1)/(γ2+m1)subscript𝜃2subscript𝛾211subscript𝛾2𝑚1\theta_{2}=(\gamma_{2},1,\ldots,1)/(\gamma_{2}+m-1)italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , 1 , … , 1 ) / ( italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_m - 1 ). Then it can be verified that condition of Lemma 3 simplifies to the following,

(p11p1)22γ2γ2+m1γ1+m1γ1j=1m11γ1+mjsuperscriptsubscript𝑝11subscript𝑝122subscript𝛾2subscript𝛾2𝑚1subscript𝛾1𝑚1subscript𝛾1superscriptsubscriptproduct𝑗1𝑚11subscript𝛾1𝑚𝑗\left(\frac{p_{1}}{1-p_{1}}\right)^{2}\geq 2\frac{\gamma_{2}}{\gamma_{2}+m-1}% \frac{\gamma_{1}+m-1}{\gamma_{1}}\prod_{j=1}^{m-1}\frac{1}{\gamma_{1}+m-j}( divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 2 divide start_ARG italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_m - 1 end_ARG divide start_ARG italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_m - 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_m - italic_j end_ARG

and for large enough m𝑚mitalic_m we need p11p12γ2/γ1m(m1)/2greater-than-or-equivalent-tosubscript𝑝11subscript𝑝12subscript𝛾2subscript𝛾1superscript𝑚𝑚12\frac{p_{1}}{1-p_{1}}\gtrsim\sqrt{2\gamma_{2}/\gamma_{1}}\cdot m^{-(m-1)/2}divide start_ARG italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ≳ square-root start_ARG 2 italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ⋅ italic_m start_POSTSUPERSCRIPT - ( italic_m - 1 ) / 2 end_POSTSUPERSCRIPT. This means that as γ2subscript𝛾2\gamma_{2}italic_γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT approaches γ1subscript𝛾1\gamma_{1}italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (i.e. non-experts become close to experts), we need a larger value of p1subscript𝑝1p_{1}italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (i.e. fraction of experts) to succeed. The next lemma generalizes the identifiability condition to an arbitrary number of groups.

Lemma 4.

Suppose the set G𝐺Gitalic_G can be partitioned into sets G1={1,2,,s}subscript𝐺112𝑠G_{1}=\left\{1,2,\ldots,s\right\}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 , 2 , … , italic_s } and G2={s+1,,G}subscript𝐺2𝑠1𝐺G_{2}=\left\{s+1,\ldots,G\right\}italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_s + 1 , … , italic_G }. Let α=jG1pj𝛼subscript𝑗subscript𝐺1subscript𝑝𝑗\alpha=\sum_{j\in G_{1}}p_{j}italic_α = ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and the following condition holds.

αj=1mθs,ji=jmθs,i+(1α)j=1mθG,ji=jmθG,i𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠𝑖1𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝐺𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝐺𝑖\displaystyle\alpha\prod_{j=1}^{m}\frac{\theta_{s,j}}{\sum_{i=j}^{m}\theta_{s,% i}}+(1-\alpha)\prod_{j=1}^{m}\frac{\theta_{G,j}}{\sum_{i=j}^{m}\theta_{G,i}}italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s , italic_i end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_G , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_G , italic_i end_POSTSUBSCRIPT end_ARG
\displaystyle\geq 2αj=1mθ1,ji=jmθ1,i+2(1α)j=1mθs+1,ji=jmθs+1,iαj=1mθ1,mj+1i=jmθ1,mi+1+(1α)j=1mθs+1,mj+1i=jmθs+1,mi+12𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖21𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑖𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖11𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑚𝑖1\displaystyle\frac{2\alpha\prod_{j=1}^{m}\frac{\theta_{1,j}}{\sum_{i=j}^{m}% \theta_{1,i}}+2(1-\alpha)\prod_{j=1}^{m}\frac{\theta_{s+1,j}}{\sum_{i=j}^{m}% \theta_{s+1,i}}}{\alpha\cdot\prod_{j=1}^{m}\frac{\theta_{1,m-j+1}}{\sum_{i=j}^% {m}\theta_{1,m-i+1}}+(1-\alpha)\cdot\prod_{j=1}^{m}\frac{\theta_{s+1,m-j+1}}{% \sum_{i=j}^{m}\theta_{s+1,m-i+1}}}divide start_ARG 2 italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG + 2 ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_α ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG end_ARG

Then for any ranking τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1 we are guaranteed that V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ).

3.3 Sample Complexity Bounds

Once we have derived the identifiability conditions, the derivation of sample complexity is relatively straightforward. When the number of samples is large, the empirical prediction-normalized vote V^(σ)^𝑉𝜎\widehat{V}(\sigma)over^ start_ARG italic_V end_ARG ( italic_σ ) concentrates around V¯(σ)¯𝑉𝜎\overline{V}(\sigma)over¯ start_ARG italic_V end_ARG ( italic_σ ) with high probability, and the condition V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ) guarantees that we can always ensure V^(σ)V^(τ)^𝑉superscript𝜎^𝑉𝜏\widehat{V}(\sigma^{\star})\geq\widehat{V}(\tau)over^ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ over^ start_ARG italic_V end_ARG ( italic_τ ) for any τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1. Therefore, picking a ranking that maximizes the empirical prediction-normalized votes returns the ground truth ranking. The next lemma states the sample complexity for the CMM model.

Lemma 5.

Under the same setting as Lemma 2, suppose the number of samples is nO(m!mlog(m/δ))𝑛𝑂𝑚𝑚𝑚𝛿n\geq O\left(m!\sqrt{m\log(m/\delta)}\right)italic_n ≥ italic_O ( italic_m ! square-root start_ARG italic_m roman_log ( italic_m / italic_δ ) end_ARG ). Then SP-voting recovers the ground truth ranking with probability at least 1δ1𝛿1-\delta1 - italic_δ.

The proof draws inspiration from \citeauthor*hosseini2024surprising [hosseini2024surprising]’s proof of Corollary 1 with the difference being that here we consider m!𝑚m!italic_m ! rankings instead of k!𝑘k!italic_k ! and then use union bound over all subsets.

4 Experiments

In this section, we describe how we infer the parameters for both the CMM and the CMPL using a real-world dataset.

Dataset. We use a real-world dataset from a recent online experiment run on SP-voting by \citeauthor*hosseini2024surprising [hosseini2024surprising].555The dataset can be found here - https://github.com/amrit19/Surprisingly-Popular-Voting-Partial The dataset consists of real participants who provide both Vote and Prediction data across three distinct domains: Geography, Movies, and Paintings. The dataset contains rankings over five alternatives that are selected from a universe of 36363636 alternatives. The dataset contains reports from 432432432432 participants over 12121212 questions from each domain. The alternatives are ranked based on the following domain-specific metrics:

  • Countries: Ranked by population.

  • Movies: Ranked by gross lifetime box-office earnings.

  • Paintings: Ranked by auction prices.

In addition to their Votes over these alternatives, each participant provides their Prediction report based on the posterior belief about another participant’s votes. The types of prediction reports are based on ranking and can be Top (most likely alternative), Rank (most likely ranking), Top-t𝑡titalic_t (approval of top t𝑡titalic_t alternatives).

We fit both variants of the Concentric Mixture Models— Mallows and Plackett-Luce— to the dataset to infer the parameters governing the group-specific ranking behaviors. The objective is to capture how different population groups deviate from a shared underlying ground truth ranking.

Inference Methodology. To estimate the parameters of the models, we employ a Bayesian inference approach, which allows us to estimate the posterior distributions of the parameters given the observed rankings. In particular, we use No-U-Turn Sampling (NUTS) \parencitehoffman2014no, an advanced variant of Hamiltonian Monte Carlo (HMC), to sample from the posterior distribution of the parameters. By utilizing this sampling technique, we can obtain accurate estimates of the model parameters, such as the proportion parameters (pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT), dispersion parameters (ϕgsubscriptitalic-ϕ𝑔\phi_{g}italic_ϕ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT) for the Mallows model, and strength parameters (θgsubscript𝜃𝑔\theta_{g}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT) for the Plackett-Luce model, for each of the population groups. The use of NUTS also enables us to quantify the uncertainty in the parameter estimates, providing credible intervals for the inferred parameters. This is particularly important when analyzing real-world ranking data, as it allows us to account for variability across different population groups and rankings. Next we discuss the parameter inference for the CMM and CMPL models.

Refer to caption
(a) G=2𝐺2G=2italic_G = 2
Refer to caption
(b) G=3𝐺3G=3italic_G = 3
Figure 1: Dispersion parameters for Votes and Predictions for G=2𝐺2G=2italic_G = 2 and 3333 of CMM model. We see that the CMM model with G=3𝐺3G=3italic_G = 3 identifies an intermediate group whose peak lies between experts and non-experts.

4.1 Concentric Mixture of Mallows

We fit the CMM with G=2𝐺2G=2italic_G = 2 and 3333 groups to the dataset described earlier in this section. Below we describe the parameter inference procedure for G=3𝐺3G=3italic_G = 3 groups, the more general case. The three groups are categorized as experts, intermediates, and non-experts. We infer several key parameters, including the proportion of each group (pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT), the dispersion parameters for experts’ votes (ϕE-votessubscriptitalic-ϕ𝐸-votes\phi_{E\text{-votes}}italic_ϕ start_POSTSUBSCRIPT italic_E -votes end_POSTSUBSCRIPT) and predictions (ϕE-predictionssubscriptitalic-ϕ𝐸-predictions\phi_{E\text{-predictions}}italic_ϕ start_POSTSUBSCRIPT italic_E -predictions end_POSTSUBSCRIPT), the dispersion parameters for intermediates’ votes (ϕI-votessubscriptitalic-ϕ𝐼-votes\phi_{I\text{-votes}}italic_ϕ start_POSTSUBSCRIPT italic_I -votes end_POSTSUBSCRIPT) and predictions (ϕI-predictionssubscriptitalic-ϕ𝐼-predictions\phi_{I\text{-predictions}}italic_ϕ start_POSTSUBSCRIPT italic_I -predictions end_POSTSUBSCRIPT), and the dispersion parameters for non-experts’ votes (ϕNE-votessubscriptitalic-ϕ𝑁𝐸-votes\phi_{NE\text{-votes}}italic_ϕ start_POSTSUBSCRIPT italic_N italic_E -votes end_POSTSUBSCRIPT) and predictions (ϕNE-predictionssubscriptitalic-ϕ𝑁𝐸-predictions\phi_{NE\text{-predictions}}italic_ϕ start_POSTSUBSCRIPT italic_N italic_E -predictions end_POSTSUBSCRIPT).

We first compute the Kendall-Tau distances between each participant’s vote and prediction rankings and the ground-truth ranking. These Kendall-Tau distances (τvotessubscript𝜏votes\tau_{\text{votes}}italic_τ start_POSTSUBSCRIPT votes end_POSTSUBSCRIPT and τpredictionssubscript𝜏predictions\tau_{\text{predictions}}italic_τ start_POSTSUBSCRIPT predictions end_POSTSUBSCRIPT) serve as a measure of how much each participant’s rankings deviate from the central ground-truth ordering. The model’s priors for the dispersion parameters and the group proportions are specified as follows:

pDirichlet(2,2,4)ϕE-votesN(0.1,0.2),ϕE-predictionsN(0.4,0.3)ϕI-votesN(0.4,0.2),ϕI-predictionsN(0.4,0.3)ϕNE-votesN(0.8,0.2),ϕNE-predictionsN(0.8,0.3)missing-subexpressionsimilar-to𝑝Dirichlet224subscriptitalic-ϕ𝐸-votesformulae-sequencesimilar-toabsent𝑁0.10.2similar-tosubscriptitalic-ϕ𝐸-predictions𝑁0.40.3subscriptitalic-ϕ𝐼-votesformulae-sequencesimilar-toabsent𝑁0.40.2similar-tosubscriptitalic-ϕ𝐼-predictions𝑁0.40.3subscriptitalic-ϕ𝑁𝐸-votesformulae-sequencesimilar-toabsent𝑁0.80.2similar-tosubscriptitalic-ϕ𝑁𝐸-predictions𝑁0.80.3\begin{array}[]{rl}&p\sim\text{Dirichlet}(2,2,4)\\ \phi_{E\text{-votes}}&\sim N(0.1,0.2),\quad\phi_{E\text{-predictions}}\sim N(0% .4,0.3)\\ \phi_{I\text{-votes}}&\sim N(0.4,0.2),\quad\phi_{I\text{-predictions}}\sim N(0% .4,0.3)\\ \phi_{NE\text{-votes}}&\sim N(0.8,0.2),\quad\phi_{NE\text{-predictions}}\sim N% (0.8,0.3)\end{array}start_ARRAY start_ROW start_CELL end_CELL start_CELL italic_p ∼ Dirichlet ( 2 , 2 , 4 ) end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUBSCRIPT italic_E -votes end_POSTSUBSCRIPT end_CELL start_CELL ∼ italic_N ( 0.1 , 0.2 ) , italic_ϕ start_POSTSUBSCRIPT italic_E -predictions end_POSTSUBSCRIPT ∼ italic_N ( 0.4 , 0.3 ) end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUBSCRIPT italic_I -votes end_POSTSUBSCRIPT end_CELL start_CELL ∼ italic_N ( 0.4 , 0.2 ) , italic_ϕ start_POSTSUBSCRIPT italic_I -predictions end_POSTSUBSCRIPT ∼ italic_N ( 0.4 , 0.3 ) end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUBSCRIPT italic_N italic_E -votes end_POSTSUBSCRIPT end_CELL start_CELL ∼ italic_N ( 0.8 , 0.2 ) , italic_ϕ start_POSTSUBSCRIPT italic_N italic_E -predictions end_POSTSUBSCRIPT ∼ italic_N ( 0.8 , 0.3 ) end_CELL end_ROW end_ARRAY

These priors represent our assumptions about the behavior of the three groups, where votes of experts are expected to have the tightest alignment with the ground-truth ranking (small dispersion), intermediates show moderate dispersion, and non-experts have the highest dispersion. On the other hand, the predictions of experts, intermediates, and non-experts have a lot of overlap, representing each voter’s opinion of the consensus ranking.

The likelihood function is structured to account for the possibility that each participant could belong to any of the three groups. This implies that the observed Kendall-Tau distances for votes and predictions are modeled as a mixture of normal distributions in the rank space, and we can set a maximum likelihood estimation problem to infer various parameters. In particular, we run the NUTS algorithm with four chains, each consisting of 8000 iterations, with 2000 iterations reserved for warm-up.

Figure 1 and Figure 1 depict the distribution of dispersion parameters for Votes (ϕvotessubscriptitalic-ϕvotes\phi_{\text{votes}}italic_ϕ start_POSTSUBSCRIPT votes end_POSTSUBSCRIPT) and Predictions (ϕpredictionssubscriptitalic-ϕpredictions\phi_{\text{predictions}}italic_ϕ start_POSTSUBSCRIPT predictions end_POSTSUBSCRIPT) across different groups for G=2𝐺2G=2italic_G = 2 and G=3𝐺3G=3italic_G = 3. For votes, experts peak at a lower dispersion parameter in both G=2𝐺2G=2italic_G = 2 and G=3𝐺3G=3italic_G = 3, indicating more agreement, while non-experts peak at higher dispersion, showing greater spread in their voting. Experts show a widespread distribution for predictions since they reflect the majority belief, which deviates from the true belief, while non-experts are even farther away. The addition of the intermediate group in G=3𝐺3G=3italic_G = 3 adds valuable insights – their peak lies between experts and non-experts in votes, and their prediction distribution is similarly widespread as experts, reflecting the majority belief. This indicates that modeling voter behavior with more than two groups provides a more accurate and nuanced understanding of the data.

4.2 Concentric Mixture of Plackett-Luce

Refer to caption
(a) G=2
Refer to caption
(b) G=3
Figure 2: Strength parameters for Votes at Positions 1111, 3333, and 5555 for G=2𝐺2G=2italic_G = 2 and 3333 of CMPL Model. We observe a stochastic dominance relationship. Initially, the strength parameter of the expert peaks at a large value, but gradually decreases at higher positions to ensure normalization.

We fit the CMPL with G=2𝐺2G=2italic_G = 2 and 3333 groups. For G=3𝐺3G=3italic_G = 3, the groups are labeled as experts, intermediates, and non-experts. Similar to the CMM model, we infer the proportion of each group (pksubscript𝑝𝑘p_{k}italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT). Additionally, we infer the strength parameters for experts’ votes (θE-votessubscript𝜃𝐸-votes\theta_{E\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_E -votes end_POSTSUBSCRIPT) and predictions (θE-predictionssubscript𝜃𝐸-predictions\theta_{E\text{-predictions}}italic_θ start_POSTSUBSCRIPT italic_E -predictions end_POSTSUBSCRIPT), intermediates’ votes (θI-votessubscript𝜃𝐼-votes\theta_{I\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_I -votes end_POSTSUBSCRIPT) and predictions (θI-predictionssubscript𝜃𝐼-predictions\theta_{I\text{-predictions}}italic_θ start_POSTSUBSCRIPT italic_I -predictions end_POSTSUBSCRIPT), and non-experts’ votes (θNE-votessubscript𝜃𝑁𝐸-votes\theta_{NE\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_N italic_E -votes end_POSTSUBSCRIPT) and predictions (θNE-predictionssubscript𝜃𝑁𝐸-predictions\theta_{NE\text{-predictions}}italic_θ start_POSTSUBSCRIPT italic_N italic_E -predictions end_POSTSUBSCRIPT). We use the Inference Method described earlier in this section, utilizing the No-U-Turn Sampler (NUTS) to explore the parameter space and infer posterior distributions for the model parameters.

Refer to caption
(a) CMM
Refer to caption
(b) CMPL
Figure 3: Distribution of complete rankings (predicted) for each population group. The data contains partial preferences over subsets. The CMPL model provides fine-grained inferences because it learns the weight of each position in the full ranking.

Before sampling, the rankings provided by participants (both votes and predictions) are converted into indices, which correspond to the options being ranked. The strength parameters, which reflect the relative probability of ranking an alternative higher than the others within a group, are inferred separately for experts, intermediates, and non-experts. The model’s priors for the group proportions and the strength parameters are defined as follows:

p𝑝\displaystyle pitalic_p Dirichlet(1,2,3),similar-toabsentDirichlet123\displaystyle\sim\text{Dirichlet}(1,2,3),∼ Dirichlet ( 1 , 2 , 3 ) ,
θE-votessubscript𝜃𝐸-votes\displaystyle\theta_{E\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_E -votes end_POSTSUBSCRIPT Dirichlet(3,3,,3m),θE-predictionsDirichlet(1,1,,1m),formulae-sequencesimilar-toabsentDirichletsubscript333𝑚similar-tosubscript𝜃𝐸-predictionsDirichletsubscript111𝑚\displaystyle\sim\text{Dirichlet}(\underbrace{3,3,\dots,3}_{m}),\quad\theta_{E% \text{-predictions}}\sim\text{Dirichlet}(\underbrace{1,1,\dots,1}_{m}),∼ Dirichlet ( under⏟ start_ARG 3 , 3 , … , 3 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , italic_θ start_POSTSUBSCRIPT italic_E -predictions end_POSTSUBSCRIPT ∼ Dirichlet ( under⏟ start_ARG 1 , 1 , … , 1 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ,
θI-votessubscript𝜃𝐼-votes\displaystyle\theta_{I\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_I -votes end_POSTSUBSCRIPT Dirichlet(2,2,,2m),θI-predictionsDirichlet(1,1,,1m),formulae-sequencesimilar-toabsentDirichletsubscript222𝑚similar-tosubscript𝜃𝐼-predictionsDirichletsubscript111𝑚\displaystyle\sim\text{Dirichlet}(\underbrace{2,2,\dots,2}_{m}),\quad\theta_{I% \text{-predictions}}\sim\text{Dirichlet}(\underbrace{1,1,\dots,1}_{m}),∼ Dirichlet ( under⏟ start_ARG 2 , 2 , … , 2 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , italic_θ start_POSTSUBSCRIPT italic_I -predictions end_POSTSUBSCRIPT ∼ Dirichlet ( under⏟ start_ARG 1 , 1 , … , 1 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ,
θNE-votessubscript𝜃𝑁𝐸-votes\displaystyle\theta_{NE\text{-votes}}italic_θ start_POSTSUBSCRIPT italic_N italic_E -votes end_POSTSUBSCRIPT Dirichlet(1,1,,1m),θNE-predictionsDirichlet(1,1,,1m)formulae-sequencesimilar-toabsentDirichletsubscript111𝑚similar-tosubscript𝜃𝑁𝐸-predictionsDirichletsubscript111𝑚\displaystyle\sim\text{Dirichlet}(\underbrace{1,1,\dots,1}_{m}),\quad\theta_{% NE\text{-predictions}}\sim\text{Dirichlet}(\underbrace{1,1,\dots,1}_{m})∼ Dirichlet ( under⏟ start_ARG 1 , 1 , … , 1 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , italic_θ start_POSTSUBSCRIPT italic_N italic_E -predictions end_POSTSUBSCRIPT ∼ Dirichlet ( under⏟ start_ARG 1 , 1 , … , 1 end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT )

These priors reflect the assumption that experts are expected to have higher strengths, indicating that they consistently rank the correct alternatives higher. Intermediates have moderate strengths, and non-experts are assumed to have the lowest strengths, indicating a less accurate ranking behavior.

In addition, we impose the model constraints described in Section 3.2.1, ensuring that the strength parameters for each group follow the expected relationships (e.g., ensuring that expert strengths are higher and decrease in a structured manner across groups). The likelihood function is structured to account for the mixture model, where participants may belong to one of the three groups. The observed rankings (in the form of indices) are used to compute the log-likelihood based on the Plackett-Luce model, where each group’s strength parameters determine the probability of a particular ranking.

Similar to the CMM model, we run the NUTS algorithm with four chains, each consisting of 6000 iterations, with 2000 iterations reserved for warm-up. Figure 2 and Figure 2 show the distribution of strength parameters of Votes and Predictions for the first, third, and fifth positions in the ranking. We again observe the benefit of having G=3𝐺3G=3italic_G = 3 where the intermediate group peaks between the experts and non-experts (Figure 2, Position 1). Additionally, the recovered strength parameters also demonstrate the stochastic dominance property. Looking at position 1 in both Figure 2 and Figure 2, the strength parameter of the expert peaks at a higher value than the non-experts and intermediates. For positions 3 and 5 the peaks of the experts’ strength parameter shifts left and gradually merges with non-experts, in order to ensure that iθg,i=1subscript𝑖subscript𝜃𝑔𝑖1\sum_{i}\theta_{g,i}=1∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_g , italic_i end_POSTSUBSCRIPT = 1.

4.3 Predicting Complete Rankings from Partial Rankings using CMM and CMPL

We predict the complete ranking of 36 alternatives from partial rankings, for each population group (experts, intermediates, and non-experts) using the CMM and CMPL models. The dataset containing 36363636 alternatives is divided into 12121212 subsets, each containing 5555 alternatives and we collect vote information over these subsets.

In both models, we use a hierarchical approach. We first fit each model to the subsets independently, learning the parameters for the alternatives within each subset. Since some alternatives appear in multiple subsets, this creates transitive relationships that help predict a global ranking across all 36 alternatives accurately. Once the parameters are inferred, we sample from the posterior distributions and input these samples into the respective CMM or CMPL model to generate the full ranking.

CMM. For each subset, we infer the group-specific posterior distribution of dispersion parameters for each population group (experts, intermediates, and non-experts). Using these inferred parameters, we generate rankings by inputting the values into CMM model. This allows us to compute a distribution of Kendall Tau distances by comparing the predicted subset-level rankings to the ground truth for each group. We then sample from the posterior of these group-specific distributions- both the dispersion parameters and Kendall Tau distances- and use these samples in the CMM model to generate full rankings for all 36 alternatives. To quantify uncertainty in these predicted rankings, we apply bootstrapping, which provides a range of plausible full rankings derived from the posterior samples.

CMPL. For each subset, we infer the posterior distribution of group-specific strength parameters for each alternative, providing a probabilistic estimate of each alternative’s rank. We use the CMPL model to iteratively select the alternative with the highest sampled strength parameter at each position, repeating the process for the remaining positions to generate a complete ranking. To quantify uncertainty, we apply bootstrapping, generating a full distribution of plausible complete rankings.

Figure 3 and Figure 3 show the distribution of Kendall Tau distance for each group (experts, intermediates, and non-experts) when the complete rankings are inferred from CMM and CMPL respectively. For the CMPL model, Figure 3, the distributions reflect that experts are closest to the ground truth, followed by intermediates, and then non-experts. This distinction is less pronounced in CMM model, Figure 3. The CMPL model provides more fine-grained inferences because it learns the distribution over each position in the full ranking through the posterior estimates, allowing for more precise predictions of the rank order of alternatives. In contrast, the CMM model is less fine-grained, as it estimates how close the ranking is to the ground truth based on a single dispersion parameter, per population group, that represents the overall distance but lacks detailed information about specific positions within the ranking.

5 Sample Complexity Results

Refer to caption
Refer to caption
Figure 4: Comparison of sample complexity for data generated from CMM and CMPL models with G=3, and aggregated using Copeland and SP-Voting rule.
Refer to caption
Figure 5: Comparison of sample complexity on real data when votes are aggregated using Copeland and SP-Voting.

In this section, we analyze the impact of sample size on ground truth recovery by generating synthetic data using the CMM and CMPL models with G=3𝐺3G=3italic_G = 3. We generate 500 samples with the proportion of experts in the population being 1%percent11\%1 %. Figure 4 present a comparison of how sample size affects the performance of two aggregation methods: Copeland Rule \parencitecopeland1951reasonable and SP-Voting. Figure 5 shows the same comparison on real data. Refer to Figure 6 in Appendix B for results with G=2𝐺2G=2italic_G = 2.

From Figure 4, it is evident that SP-Voting outperforms the Copeland Rule in terms of accurately recovering the ground-truth ranking as the sample size increases. For both CMM and CMPL models, the Kendall Tau distance between the estimated and ground truth rankings consistently decreases with increasing sample sizes. However, the SP-Voting method shows a sharper decline compared to the Copeland Rule, indicating its superior performance in reaching the ground truth. The confidence intervals (shaded areas) for SP-Voting are consistently narrower compared to those for Copeland, implying higher stability and lower variability of SP-Voting across different sampling scenarios. Figure 5 shows the analysis on a limited 48 samples of real data, where we can see a gradual decrease in the mean value and the confidence around it for SP-Voting as compared to Copeland, indicating that with more samples, ground-truth recovery can be achieved faster and with higher certainty using SP-Voting.

Overall, the comparison between the two models— CMM and CMPL—shows similar trends, with SP-Voting consistently outperforming the Copeland Rule across both models. Increasing sample size notably helps both methods, but SP-Voting achieves ground truth recovery with fewer samples and more consistency. This indicates that the prediction information involved in SP-Voting helps correct the effect of non-expert votes and thus helps reach the ground truth faster. These findings reinforce the efficacy of SP-Voting over traditional aggregation rules like the Copeland Rule, in terms of both accuracy and reliability when aggregating rankings to recover the ground truth.

6 Discussion and Future Work

In this work, we have analyzed SP-voting under two concentric rank-order models (Mallows and Plackett-Luce) with an arbitrary number of groups. We observed that real-world datasets often have multiple groups of experts (G3𝐺3G\geq 3italic_G ≥ 3) and SP-voting performs better in terms of sample complexity when compared to standard voting rules. There are many interesting directions for future work. First, \citeauthor*prelec2017solution [prelec2017solution] have proposed the self-predicting property for the general SP algorithms. Although this condition is not sufficient to derive finite sample complexity bounds, it would be interesting to see how it compares with the conditions we derived for various concentric rank-order models. Second, we have seen that moving from G=2𝐺2G=2italic_G = 2 to G=3𝐺3G=3italic_G = 3 groups gives a significantly better fit (and explanation) with respect to the real data but the improvement is marginal for larger values of G𝐺Gitalic_G. Then a natural question is can we choose the number of groups G𝐺Gitalic_G in a a data-dependent way? Finally, in terms of sample complexity, we have analyzed SP-voting for recovering ground truth ranking over m𝑚mitalic_m alternatives, and the bound grows with m!𝑚m!italic_m !. This can be reduced to O(m2)𝑂superscript𝑚2O(m^{2})italic_O ( italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) for the pairwise version of SP-voting considered in prior work [hosseini2021surprisingly] with additional assumptions. However, when the number of alternatives m𝑚mitalic_m is large, we want the sample complexity to be independent of m𝑚mitalic_m. SP-voting with partial preferences [hosseini2024surprising] help in such contexts, and we leave a fine-grained analysis of the partial variants of SP (under various concentric rank-order models) as future work.

Acknowledgments

Hadi Hosseini acknowledges support from NSF IIS grants #2144413 and #2107173. \printbibliography

Appendix A Missing Proofs

A.1 Proof of Lemma 2

Proof.

As mentioned in Lemma 2 in the main text, we partition the set G𝐺Gitalic_G into sets G1={1,2,,s}subscript𝐺112𝑠G_{1}=\left\{1,2,\ldots,s\right\}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 , 2 , … , italic_s } and G2={s+1,,G}subscript𝐺2𝑠1𝐺G_{2}=\left\{s+1,\ldots,G\right\}italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_s + 1 , … , italic_G }. Now that we have simplified the formulation into two partitions, we proceed with an approach inspired by the proof of Lemma 2 in \citeauthor*hosseini2024surprising [hosseini2024surprising] and establish the following upper and lower bounds on prediction normalized vote for G𝐺Gitalic_G groups in CMM model.

f(σ)σ~Prs(σ|σ~)V¯(σ)f(σ)minσ~Prs(σ|σ~)𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎¯𝑉𝜎𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎\frac{f(\sigma)}{\sum_{\tilde{\sigma}}\mathrm{Pr}_{s}(\sigma|\tilde{\sigma})}% \leq\overline{V}(\sigma)\leq\frac{f(\sigma)}{\min_{\tilde{\sigma}}\mathrm{Pr}_% {s}(\sigma|\tilde{\sigma})}divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ | over~ start_ARG italic_σ end_ARG ) end_ARG ≤ over¯ start_ARG italic_V end_ARG ( italic_σ ) ≤ divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG roman_min start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ | over~ start_ARG italic_σ end_ARG ) end_ARG

We can express the probability Prs(σ|σ~)subscriptPr𝑠conditionalsuperscript𝜎~𝜎\mathrm{Pr}_{s}(\sigma^{\star}|\tilde{\sigma})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT | over~ start_ARG italic_σ end_ARG ) as follows

Prs(σ|σ~)=jG1pjϕjd(σ~,σ)Z(ϕj)+jG2pjϕjd(σ~,σ)Z(ϕj)subscriptPr𝑠conditional𝜎~𝜎subscript𝑗subscript𝐺1subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜎𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜎𝑍subscriptitalic-ϕ𝑗\displaystyle\mathrm{Pr}_{s}(\sigma|\tilde{\sigma})=\sum_{j\in G_{1}}p_{j}% \cdot\frac{\phi_{j}^{d(\tilde{\sigma},\sigma)}}{Z(\phi_{j})}+\sum_{j\in G_{2}}% p_{j}\cdot\frac{\phi_{j}^{d(\tilde{\sigma},\sigma)}}{Z(\phi_{j})}roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ | over~ start_ARG italic_σ end_ARG ) = ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_σ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_σ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG

This gives us the following lower bound on V¯(σ)¯𝑉superscript𝜎\overline{V}(\sigma^{\star})over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ).

V¯(σ)¯𝑉superscript𝜎\displaystyle\overline{V}(\sigma^{\star})over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) =jG1pjZ(ϕj)+jG2pjZ(ϕj)σ~(jG1pjϕjd(σ~,σ)Z(ϕj)+jG2pjϕjd(σ~,σ)Z(ϕj))absentsubscript𝑗subscript𝐺1subscript𝑝𝑗𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗𝑍subscriptitalic-ϕ𝑗subscript~𝜎subscript𝑗subscript𝐺1subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎superscript𝜎𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎superscript𝜎𝑍subscriptitalic-ϕ𝑗\displaystyle=\frac{\sum_{j\in G_{1}}\frac{p_{j}}{Z(\phi_{j})}+\sum_{j\in G_{2% }}\frac{p_{j}}{Z(\phi_{j})}}{\sum_{\tilde{\sigma}}\left(\sum_{j\in G_{1}}p_{j}% \cdot\frac{\phi_{j}^{d(\tilde{\sigma},\sigma^{\star})}}{Z(\phi_{j})}+\sum_{j% \in G_{2}}p_{j}\cdot\frac{\phi_{j}^{d(\tilde{\sigma},\sigma^{\star})}}{Z(\phi_% {j})}\right)}= divide start_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG end_ARG start_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG ) end_ARG
1Z(ϕs)jG1pj+1Z(ϕG)jG2pjjG1pjσ~ϕjd(σ,σ~)Z(ϕj)+jG2pjσ~ϕjd(σ,σ~)Z(ϕj)absent1𝑍subscriptitalic-ϕ𝑠subscript𝑗subscript𝐺1subscript𝑝𝑗1𝑍subscriptitalic-ϕ𝐺subscript𝑗subscript𝐺2subscript𝑝𝑗subscript𝑗subscript𝐺1subscript𝑝𝑗subscript~𝜎superscriptsubscriptitalic-ϕ𝑗𝑑superscript𝜎~𝜎𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗subscript~𝜎superscriptsubscriptitalic-ϕ𝑗𝑑superscript𝜎~𝜎𝑍subscriptitalic-ϕ𝑗\displaystyle\geq\frac{\frac{1}{Z(\phi_{s})}\sum_{j\in G_{1}}p_{j}+\frac{1}{Z(% \phi_{G})}\sum_{j\in G_{2}}p_{j}}{\sum_{j\in G_{1}}p_{j}\sum_{\tilde{\sigma}}% \frac{\phi_{j}^{d(\sigma^{\star},\tilde{\sigma})}}{Z(\phi_{j})}+\sum_{j\in G_{% 2}}p_{j}\sum_{\tilde{\sigma}}\frac{\phi_{j}^{d(\sigma^{\star},\tilde{\sigma})}% }{Z(\phi_{j})}}≥ divide start_ARG divide start_ARG 1 end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , over~ start_ARG italic_σ end_ARG ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , over~ start_ARG italic_σ end_ARG ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG end_ARG
=αZ(ϕs)+1αZ(ϕG)jG1pj+jG2pjabsent𝛼𝑍subscriptitalic-ϕ𝑠1𝛼𝑍subscriptitalic-ϕ𝐺subscript𝑗subscript𝐺1subscript𝑝𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗\displaystyle=\frac{\frac{\alpha}{Z(\phi_{s})}+\frac{1-\alpha}{Z(\phi_{G})}}{% \sum_{j\in G_{1}}p_{j}+\sum_{j\in G_{2}}p_{j}}= divide start_ARG divide start_ARG italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG + divide start_ARG 1 - italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) end_ARG end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG
=αZ(ϕs)+1αZ(ϕG)absent𝛼𝑍subscriptitalic-ϕ𝑠1𝛼𝑍subscriptitalic-ϕ𝐺\displaystyle=\frac{\alpha}{Z(\phi_{s})}+\frac{1-\alpha}{Z(\phi_{G})}= divide start_ARG italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG + divide start_ARG 1 - italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) end_ARG

We can also obtain the following upper bound on V¯(τ)¯𝑉𝜏\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ).

V¯(τ)¯𝑉𝜏\displaystyle\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ) jG1pjϕjZ(ϕj)+jG2pjϕjZ(ϕj)σ~(jG1pjϕjd(σ~,τ)Z(ϕj)+jG2pjϕjd(σ~,τ)Z(ϕj))absentsubscript𝑗subscript𝐺1subscript𝑝𝑗subscriptitalic-ϕ𝑗𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗subscriptitalic-ϕ𝑗𝑍subscriptitalic-ϕ𝑗subscript~𝜎subscript𝑗subscript𝐺1subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜏𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜏𝑍subscriptitalic-ϕ𝑗\displaystyle\leq\frac{\sum_{j\in G_{1}}p_{j}\cdot\frac{\phi_{j}}{Z(\phi_{j})}% +\sum_{j\in G_{2}}p_{j}\cdot\frac{\phi_{j}}{Z(\phi_{j})}}{\sum_{\tilde{\sigma}% }\left(\sum_{j\in G_{1}}p_{j}\cdot\frac{\phi_{j}^{d(\tilde{\sigma},\tau)}}{Z(% \phi_{j})}+\sum_{j\in G_{2}}p_{j}\cdot\frac{\phi_{j}^{d(\tilde{\sigma},\tau)}}% {Z(\phi_{j})}\right)}≤ divide start_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG end_ARG start_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_τ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_τ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG ) end_ARG
ϕsZ(ϕ1)jG1pj+ϕGZ(ϕs+1)jG2pjjG1pjσ~ϕjd(σ~,τ)Z(ϕj)+jG2pjσ~ϕjd(σ~,τ)Z(ϕj)absentsubscriptitalic-ϕ𝑠𝑍subscriptitalic-ϕ1subscript𝑗subscript𝐺1subscript𝑝𝑗subscriptitalic-ϕ𝐺𝑍subscriptitalic-ϕ𝑠1subscript𝑗subscript𝐺2subscript𝑝𝑗subscript𝑗subscript𝐺1subscript𝑝𝑗subscript~𝜎superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜏𝑍subscriptitalic-ϕ𝑗subscript𝑗subscript𝐺2subscript𝑝𝑗subscript~𝜎superscriptsubscriptitalic-ϕ𝑗𝑑~𝜎𝜏𝑍subscriptitalic-ϕ𝑗\displaystyle\leq\frac{\frac{\phi_{s}}{Z(\phi_{1})}\sum_{j\in G_{1}}p_{j}+% \frac{\phi_{G}}{Z(\phi_{s+1})}\sum_{j\in G_{2}}p_{j}}{\sum_{j\in G_{1}}p_{j}% \sum_{\tilde{\sigma}}\frac{\phi_{j}^{d(\tilde{\sigma},\tau)}}{Z(\phi_{j})}+% \sum_{j\in G_{2}}p_{j}\cdot\sum_{\tilde{\sigma}}\frac{\phi_{j}^{d(\tilde{% \sigma},\tau)}}{Z(\phi_{j})}}≤ divide start_ARG divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s + 1 end_POSTSUBSCRIPT ) end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_τ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⋅ ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d ( over~ start_ARG italic_σ end_ARG , italic_τ ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG end_ARG
=ϕsZ(ϕ1)α+ϕGZ(ϕs+1)(1α)absentsubscriptitalic-ϕ𝑠𝑍subscriptitalic-ϕ1𝛼subscriptitalic-ϕ𝐺𝑍subscriptitalic-ϕ𝑠11𝛼\displaystyle=\frac{\phi_{s}}{Z(\phi_{1})}\alpha+\frac{\phi_{G}}{Z(\phi_{s+1})% }(1-\alpha)= divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG italic_α + divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s + 1 end_POSTSUBSCRIPT ) end_ARG ( 1 - italic_α )

Therefore, in order to ensure V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ) we need the following condition.

αZ(ϕs)+1αZ(ϕG)2(ϕsZ(ϕ1)α+ϕGZ(ϕs+1)(1α))𝛼𝑍subscriptitalic-ϕ𝑠1𝛼𝑍subscriptitalic-ϕ𝐺2subscriptitalic-ϕ𝑠𝑍subscriptitalic-ϕ1𝛼subscriptitalic-ϕ𝐺𝑍subscriptitalic-ϕ𝑠11𝛼\frac{\alpha}{Z(\phi_{s})}+\frac{1-\alpha}{Z(\phi_{G})}\geq 2\left(\frac{\phi_% {s}}{Z(\phi_{1})}\alpha+\frac{\phi_{G}}{Z(\phi_{s+1})}(1-\alpha)\right)divide start_ARG italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) end_ARG + divide start_ARG 1 - italic_α end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) end_ARG ≥ 2 ( divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG italic_α + divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT end_ARG start_ARG italic_Z ( italic_ϕ start_POSTSUBSCRIPT italic_s + 1 end_POSTSUBSCRIPT ) end_ARG ( 1 - italic_α ) )

A.2 Proof of Lemma 3

Proof.

The proof is a direct application of the proof of Lemma 2 in \citeauthor*hosseini2024surprising [hosseini2024surprising] with the only difference being that the parameters under consideration are of CMPL instead of CMM. Here, we establish the following upper and lower bounds on prediction normalized vote for G=2𝐺2G=2italic_G = 2 groups in CMPL model.

f(σ)σ~Prs(σσ~)V¯(σ)f(σ)minσ~Prs(σσ~)𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎¯𝑉𝜎𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎\frac{f(\sigma)}{\sum_{\tilde{\sigma}}\mathrm{Pr}_{s}(\sigma\mid\tilde{\sigma}% )}\leq\overline{V}(\sigma)\leq\frac{f(\sigma)}{\min_{\tilde{\sigma}}\mathrm{Pr% }_{s}(\sigma\mid\tilde{\sigma})}divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ over~ start_ARG italic_σ end_ARG ) end_ARG ≤ over¯ start_ARG italic_V end_ARG ( italic_σ ) ≤ divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG roman_min start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ over~ start_ARG italic_σ end_ARG ) end_ARG (10)

Suppose σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the true ranking and consider any ranking τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1. Without loss of generality, we can assume that σ=12msuperscript𝜎1succeeds2succeedssucceeds𝑚\sigma^{\star}=1\succ 2\succ\ldots\succ mitalic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = 1 ≻ 2 ≻ … ≻ italic_m. This also implies that θg,1θg,2θg,msubscript𝜃𝑔1subscript𝜃𝑔2subscript𝜃𝑔𝑚\theta_{g,1}\geq\theta_{g,2}\geq\ldots\geq\theta_{g,m}italic_θ start_POSTSUBSCRIPT italic_g , 1 end_POSTSUBSCRIPT ≥ italic_θ start_POSTSUBSCRIPT italic_g , 2 end_POSTSUBSCRIPT ≥ … ≥ italic_θ start_POSTSUBSCRIPT italic_g , italic_m end_POSTSUBSCRIPT for any group g𝑔gitalic_g.

Under the assumption of Concentric mixture of Plackett-Luce model we have,

Prs(σσ~)subscriptPr𝑠conditionalsuperscript𝜎~𝜎\displaystyle\mathrm{Pr}_{s}(\sigma^{\star}\mid\tilde{\sigma})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ over~ start_ARG italic_σ end_ARG ) =pPrs(σθ1,σ~)+(1p)Prs(σθ2,σ~)absent𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃1~𝜎1𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃2~𝜎\displaystyle=p\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{1},\tilde{\sigma% })+(1-p)\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{2},\tilde{\sigma})= italic_p ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over~ start_ARG italic_σ end_ARG ) + ( 1 - italic_p ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , over~ start_ARG italic_σ end_ARG )
=pj=1mθ1,σ~1(σ(j))i=jmθ1,σ~1(σ(i))+(1p)j=1mθ2,σ~1(σ(j))i=jmθ2,σ~1(σ(i))absent𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃1superscript~𝜎1superscript𝜎𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1superscript~𝜎1superscript𝜎𝑖1𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃2superscript~𝜎1superscript𝜎𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2superscript~𝜎1superscript𝜎𝑖\displaystyle=p\cdot\prod_{j=1}^{m}\frac{\theta_{1,\tilde{\sigma}^{-1}(\sigma^% {\star}(j))}}{\sum_{i=j}^{m}\theta_{1,\tilde{\sigma}^{-1}(\sigma^{\star}(i))}}% +(1-p)\cdot\prod_{j=1}^{m}\frac{\theta_{2,\tilde{\sigma}^{-1}(\sigma^{\star}(j% ))}}{\sum_{i=j}^{m}\theta_{2,\tilde{\sigma}^{-1}(\sigma^{\star}(i))}}= italic_p ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_j ) ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_i ) ) end_POSTSUBSCRIPT end_ARG + ( 1 - italic_p ) ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_j ) ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_i ) ) end_POSTSUBSCRIPT end_ARG

When θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT stochastically dominates θ2subscript𝜃2\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT we have Prs(σθ1,σ)Prs(σθ2,σ)subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃1superscript𝜎subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃2superscript𝜎\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{1},\sigma^{\star})\geq\mathrm{Pr}_{s% }(\sigma^{\star}\mid\theta_{2},\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ). Moreover, using the fact p<(1p)𝑝1𝑝p<(1-p)italic_p < ( 1 - italic_p ) we obtain the following lower bound on V¯(σ)¯𝑉superscript𝜎\overline{V}(\sigma^{\star})over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ).

V¯(σ)2pj=1mθ2,ji=jmθ2,i(1p)σ~j=1mθ1,σ~1(j)i=jmθ1,σ~1(i)+j=1mθ2,σ~1(j)i=jmθ2,σ~1(i)=p1pj=1mθ2,ji=jmθ2,i¯𝑉superscript𝜎2𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃2𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2𝑖1𝑝subscript~𝜎superscriptsubscriptproduct𝑗1𝑚subscript𝜃1superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1superscript~𝜎1𝑖superscriptsubscriptproduct𝑗1𝑚subscript𝜃2superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2superscript~𝜎1𝑖𝑝1𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃2𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2𝑖\displaystyle\overline{V}(\sigma^{\star})\geq\frac{2p\cdot\prod_{j=1}^{m}\frac% {\theta_{2,j}}{\sum_{i=j}^{m}\theta_{2,i}}}{(1-p)\cdot\sum_{\tilde{\sigma}}% \prod_{j=1}^{m}\frac{\theta_{1,\tilde{\sigma}^{-1}(j)}}{\sum_{i=j}^{m}\theta_{% 1,\tilde{\sigma}^{-1}(i)}}+\prod_{j=1}^{m}\frac{\theta_{2,\tilde{\sigma}^{-1}(% j)}}{\sum_{i=j}^{m}\theta_{2,\tilde{\sigma}^{-1}(i)}}}=\frac{p}{1-p}\cdot\prod% _{j=1}^{m}\frac{\theta_{2,j}}{\sum_{i=j}^{m}\theta_{2,i}}over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ divide start_ARG 2 italic_p ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG ( 1 - italic_p ) ⋅ ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG + ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG end_ARG = divide start_ARG italic_p end_ARG start_ARG 1 - italic_p end_ARG ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_i end_POSTSUBSCRIPT end_ARG

The last equality uses lemma 6. We now provide an upper bound on V¯(τ)¯𝑉𝜏\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ).

Prs(τσ)subscriptPr𝑠conditional𝜏superscript𝜎\displaystyle\mathrm{Pr}_{s}(\tau\mid\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) =pPrs(τθ1,σ)+(1p)Prs(τθ2,σ)absent𝑝subscriptPr𝑠conditional𝜏subscript𝜃1superscript𝜎1𝑝subscriptPr𝑠conditional𝜏subscript𝜃2superscript𝜎\displaystyle=p\cdot\mathrm{Pr}_{s}(\tau\mid\theta_{1},\sigma^{\star})+(1-p)% \cdot\mathrm{Pr}_{s}(\tau\mid\theta_{2},\sigma^{\star})= italic_p ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + ( 1 - italic_p ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
pPrs(σθ1,σ)+(1p)Prs(σθ2,σ)absent𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃1superscript𝜎1𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃2superscript𝜎\displaystyle\leq p\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{1},\sigma^{% \star})+(1-p)\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{2},\sigma^{\star})≤ italic_p ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + ( 1 - italic_p ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
2(1p)Prs(σθ1,σ)absent21𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃1superscript𝜎\displaystyle\leq 2(1-p)\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta_{1},% \sigma^{\star})≤ 2 ( 1 - italic_p ) ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )

The first inequality follows because the elements of θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and θ2subscript𝜃2\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are arranged in non-decreasing order. The second inequality follows because θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT stochastically dominates θ2subscript𝜃2\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. On the other hand,

minσ~Prs(τσ~)subscript~𝜎subscriptPr𝑠conditional𝜏~𝜎\displaystyle\min_{\tilde{\sigma}}\mathrm{Pr}_{s}(\tau\mid\tilde{\sigma})roman_min start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ over~ start_ARG italic_σ end_ARG ) p(j=1mθ1,mj+1i=jmθ1,mi+1+j=1mθ2,mj+1i=jmθ2,mi+1)absent𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1superscriptsubscriptproduct𝑗1𝑚subscript𝜃2𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃2𝑚𝑖1\displaystyle\geq p\left(\prod_{j=1}^{m}\frac{\theta_{1,m-j+1}}{\sum_{i=j}^{m}% \theta_{1,m-i+1}}+\prod_{j=1}^{m}\frac{\theta_{2,m-j+1}}{\sum_{i=j}^{m}\theta_% {2,m-i+1}}\right)≥ italic_p ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG )
2pj=1mθ1,mj+1i=jmθ1,mi+1absent2𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1\displaystyle\geq 2p\cdot\prod_{j=1}^{m}\frac{\theta_{1,m-j+1}}{\sum_{i=j}^{m}% \theta_{1,m-i+1}}≥ 2 italic_p ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG

The last inequality follows since θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT stochastically dominates θ2subscript𝜃2\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Now we have the following upper bound on V¯(τ)¯𝑉𝜏\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ).

V¯(τ)1ppj=1mθ1,ji=jmθ1,ij=1mθ1,mj+1i=jmθ1,mi+1¯𝑉𝜏1𝑝𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1\displaystyle\overline{V}(\tau)\leq\frac{1-p}{p}\cdot\frac{\prod_{j=1}^{m}% \frac{\theta_{1,j}}{\sum_{i=j}^{m}\theta_{1,i}}}{\prod_{j=1}^{m}\frac{\theta_{% 1,m-j+1}}{\sum_{i=j}^{m}\theta_{1,m-i+1}}}over¯ start_ARG italic_V end_ARG ( italic_τ ) ≤ divide start_ARG 1 - italic_p end_ARG start_ARG italic_p end_ARG ⋅ divide start_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG end_ARG

Therefore, as long as

(p1p)22(j=1mθ2,ji=jmθ2,i)(j=1mθ1,ji=jmθ1,i)1(j=1mθ1,mj+1i=jmθ1,mi+1)superscript𝑝1𝑝22superscriptsubscriptproduct𝑗1𝑚subscript𝜃2𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃2𝑖superscriptsuperscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖1superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1\left(\frac{p}{1-p}\right)^{2}\geq 2\cdot\left(\prod_{j=1}^{m}\frac{\theta_{2,% j}}{\sum_{i=j}^{m}\theta_{2,i}}\right)\left(\prod_{j=1}^{m}\frac{\theta_{1,j}}% {\sum_{i=j}^{m}\theta_{1,i}}\right)^{-1}\left(\prod_{j=1}^{m}\frac{\theta_{1,m% -j+1}}{\sum_{i=j}^{m}\theta_{1,m-i+1}}\right)( divide start_ARG italic_p end_ARG start_ARG 1 - italic_p end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 2 ⋅ ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 2 , italic_i end_POSTSUBSCRIPT end_ARG ) ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG )

we are guaranteed that V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ). ∎

Lemma 6.

For any vector u=(u1,,um)𝑢subscript𝑢1subscript𝑢𝑚u=(u_{1},\ldots,u_{m})italic_u = ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) we have,

σ~j=1muσ~1(j)i=jmuσ~1(i)=1subscript~𝜎superscriptsubscriptproduct𝑗1𝑚subscript𝑢superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝑢superscript~𝜎1𝑖1\sum_{\tilde{\sigma}}\prod_{j=1}^{m}\frac{u_{\tilde{\sigma}^{-1}(j)}}{\sum_{i=% j}^{m}u_{\tilde{\sigma}^{-1}(i)}}=1∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG = 1
Proof.

We prove this result by induction on m𝑚mitalic_m. For m=1𝑚1m=1italic_m = 1, there is only one permutation and the base case holds. Suppose, the claim is true for m𝑚mitalic_m. Then we have,

σ~j=1m+1uσ~1(j)i=jm+1uσ~1(i)=aσ~:σ~[1]=aj=1m+1uσ~1(j)i=jm+1uσ~1(i)subscript~𝜎superscriptsubscriptproduct𝑗1𝑚1subscript𝑢superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚1subscript𝑢superscript~𝜎1𝑖subscript𝑎subscript:~𝜎~𝜎delimited-[]1𝑎superscriptsubscriptproduct𝑗1𝑚1subscript𝑢superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚1subscript𝑢superscript~𝜎1𝑖\displaystyle\sum_{\tilde{\sigma}}\prod_{j=1}^{m+1}\frac{u_{\tilde{\sigma}^{-1% }(j)}}{\sum_{i=j}^{m+1}u_{\tilde{\sigma}^{-1}(i)}}=\sum_{a}\sum_{\tilde{\sigma% }:\tilde{\sigma}[1]=a}\prod_{j=1}^{m+1}\frac{u_{\tilde{\sigma}^{-1}(j)}}{\sum_% {i=j}^{m+1}u_{\tilde{\sigma}^{-1}(i)}}∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG : over~ start_ARG italic_σ end_ARG [ 1 ] = italic_a end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG
=\displaystyle== auaj=1m+1ujσ~:σ~𝒮j=1muσ~1(j)i=jm+1uσ~1(i)subscript𝑎subscript𝑢𝑎superscriptsubscript𝑗1𝑚1subscript𝑢𝑗subscript:~𝜎~𝜎𝒮superscriptsubscriptproduct𝑗1𝑚subscript𝑢superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚1subscript𝑢superscript~𝜎1𝑖\displaystyle\sum_{a}\frac{u_{a}}{\sum_{j=1}^{m+1}u_{j}}\sum_{\tilde{\sigma}:% \tilde{\sigma}\in\mathcal{S}}\prod_{j=1}^{m}\frac{u_{\tilde{\sigma}^{-1}(j)}}{% \sum_{i=j}^{m+1}u_{\tilde{\sigma}^{-1}(i)}}∑ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG : over~ start_ARG italic_σ end_ARG ∈ caligraphic_S end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG
=auaj=1m+1uj=1absentsubscript𝑎subscript𝑢𝑎superscriptsubscript𝑗1𝑚1subscript𝑢𝑗1\displaystyle=\sum_{a}\frac{u_{a}}{\sum_{j=1}^{m+1}u_{j}}=1= ∑ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT divide start_ARG italic_u start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG = 1

A.3 Proof of Lemma 4

Proof.

As mentioned in Lemma 4 in the main text, we partition the set G𝐺Gitalic_G into sets G1={1,2,,s}subscript𝐺112𝑠G_{1}=\left\{1,2,\ldots,s\right\}italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { 1 , 2 , … , italic_s } and G2={s+1,,G}subscript𝐺2𝑠1𝐺G_{2}=\left\{s+1,\ldots,G\right\}italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_s + 1 , … , italic_G }. Now that we have simplified the formulation into two partitions, we proceed with an approach inspired by the proof of Lemma 2 in \citeauthor*hosseini2024surprising [hosseini2024surprising] and establish the following upper and lower bounds on prediction normalized vote for G𝐺Gitalic_G groups in CMPL model.

f(σ)σ~Prs(σσ~)V¯(σ)f(σ)minσ~Prs(σσ~)𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎¯𝑉𝜎𝑓𝜎subscript~𝜎subscriptPr𝑠conditional𝜎~𝜎\frac{f(\sigma)}{\sum_{\tilde{\sigma}}\mathrm{Pr}_{s}(\sigma\mid\tilde{\sigma}% )}\leq\overline{V}(\sigma)\leq\frac{f(\sigma)}{\min_{\tilde{\sigma}}\mathrm{Pr% }_{s}(\sigma\mid\tilde{\sigma})}divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ over~ start_ARG italic_σ end_ARG ) end_ARG ≤ over¯ start_ARG italic_V end_ARG ( italic_σ ) ≤ divide start_ARG italic_f ( italic_σ ) end_ARG start_ARG roman_min start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ ∣ over~ start_ARG italic_σ end_ARG ) end_ARG (11)

Suppose σsuperscript𝜎\sigma^{\star}italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT is the true ranking and consider any ranking τ𝜏\tauitalic_τ with d(τ,σ)1𝑑𝜏superscript𝜎1d(\tau,\sigma^{\star})\geq 1italic_d ( italic_τ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 1. Without loss of generality, we can assume that σ=12msuperscript𝜎1succeeds2succeedssucceeds𝑚\sigma^{\star}=1\succ 2\succ\ldots\succ mitalic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = 1 ≻ 2 ≻ … ≻ italic_m. This also implies that θg,1θg,2θg,msubscript𝜃𝑔1subscript𝜃𝑔2subscript𝜃𝑔𝑚\theta_{g,1}\geq\theta_{g,2}\geq\ldots\geq\theta_{g,m}italic_θ start_POSTSUBSCRIPT italic_g , 1 end_POSTSUBSCRIPT ≥ italic_θ start_POSTSUBSCRIPT italic_g , 2 end_POSTSUBSCRIPT ≥ … ≥ italic_θ start_POSTSUBSCRIPT italic_g , italic_m end_POSTSUBSCRIPT for any group g𝑔gitalic_g.

Under the assumption of Concentric mixture of Plackett-Luce model we have,

Prs(σσ~)subscriptPr𝑠conditionalsuperscript𝜎~𝜎\displaystyle\mathrm{Pr}_{s}(\sigma^{\star}\mid\tilde{\sigma})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ over~ start_ARG italic_σ end_ARG ) ==1GpPrs(σθ,σ~)absentsuperscriptsubscript1𝐺subscript𝑝subscriptPr𝑠conditionalsuperscript𝜎subscript𝜃~𝜎\displaystyle=\sum_{\ell=1}^{G}p_{\ell}\cdot\mathrm{Pr}_{s}(\sigma^{\star}\mid% \theta_{\ell},\tilde{\sigma})= ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , over~ start_ARG italic_σ end_ARG )
==1Gpj=1mθ,σ~1(σ(j))i=jmθ,σ~1(σ(i))absentsuperscriptsubscript1𝐺subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃superscript~𝜎1superscript𝜎𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃superscript~𝜎1superscript𝜎𝑖\displaystyle=\sum_{\ell=1}^{G}p_{\ell}\cdot\prod_{j=1}^{m}\frac{\theta_{\ell,% \tilde{\sigma}^{-1}(\sigma^{\star}(j))}}{\sum_{i=j}^{m}\theta_{\ell,\tilde{% \sigma}^{-1}(\sigma^{\star}(i))}}= ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_j ) ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ( italic_i ) ) end_POSTSUBSCRIPT end_ARG
==1Gpj=1mθ,σ~1(j)i=jmθ,σ~1(i)absentsuperscriptsubscript1𝐺subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃superscript~𝜎1𝑖\displaystyle=\sum_{\ell=1}^{G}p_{\ell}\cdot\prod_{j=1}^{m}\frac{\theta_{\ell,% \tilde{\sigma}^{-1}(j)}}{\sum_{i=j}^{m}\theta_{\ell,\tilde{\sigma}^{-1}(i)}}= ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG

When θ𝜃\thetaitalic_θ stochastically dominates θsuperscript𝜃\theta^{\prime}italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT we have Prs(σθ,σ)Prs(σθ,σ)subscriptPr𝑠conditionalsuperscript𝜎𝜃superscript𝜎subscriptPr𝑠conditionalsuperscript𝜎superscript𝜃superscript𝜎\mathrm{Pr}_{s}(\sigma^{\star}\mid\theta,\sigma^{\star})\geq\mathrm{Pr}_{s}(% \sigma^{\star}\mid\theta^{\prime},\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∣ italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ). This gives us the following lower bound on V¯(σ)¯𝑉superscript𝜎\overline{V}(\sigma^{\star})over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ).

V¯(σ)¯𝑉superscript𝜎\displaystyle\overline{V}(\sigma^{\star})over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) G1pj=1mθs,ji=jmθs,i+G2pj=1mθG,ji=jmθG,ipσ~j=1mθ,σ~1(j)i=jmθ,σ~1(i)absentsubscriptsubscript𝐺1subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠𝑖subscriptsubscript𝐺2subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝐺𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝐺𝑖subscriptsubscript𝑝subscript~𝜎superscriptsubscriptproduct𝑗1𝑚subscript𝜃superscript~𝜎1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃superscript~𝜎1𝑖\displaystyle\geq\frac{\sum_{\ell\in G_{1}}p_{\ell}\cdot\prod_{j=1}^{m}\frac{% \theta_{s,j}}{\sum_{i=j}^{m}\theta_{s,i}}+\sum_{\ell\in G_{2}}p_{\ell}\cdot% \prod_{j=1}^{m}\frac{\theta_{G,j}}{\sum_{i=j}^{m}\theta_{G,i}}}{\sum_{\ell}p_{% \ell}\cdot\sum_{\tilde{\sigma}}\prod_{j=1}^{m}\frac{\theta_{\ell,\tilde{\sigma% }^{-1}(j)}}{\sum_{i=j}^{m}\theta_{\ell,\tilde{\sigma}^{-1}(i)}}}≥ divide start_ARG ∑ start_POSTSUBSCRIPT roman_ℓ ∈ italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s , italic_i end_POSTSUBSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT roman_ℓ ∈ italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_G , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_G , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG ∑ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ ∑ start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_ℓ , over~ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) end_POSTSUBSCRIPT end_ARG end_ARG
=αj=1mθs,ji=jmθs,i+(1α)j=1mθG,ji=jmθG,iabsent𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠𝑖1𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝐺𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝐺𝑖\displaystyle=\alpha\prod_{j=1}^{m}\frac{\theta_{s,j}}{\sum_{i=j}^{m}\theta_{s% ,i}}+(1-\alpha)\prod_{j=1}^{m}\frac{\theta_{G,j}}{\sum_{i=j}^{m}\theta_{G,i}}= italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s , italic_i end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_G , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_G , italic_i end_POSTSUBSCRIPT end_ARG

The last equality uses lemma 6. We now provide an upper bound on V¯(τ)¯𝑉𝜏\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ).

Prs(τσ)subscriptPr𝑠conditional𝜏superscript𝜎\displaystyle\mathrm{Pr}_{s}(\tau\mid\sigma^{\star})roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ==1GpPrs(τθ,σ)absentsuperscriptsubscript1𝐺subscript𝑝subscriptPr𝑠conditional𝜏subscript𝜃superscript𝜎\displaystyle=\sum_{\ell=1}^{G}p_{\ell}\cdot\mathrm{Pr}_{s}(\tau\mid\theta_{% \ell},\sigma^{\star})= ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )
==1spPrs(τθ,σ)+=s+1GpPrs(τθ,σ)absentsuperscriptsubscript1𝑠subscript𝑝subscriptPr𝑠conditional𝜏subscript𝜃superscript𝜎superscriptsubscript𝑠1𝐺subscript𝑝subscriptPr𝑠conditional𝜏subscript𝜃superscript𝜎\displaystyle=\sum_{\ell=1}^{s}p_{\ell}\cdot\mathrm{Pr}_{s}(\tau\mid\theta_{% \ell},\sigma^{\star})+\sum_{\ell=s+1}^{G}p_{\ell}\cdot\mathrm{Pr}_{s}(\tau\mid% \theta_{\ell},\sigma^{\star})= ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT roman_ℓ = italic_s + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⋅ roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT )

Now using the stochastic dominance relation, we obtain the lower bound.

minσ~Prs(τσ~)subscript~𝜎subscriptPr𝑠conditional𝜏~𝜎\displaystyle\min_{\tilde{\sigma}}\mathrm{Pr}_{s}(\tau\mid\tilde{\sigma})roman_min start_POSTSUBSCRIPT over~ start_ARG italic_σ end_ARG end_POSTSUBSCRIPT roman_Pr start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_τ ∣ over~ start_ARG italic_σ end_ARG ) =1spj=1mθ1,mj+1i=jmθ1,mi+1+=s+1Gpj=1mθs+1,mj+1i=jmθs+1,mi+1absentsuperscriptsubscript1𝑠subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖1superscriptsubscript𝑠1𝐺subscript𝑝superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑚𝑖1\displaystyle\geq\sum_{\ell=1}^{s}p_{\ell}\prod_{j=1}^{m}\frac{\theta_{1,m-j+1% }}{\sum_{i=j}^{m}\theta_{1,m-i+1}}+\sum_{\ell=s+1}^{G}p_{\ell}\prod_{j=1}^{m}% \frac{\theta_{s+1,m-j+1}}{\sum_{i=j}^{m}\theta_{s+1,m-i+1}}≥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT roman_ℓ = italic_s + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG
αj=1mθ1,mj+1i=jmθ1,mi+1+(1α)j=1mθs+1,mj+1i=jmθs+1,mi+1absent𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖11𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑚𝑖1\displaystyle\geq\alpha\cdot\prod_{j=1}^{m}\frac{\theta_{1,m-j+1}}{\sum_{i=j}^% {m}\theta_{1,m-i+1}}+(1-\alpha)\cdot\prod_{j=1}^{m}\frac{\theta_{s+1,m-j+1}}{% \sum_{i=j}^{m}\theta_{s+1,m-i+1}}≥ italic_α ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG

Now we have the following upper bound on V¯(τ)¯𝑉𝜏\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_τ ).

V¯(τ)αj=1mθ1,ji=jmθ1,i+(1α)j=1mθs+1,ji=jmθs+1,iαj=1mθ1,mj+1i=jmθ1,mi+1+(1α)j=1mθs+1,mj+1i=jmθs+1,mi+1¯𝑉𝜏𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖1𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑖𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖11𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑚𝑖1\displaystyle\overline{V}(\tau)\leq\frac{\alpha\prod_{j=1}^{m}\frac{\theta_{1,% j}}{\sum_{i=j}^{m}\theta_{1,i}}+(1-\alpha)\prod_{j=1}^{m}\frac{\theta_{s+1,j}}% {\sum_{i=j}^{m}\theta_{s+1,i}}}{\alpha\cdot\prod_{j=1}^{m}\frac{\theta_{1,m-j+% 1}}{\sum_{i=j}^{m}\theta_{1,m-i+1}}+(1-\alpha)\cdot\prod_{j=1}^{m}\frac{\theta% _{s+1,m-j+1}}{\sum_{i=j}^{m}\theta_{s+1,m-i+1}}}over¯ start_ARG italic_V end_ARG ( italic_τ ) ≤ divide start_ARG italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_α ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG end_ARG

Therefore, as long as

αj=1mθs,ji=jmθs,i+(1α)j=1mθG,ji=jmθG,i2αj=1mθ1,ji=jmθ1,i+2(1α)j=1mθs+1,ji=jmθs+1,iαj=1mθ1,mj+1i=jmθ1,mi+1+(1α)j=1mθs+1,mj+1i=jmθs+1,mi+1𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠𝑖1𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝐺𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝐺𝑖2𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑖21𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑗superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑖𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃1𝑚𝑖11𝛼superscriptsubscriptproduct𝑗1𝑚subscript𝜃𝑠1𝑚𝑗1superscriptsubscript𝑖𝑗𝑚subscript𝜃𝑠1𝑚𝑖1\alpha\prod_{j=1}^{m}\frac{\theta_{s,j}}{\sum_{i=j}^{m}\theta_{s,i}}+(1-\alpha% )\prod_{j=1}^{m}\frac{\theta_{G,j}}{\sum_{i=j}^{m}\theta_{G,i}}\geq\frac{2% \alpha\prod_{j=1}^{m}\frac{\theta_{1,j}}{\sum_{i=j}^{m}\theta_{1,i}}+2(1-% \alpha)\prod_{j=1}^{m}\frac{\theta_{s+1,j}}{\sum_{i=j}^{m}\theta_{s+1,i}}}{% \alpha\cdot\prod_{j=1}^{m}\frac{\theta_{1,m-j+1}}{\sum_{i=j}^{m}\theta_{1,m-i+% 1}}+(1-\alpha)\cdot\prod_{j=1}^{m}\frac{\theta_{s+1,m-j+1}}{\sum_{i=j}^{m}% \theta_{s+1,m-i+1}}}italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s , italic_i end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_G , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_G , italic_i end_POSTSUBSCRIPT end_ARG ≥ divide start_ARG 2 italic_α ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT end_ARG + 2 ( 1 - italic_α ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_α ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG + ( 1 - italic_α ) ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_j + 1 end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_s + 1 , italic_m - italic_i + 1 end_POSTSUBSCRIPT end_ARG end_ARG

we are guaranteed that V¯(σ)2V¯(τ)¯𝑉superscript𝜎2¯𝑉𝜏\overline{V}(\sigma^{\star})\geq 2\overline{V}(\tau)over¯ start_ARG italic_V end_ARG ( italic_σ start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ≥ 2 over¯ start_ARG italic_V end_ARG ( italic_τ ). ∎

Appendix B Missing Results

Refer to caption
Refer to caption
Figure 6: Comparison of sample complexity for data generated from CMM and CMPL models with G=2, and aggregated using Copeland and SP-Voting rule.
Refer to caption
Figure 7: Comparison of sample complexity on real data when votes are aggregated using Copeland and Partial-SP.