research-article

Effective signal reconstruction from multiple ranked lists via convex optimization

Authors:

Michael G. Schimek,

Bastian Pfeifer,

Michele La RoccaAuthors Info & Claims

Data Mining and Knowledge Discovery, Volume 38, Issue 3

Pages 1125 - 1169

https://doi.org/10.1007/s10618-023-00991-z

Published: 02 January 2024 Publication History

Abstract

The ranking of objects is widely used to rate their relative quality or relevance across multiple assessments. Beyond classical rank aggregation, it is of interest to estimate the usually unobservable latent signals that inform a consensus ranking. Under the only assumption of independent assessments, which can be incomplete, we introduce indirect inference via convex optimization in combination with computationally efficient Poisson Bootstrap. Two different objective functions are suggested, one linear and the other quadratic. The mathematical formulation of the signal estimation problem is based on pairwise comparisons of all objects with respect to their rank positions. Sets of constraints represent the order relations. The transitivity property of rank scales allows us to reduce substantially the number of constraints associated with the full set of object comparisons. The key idea is to globally reduce the errors induced by the rankers until optimal latent signals can be obtained. Its main advantage is low computational costs, even when handling

n < < p

data problems. Exploratory tools can be developed based on the bootstrap signal estimates and standard errors. Simulation evidence, a comparison with the state-of-the-art rank centrality method, and two applications, one in higher education evaluation and the other in molecular cancer research, are presented.

References

[1]

Alvo M, Yu PLH (2014) Statistical methods for ranking data. Springer, New York

[2]

Babu GJ, Pathak PK, Rao CR (1999) Second-order correctness of the Poisson bootstrap. Ann Stat 27(5):1666–1683

[3]

Bradley RA, Terry ME (1955) Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika 39, 3/4, 324–345

[4]

de Borda JC. (1781) Mémoire sur les Élections au Scrutiny. Histoire de l’Acaémie Royal des Sciences, Paris

[5]

Chamandy N, Muralidharan O, Najmi A, and Naidu S Estimating Uncertainty for Massive Data Streams 2012 Google Technical Report

[6]

Cucuringu M Sync-Rank: Robust ranking, constrained ranking and rank aggregation via eigenvector and SDP synchronizalion IEEE Trans Netw Sci Eng 2016 3 1 58-79

[7]

DeConde PR et al. (2006) Combined results of microarray experiments: a rank aggregation approach. Stat Appl Genet Molecul Biol 5(15)

[8]

Dwork C et al. (2001) Rank aggregation methods for the Web. http://www10.org/cdrom/papers/577/

[9]

Fagin R, Kumar R, and Sivakumar D Comparing top-k lists SIAM J Discr Math 2003 17 134-160

Digital Library

[10]

Fligner MA and Verducci JS Multistage ranking models J Am Stat Assoc 1988 83 892-901

[11]

Gao C, Shen Y, Zhang AY (2021) Uncertainty quantification in the Bradley-Terry-Luce model. https://doi.org/10.48550/arXiv.2110.03874

[12]

Hall P and Schimek MG Moderate deviation-based inference for random degeneration in paired rank lists J Am Stat Assoc 2012 107 661-672

[13]

Kohyanagi N, Kitamura N, Tanaka K, Mizuno T, Fujiwara N, Ohama T, and Sato K The protein level of the tumour-promoting factor SET is regulated by cell density J Biochem 2022 3 171 295-303

[14]

Li H, Xu M, Liu JS, and Fan X An extended Mallows model for ranked data aggregation J Am Stat Assoc 2020 115 730-746

[15]

Li X, Yi D, and Liu JS Bayesian analysis of rank data with covariates and heterogeneous rankers Stat Sci 2022 37 1 1-23

[16]

Lin S (2010) Space oriented rank-based data integration. Stat Appl Genet Molecul Biol, 9(20)

[17]

Lin S and Ding J Integration of ranked lists via Cross Entropy Monte Carlo with applications to mRNA and microRNA studies Biometrics 2009 65 9-18

[18]

Luce RD Individual choice behavior: A theoretical analysis 1959 New York John Wiley and Sons

[19]

McFadden D Conditional logit analysis of qualitative choice behavior Front Econ 1973 1 1 105-142

[20]

Mallows CL Non null ranking models I Biometrika 1957 44 114-130

[21]

Negahban S, Oh S, Shah D (2012) Iterative ranking from pair-wise comparisons. Proc Adv Neural Inf Process Syst 25 (NIPS 2012): 2474–2482

[22]

Negahban S, Oh S, and Shah D Rank centrality: ranking from pairwise comparisons Oper Res 2017 65 266-287

Digital Library

[23]

Plackett RL The analysis of permutations Appl Stat 1975 24 193-202

[24]

Rao CR, Pathak PK, and Koltchinskii VI Bootstrap by sequential resampling J Stat Plan Infer 1997 64 257-281

[25]

Rappoport N and Shamir R Multi-omic and multi-view clustering algorithms: review and cancer benchmark Nucleic Acids Res 2018 46 20 10546-10562

[26]

Ritchie ME, Phipson B, Wu DI, Hu Y, Law CW, Shi W, and Smyth GK limma powers differential expression analyses for RNA-sequencing and microarray studies Nucleic Acids Res 2015 43 7 e47-e47

[27]

Sampath S and Verducci JS Detecting the end of agreement between two long ranked lists Stat Anal Data Min 2013 6 458-471

[28]

Schimek MG, Budinska E, Kugler KG, Svendova V, Ding J, and Lin S TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists Stat Appl Genet Molecul Biol 2015 14 3 311-316

[29]

Schulte-Sasse R, Budach S, Hnisz D, and Marsico A Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms Nat Mach Intell 2021 3 6 513-526

[30]

Spearman C The proof and measurement of association between two things Am J Psychol 1904 15 72-101

[31]

Svendova V and Schimek MG A novel method for estimating the common signals for consensus across multiple ranked lists Comput Stat Data Anal 2017 115 122-135

[32]

Thurstone LL (1927) A law of comparative judgement. Psychol Rev 34:273–286

[33]

Vitelli V, Sørensen Ø, Crispino M, Frigessi A, and Arjas E Probabilistic preference learning with the Mallows rank model J Mach Learn Res 2018 18 1-49

[34]

Wauthier FL, Jordan MI, Jojic N (2013) Efficient ranking from pairwise comparisons. Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 28(3), 109–117

[35]

Xu H, Alvo M, and Yu LH Angle-based models for ranking data Comput Stat Data Anal 2018 121 113-136

[36]

Yu LH and Xu H Rank aggregation using latent-scale distance-based models Stat Comput 2019 29 335-349

Digital Library

[37]

Zhu W, Jiang Y, Liu JS, and Deng K Partition-Mallows model and its inference for rank aggregation J Am Stat Assoc 2023 118 343-359

Recommendations

Brilliant Search Engine Optimisation
An introduction to convex optimization for communications and signal processing

Convex optimization methods are widely used in the design and analysis of communication systems and signal processing algorithms. This tutorial surveys some of recent progress in this area. The tutorial contains two parts. The first part gives a survey ...
Rank aggregation using latent-scale distance-based models

Rank aggregation aims at combining rankings of a set of items assigned by a sample of rankers to generate a consensus ranking. A typical solution is to adopt a distance-based approach to minimize the sum of the distances to the observed rankings. ...

Comments

Information & Contributors

Information

Published In

cover image Data Mining and Knowledge Discovery

Data Mining and Knowledge Discovery Volume 38, Issue 3

May 2024

732 pages

Issue’s Table of Contents

© The Author(s) 2024. corrected publication 2024.

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 02 January 2024

Accepted: 13 November 2023

Received: 20 August 2022

Author Tags

Qualifiers

Research-article

Funding Sources

Medical University of Graz

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Dec 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents