Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Effective signal reconstruction from multiple ranked lists via convex optimization

Published: 02 January 2024 Publication History

Abstract

The ranking of objects is widely used to rate their relative quality or relevance across multiple assessments. Beyond classical rank aggregation, it is of interest to estimate the usually unobservable latent signals that inform a consensus ranking. Under the only assumption of independent assessments, which can be incomplete, we introduce indirect inference via convex optimization in combination with computationally efficient Poisson Bootstrap. Two different objective functions are suggested, one linear and the other quadratic. The mathematical formulation of the signal estimation problem is based on pairwise comparisons of all objects with respect to their rank positions. Sets of constraints represent the order relations. The transitivity property of rank scales allows us to reduce substantially the number of constraints associated with the full set of object comparisons. The key idea is to globally reduce the errors induced by the rankers until optimal latent signals can be obtained. Its main advantage is low computational costs, even when handling n<<p data problems. Exploratory tools can be developed based on the bootstrap signal estimates and standard errors. Simulation evidence, a comparison with the state-of-the-art rank centrality method, and two applications, one in higher education evaluation and the other in molecular cancer research, are presented.

References

[1]
Alvo M, Yu PLH (2014) Statistical methods for ranking data. Springer, New York
[2]
Babu GJ, Pathak PK, Rao CR (1999) Second-order correctness of the Poisson bootstrap. Ann Stat 27(5):1666–1683
[3]
Bradley RA, Terry ME (1955) Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika 39, 3/4, 324–345
[4]
de Borda JC. (1781) Mémoire sur les Élections au Scrutiny. Histoire de l’Acaémie Royal des Sciences, Paris
[5]
Chamandy N, Muralidharan O, Najmi A, and Naidu S Estimating Uncertainty for Massive Data Streams 2012 Google Technical Report
[6]
Cucuringu M Sync-Rank: Robust ranking, constrained ranking and rank aggregation via eigenvector and SDP synchronizalion IEEE Trans Netw Sci Eng 2016 3 1 58-79
[7]
DeConde PR et al. (2006) Combined results of microarray experiments: a rank aggregation approach. Stat Appl Genet Molecul Biol 5(15)
[8]
Dwork C et al. (2001) Rank aggregation methods for the Web. http://www10.org/cdrom/papers/577/
[9]
Fagin R, Kumar R, and Sivakumar D Comparing top-k lists SIAM J Discr Math 2003 17 134-160
[10]
Fligner MA and Verducci JS Multistage ranking models J Am Stat Assoc 1988 83 892-901
[11]
Gao C, Shen Y, Zhang AY (2021) Uncertainty quantification in the Bradley-Terry-Luce model. https://doi.org/10.48550/arXiv.2110.03874
[12]
Hall P and Schimek MG Moderate deviation-based inference for random degeneration in paired rank lists J Am Stat Assoc 2012 107 661-672
[13]
Kohyanagi N, Kitamura N, Tanaka K, Mizuno T, Fujiwara N, Ohama T, and Sato K The protein level of the tumour-promoting factor SET is regulated by cell density J Biochem 2022 3 171 295-303
[14]
Li H, Xu M, Liu JS, and Fan X An extended Mallows model for ranked data aggregation J Am Stat Assoc 2020 115 730-746
[15]
Li X, Yi D, and Liu JS Bayesian analysis of rank data with covariates and heterogeneous rankers Stat Sci 2022 37 1 1-23
[16]
Lin S (2010) Space oriented rank-based data integration. Stat Appl Genet Molecul Biol, 9(20)
[17]
Lin S and Ding J Integration of ranked lists via Cross Entropy Monte Carlo with applications to mRNA and microRNA studies Biometrics 2009 65 9-18
[18]
Luce RD Individual choice behavior: A theoretical analysis 1959 New York John Wiley and Sons
[19]
McFadden D Conditional logit analysis of qualitative choice behavior Front Econ 1973 1 1 105-142
[20]
Mallows CL Non null ranking models I Biometrika 1957 44 114-130
[21]
Negahban S, Oh S, Shah D (2012) Iterative ranking from pair-wise comparisons. Proc Adv Neural Inf Process Syst 25 (NIPS 2012): 2474–2482
[22]
Negahban S, Oh S, and Shah D Rank centrality: ranking from pairwise comparisons Oper Res 2017 65 266-287
[23]
Plackett RL The analysis of permutations Appl Stat 1975 24 193-202
[24]
Rao CR, Pathak PK, and Koltchinskii VI Bootstrap by sequential resampling J Stat Plan Infer 1997 64 257-281
[25]
Rappoport N and Shamir R Multi-omic and multi-view clustering algorithms: review and cancer benchmark Nucleic Acids Res 2018 46 20 10546-10562
[26]
Ritchie ME, Phipson B, Wu DI, Hu Y, Law CW, Shi W, and Smyth GK limma powers differential expression analyses for RNA-sequencing and microarray studies Nucleic Acids Res 2015 43 7 e47-e47
[27]
Sampath S and Verducci JS Detecting the end of agreement between two long ranked lists Stat Anal Data Min 2013 6 458-471
[28]
Schimek MG, Budinska E, Kugler KG, Svendova V, Ding J, and Lin S TopKLists: a comprehensive R package for statistical inference, stochastic aggregation, and visualization of multiple omics ranked lists Stat Appl Genet Molecul Biol 2015 14 3 311-316
[29]
Schulte-Sasse R, Budach S, Hnisz D, and Marsico A Integration of multiomics data with graph convolutional networks to identify new cancer genes and their associated molecular mechanisms Nat Mach Intell 2021 3 6 513-526
[30]
Spearman C The proof and measurement of association between two things Am J Psychol 1904 15 72-101
[31]
Svendova V and Schimek MG A novel method for estimating the common signals for consensus across multiple ranked lists Comput Stat Data Anal 2017 115 122-135
[32]
Thurstone LL (1927) A law of comparative judgement. Psychol Rev 34:273–286
[33]
Vitelli V, Sørensen Ø, Crispino M, Frigessi A, and Arjas E Probabilistic preference learning with the Mallows rank model J Mach Learn Res 2018 18 1-49
[34]
Wauthier FL, Jordan MI, Jojic N (2013) Efficient ranking from pairwise comparisons. Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 28(3), 109–117
[35]
Xu H, Alvo M, and Yu LH Angle-based models for ranking data Comput Stat Data Anal 2018 121 113-136
[36]
Yu LH and Xu H Rank aggregation using latent-scale distance-based models Stat Comput 2019 29 335-349
[37]
Zhu W, Jiang Y, Liu JS, and Deng K Partition-Mallows model and its inference for rank aggregation J Am Stat Assoc 2023 118 343-359

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Data Mining and Knowledge Discovery
Data Mining and Knowledge Discovery  Volume 38, Issue 3
May 2024
732 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 02 January 2024
Accepted: 13 November 2023
Received: 20 August 2022

Author Tags

  1. Ranking data
  2. Rank centrality
  3. Signal estimation
  4. Convex optimization
  5. Poisson bootstrap

Qualifiers

  • Research-article

Funding Sources

  • Medical University of Graz

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media