Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Rank estimation for (approximately) low-rank matrices

Published: 20 January 2022 Publication History

Abstract

In observational data analysis, e.g., causal inference, one often encounters data sets that are noisy and incomplete, but come from inherently "low rank" (or correlated) systems. Examples include user ratings of movies/products and term frequency matrices for documents amongst others. In such analysis, estimating the approximate rank of the data sets serves an important function of delineating the signal from the noise. In this paper, we propose a technique to estimate the rank of observational data matrices, compare it to previously proposed techniques, and make a specific methodological contribution of improving the algorithmic parameter estimation in the robust synthetic control method in [1].

References

[1]
M. Amjad, D. Shah, and D. Shen. Robust synthetic control. J. Mach. Learn. Res., 19(1):802--852, January 2018.
[2]
Alberto Abadie and Javier Gardeazabal. The economic costs of conflict: A case study of the basque country. American economic review, 93(1):113--132, 2003.
[3]
Alberto Abadie, Alexis Diamond, and Jens Hainmueller. Synthetic control methods for comparative case studies: Estimating the effect of california's tobacco control program. Journal of the American statistical Association, 105(490):493--505, 2010.
[4]
Scott WH Young. Improving library user experience with a/b testing: Principles and process. Weave: Journal of Library User Experience, 1(1), 2014.
[5]
Steve Chaplin. The placebo response: an important part of treatment. Prescriber, 17(5):16--22, 2006.
[6]
S. Chatterjee. Matrix estimation by universal singular value thresholding. Annals of Statistics, 43:177--214, 2015.
[7]
John J Donohue, Abhay Aneja, and Kyle D Weber. Right-to-carry laws and violent crime: A comprehensive assessment using panel data and a state-level synthetic control analysis. Journal of Empirical Legal Studies, 16(2):198--247, 2019.
[8]
Scott Cunningham and Manisha Shah. Decriminalizing indoor prostitution: Implications for sexual violence and public health. The Review of Economic Studies, 85(3):1683--1715, 2018.
[9]
Sarah Bohn, Magnus Lofstrom, and Steven Raphael. Did the 2007 legal arizona workers act reduce the state's unauthorized immigrant population? Review of Economics and Statistics, 96(2):258--269, 2014.
[10]
Noémi Kreif, Richard Grieve, Dominik Hangartner, Alex James Turner, Silviya Nikolova, and Matt Sutton. Examination of the synthetic control method for evaluating health policies with multiple treated units. Health economics, 25(12):1514--1528, 2016.
[11]
Boris Heersink, Brenton D Peterson, and Jeffery A Jenkins. Disasters and elections: Estimating the net effect of damage and relief in historical perspective. Political Analysis, 25(2):260--268, 2017.
[12]
Shashanka Ubaru and Yousef Saad. Fast methods for estimating the numerical rank of large matrices. In International Conference on Machine Learning, pages 468--477. PMLR, 2016.
[13]
Anish Agarwal, Abdullah Alomar, Romain Cosson, Devavrat Shah, and Dennis Shen. Synthetic Interventions. https://arxiv.org/pdf/2006.07691.pdf.
[14]
M. Amjad, V. Misra, D. Shah, and D. Shen. mRSC: Multi-dimensional robust synthetic control. In Proceedings of the ACM on Measurement and Analysis of Computing Systems (Sigmetrics 2019), volume 3, page 37, June 2019.
[15]
Make low rank matrix. https://scikit-learn.org/ stable/modules/generated/sklearn. datasets.make_low_rank_matrix.html.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGMETRICS Performance Evaluation Review
ACM SIGMETRICS Performance Evaluation Review  Volume 49, Issue 2
September 2021
73 pages
ISSN:0163-5999
DOI:10.1145/3512798
Issue’s Table of Contents
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 January 2022
Published in SIGMETRICS Volume 49, Issue 2

Check for updates

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 55
    Total Downloads
  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media