research-article
Open access

I Know What You Bought At Chipotle for $9.81 by Solving A Linear Inverse Problem

Published: 30 November 2020

Abstract

We consider the problem of identifying which products are purchased, and at what prices, in a given transaction by observing only the transaction total and nothing more. The ability to solve such an inverse problem can yield refined information about consumer spending from anonymized credit card transaction data alone. Considered in isolation, a single transaction total is not enough to identify the products purchased and their prices. However, given a large number of transactions, there may be hope. As the main contribution of this work, we provide a robust estimation algorithm that decomposes transaction totals into the underlying individual product(s) purchased by utilizing a large corpus of transactions. Our method recovers a vector of product prices $p \in \mathbb{R}^N_{>0}$ of unknown dimension $N$ (the number of products), as well as a matrix $A \in \mathbb{Z}^{M \times N}_{\geq 0}$, simply from $M$ observations (transaction totals) $y \in \mathbb{R}^M_{>0}$ such that $y = Ap + \eta$, with $\eta \in \mathbb{R}^M$ representing noise (taxes, discounts, etc.). We formally establish that our algorithm identifies $N$ and $A$ precisely and $p$ approximately, as long as each product is purchased individually at least once, i.e., $M \geq N$ and $A$ has rank $N$. Computationally, the algorithm runs in polynomial time with respect to the problem parameters, and thus we provide a computationally efficient and statistically robust method for solving such inverse problems. We apply the algorithm to a large corpus of anonymized consumer credit card transactions from the period 2016-2019, with data obtained from a commercial data vendor. The transactions are associated with spending at Apple, Chipotle, Netflix, and Spotify.
From transaction data alone, our algorithm identifies (i) key price points (without access to the listed prices), (ii) products purchased within a transaction, (iii) product launches, and (iv) evidence of a new 'secret' product from Netflix, rumored to be in limited release.
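
To make the model $y = Ap + \eta$ concrete, the sketch below illustrates only the easier half of the problem: given a price vector that is already known, it enumerates the non-negative integer quantity vectors (candidate rows of $A$) whose subtotal lands within a tolerance of an observed total. This is a brute-force illustration, not the paper's algorithm, which must also recover $N$ and $p$ and runs in polynomial time. The function name `decompose`, the quantity cap `max_qty`, and the three prices are assumptions made up for this example; they are not taken from the paper or from any real menu.

```python
from itertools import product


def decompose(total_cents, prices_cents, max_qty=3, tol_cents=0):
    """Enumerate quantity vectors a with |a . p - total| <= tol.

    Amounts are integer cents so that comparisons are exact.
    tol_cents plays the role of a bound on the noise term eta.
    """
    matches = []
    # Try every combination of 0..max_qty units of each product.
    for counts in product(range(max_qty + 1), repeat=len(prices_cents)):
        if not any(counts):
            continue  # skip the empty basket
        subtotal = sum(c * p for c, p in zip(counts, prices_cents))
        if abs(subtotal - total_cents) <= tol_cents:
            matches.append(counts)
    return matches


# Hypothetical prices in cents, chosen only for illustration.
prices = [750, 231, 195]

exact = decompose(981, prices)               # noiseless total
noisy = decompose(983, prices, tol_cents=2)  # total perturbed by 2 cents
print(exact, noisy)
```

With these made-up prices, a total of $9.81 admits a single decomposition: one unit each of the first and second products. The enumeration grows exponentially in the number of products, which is one reason a dedicated algorithm is needed at scale.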

    Published In

    Proceedings of the ACM on Measurement and Analysis of Computing Systems  Volume 4, Issue 3
    POMACS
    December 2020
    345 pages
    EISSN:2476-1249
    DOI:10.1145/3440131
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. alternative data
    2. blind compressed sensing
    3. consumer credit card transactions
    4. finance
