Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3533271.3561728acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicaifConference Proceedingsconference-collections
research-article
Open access

Dark-Pool Smart Order Routing: a Combinatorial Multi-armed Bandit Approach

Published: 26 October 2022 Publication History

Abstract

We study the problem of developing a Smart Order Routing algorithm that learns how to optimize the dollar volume, i.e., the total value of the traded shares, gained from slicing an order across multiple dark pools. Our work is motivated by two distinct issues: (i) the surge in liquidity fragmentation caused by the rising popularity of electronic trading and by the increasing number of trading venues, and (ii) the growth in popularity of dark pools, an exchange venue characterised by a lack of transparency. This paper critically discusses the known dark pool literature and proposes a novel algorithm, namely the DP-CMAB algorithm, that extends existing solutions by allowing the agent to specify the desired limit price when placing orders. Specifically, we frame the problem of dollar volume optimization in a multi-venue setting as a Combinatorial Multi-Armed Bandit (CMAB) problem, representing a generalization of the well-studied MAB framework. Drawing from the rich MAB and CMAB literature, we present multiple strategies that our algorithm may adopt to select the best allocation options. Furthermore, we analyze how exploiting financial domain knowledge improves the agents’ performance. Finally, we evaluate the DP-CMAB performance in an environment built from real market data and show that our algorithm outperforms state-of-the-art solutions.

References

[1]
Alekh Agarwal, Peter Bartlett, and Max Dama. 2010. Optimal Allocation Strategies for the Dark Pool Problem. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), Vol. 9. 9–16.
[2]
Robert Almgren and Neil Chriss. 2001. Optimal execution of portfolio transactions. Journal of Risk 3(2001), 5–40.
[3]
Robert Almgren and Bill Harts. 2008. A dynamic algorithm for smart order routing. White paper StreamBase(2008), 1–11.
[4]
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. 2002. Finite-time Analysis of the Multiarmed Bandit Problem. Machine Learning 47 (05 2002), 235–256.
[5]
Martino Bernasconi-De-Luca, Luigi Fusco, and Ozrenka Dragić. 2021. martinobdl/ITCH: ITCH50Converter. https://doi.org/10.5281/ZENODO.5209267
[6]
Martino Bernasconi de Luca, Edoardo Vittori, Francesco Trovò, and Marcello Restelli. 2021. Conservative Online Convex Optimization. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 19–34.
[7]
Dimitris Bertsimas and Andrew W Lo. 1998. Optimal control of execution costs. Journal of Financial Markets 1, 1 (1998), 1–50.
[8]
Wei Chen, Yajun Wang, and Yang Yuan. 2013. Combinatorial multi-armed bandit: General framework and applications. In Proceedings of the International conference on Machine Learning (ICML). 151–159.
[9]
Rama Cont and Arseniy Kukanov. 2017. Optimal order placement in limit order markets. Quantitative Finance 17, 1 (2017), 21–39.
[10]
Puja Das, Nicholas Johnson, and Arindam Banerjee. 2013. Online lazy updates for portfolio selection with transaction costs. In Proceedings of the conference on Artificial Intelligence (AAAI). 202–208.
[11]
Hans Degryse, Mark Van Achter, and Gunther Wuyts. 2009. Shedding Light on Dark Liquidity Pools. Trading1(2009), 147–155.
[12]
Martin Ester, Hans-Peter Kriegel, Jörg Sander, and Xiaowei Xu. 1996. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proceedings of the conference on knowledge discovery and data mining (SIGKDD). 226–231.
[13]
Thierry Foucault and Albert J Menkveld. 2008. Competition for order flow and smart order routing systems. The Journal of Finance 63, 1 (2008), 119–158.
[14]
Kuzman Ganchev, Yuriy Nevmyvaka, Michael Kearns, and Jennifer Wortman Vaughan. 2010. Censored exploration and the dark pool problem. Commun. ACM 53, 5 (2010), 99–107.
[15]
Olivier Guéant. 2016. The Financial Mathematics of Market Liquidity: From optimal execution to market making. Vol. 33. CRC Press.
[16]
Olivier Guéant, Charles-Albert Lehalle, and Joaquin Fernandez-Tapia. 2012. Optimal portfolio liquidation with limit orders. SIAM Journal on Financial Mathematics 3, 1 (2012), 740–764.
[17]
Terrence Hendershott and Haim Mendelson. 2000. Crossing networks and dealer markets: Competition and performance. The Journal of Finance 55, 5 (2000), 2071–2115.
[18]
Woonghee Huh and Paat Rusmevichientong. 2009. A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand. Math. Oper. Res. 34 (02 2009), 103–123.
[19]
Emilie Kaufmann, Olivier Cappé, and Aurélien Garivier. 2012. On Bayesian upper confidence bounds for bandit problems. In Proceedings of the International Confererence on Artificial intelligence and Statistics (AISTATS). PMLR, 592–600.
[20]
Peter Kratz and Torsten Schöneborn. 2014. Optimal liquidation in dark pools. Quantitative Finance 14, 9 (2014), 1519–1539.
[21]
Sophie Laruelle, Charles-Albert Lehalle, and Gilles Pages. 2011. Optimal split of orders across liquidity pools: a stochastic algorithm approach. SIAM Journal on Financial Mathematics 2, 1 (2011), 1042–1076.
[22]
Costis Maglaras, Ciamac Moallemi, and Hua Zheng. 2012. Optimal Order Routing in a Fragmented Market. Preprint (05 2012), 1–5.
[23]
Marco Mussi, Gianmarco Genalti, Francesco Trovò, Alessandro Nuara, Nicola Gatti, and Marcello Restelli. 2022. Pricing the Long Tail by Explainable Product Aggregation and Monotonic Bandits. In Proceedings of the conference on knowledge discovery and data mining (SIGKDD). 3623–3633.
[24]
Alessandro Nuara, Francesco Trovò, Nicola Gatti, and Marcello Restelli. 2018. A combinatorial-bandit algorithm for the online joint bid/budget optimization of pay-per-click advertising campaigns. In Proceedings of the Conference on Artificial Intelligence (AAAI). 2379–2386.
[25]
Alessandro Nuara, Francesco Trovò, Nicola Gatti, and Marcello Restelli. 2022. Online joint bid/daily budget optimization of internet advertising campaigns. Artificial Intelligence 305 (2022), 103663.
[26]
Gregor Pujol and Alexander Brueckner. 2009. Smart Order Routing and Best Execution. In Proceedings of the Americas Conference on Information Systems (AMCIS), Vol. 3. 155.
[27]
Yan Qin, Ruoxuan Wang, Asoo J. Vakharia, Yuwen Chen, and Michelle M.H. Seref. 2011. The newsvendor problem: Review and directions for future research. European Journal of Operational Research 213, 2 (2011), 361–374.
[28]
Gary Shorter and Rena S. Miller. 2014. Dark Pools in Equity Trading: Policy Concerns and Recent Developments. In University of North Texas Libraries, UNT Digital Library. 1–18. https://digital.library.unt.edu/ark:/67531/metadc461960/
[29]
Francesco Trovò, Stefano Paladino, Marcello Restelli, and Nicola Gatti. 2018. Improving multi-armed bandit algorithms in online pricing settings. International Journal of Approximate Reasoning 98 (2018), 196–235.
[30]
Edoardo Vittori, Martino Bernasconi de Luca, Francesco Trovò, and Marcello Restelli. 2020. Dealing with transaction costs in portfolio optimization: online gradient descent with momentum. In Proceedings of the International Conference on AI in Finance (ICAIF). 1–8.
[31]
Siwei Wang and Wei Chen. 2018. Thompson Sampling for Combinatorial Semi-Bandits. In Proceedings of the International Conference on Machine Learning (ICML), Vol. 80. 5114–5122.
[32]
Linlin Ye. 2016. Understanding the Impacts of Dark Pools on Price Discovery. (2016), 1–73.
[33]
Haoxiang Zhu. 2013. Do Dark Pools Harm Price Discovery?The Review of Financial Studies 27, 3 (12 2013), 747–789.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICAIF '22: Proceedings of the Third ACM International Conference on AI in Finance
November 2022
527 pages
ISBN:9781450393768
DOI:10.1145/3533271
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 October 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Dark pools
  2. combinatorial multi-armed bandit
  3. smart order routing

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICAIF '22
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)247
  • Downloads (Last 6 weeks)26
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media