Open access

Dark-Pool Smart Order Routing: a Combinatorial Multi-armed Bandit Approach

Authors:

Martino Bernasconi,

Stefano Martino,

Edoardo Vittori,

Francesco Trovò,

Marcello Restelli

ICAIF '22: Proceedings of the Third ACM International Conference on AI in Finance

Pages 352 - 360

https://doi.org/10.1145/3533271.3561728

Published: 26 October 2022

Abstract

We study the problem of developing a Smart Order Routing algorithm that learns how to optimize the dollar volume, i.e., the total value of the traded shares, gained from slicing an order across multiple dark pools. Our work is motivated by two distinct issues: (i) the surge in liquidity fragmentation caused by the rising popularity of electronic trading and by the increasing number of trading venues, and (ii) the growth in popularity of dark pools, a type of trading venue characterised by a lack of transparency. This paper critically discusses the known dark pool literature and proposes a novel algorithm, namely the DP-CMAB algorithm, that extends existing solutions by allowing the agent to specify the desired limit price when placing orders. Specifically, we frame the problem of dollar volume optimization in a multi-venue setting as a Combinatorial Multi-Armed Bandit (CMAB) problem, a generalization of the well-studied MAB framework. Drawing from the rich MAB and CMAB literature, we present multiple strategies that our algorithm may adopt to select the best allocation options. Furthermore, we analyze how exploiting financial domain knowledge improves the agent's performance. Finally, we evaluate the DP-CMAB performance in an environment built from real market data and show that our algorithm outperforms state-of-the-art solutions.
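
To make the CMAB framing in the abstract concrete, the following is a minimal illustrative sketch and not the paper's DP-CMAB algorithm. It assumes a base arm is a (venue, limit price, quantity) triple, a super arm picks at most one base arm per venue within a total order size, and the reward is the dollar volume (price times filled shares). The venue parameters, the price grid, the fill simulator, and every function name are hypothetical. The index is a UCB1-style bonus capped at price times quantity, and the super arm is chosen with a small multiple-choice knapsack dynamic program.

```python
import math
import random

# Hypothetical venues: (maximum hidden liquidity in lots, mean hidden bid price).
VENUES = [(4, 10.00), (2, 10.10), (6, 9.95)]
PRICES = [9.90, 10.00, 10.10]  # assumed grid of candidate limit prices


def simulate_fill(venue, price, qty, rng):
    """Toy sell-side fill model: nothing executes if the limit price exceeds the hidden bid."""
    max_liquidity, mean_bid = VENUES[venue]
    if price > rng.gauss(mean_bid, 0.05):
        return 0
    return min(qty, rng.randint(0, max_liquidity))


def optimistic_value(mean, pulls, t, price, qty):
    """UCB1-style index in dollars, capped at the largest possible reward price * qty."""
    cap = price * qty
    if pulls == 0:
        return cap
    return min(mean + cap * math.sqrt(2.0 * math.log(t) / pulls), cap)


def select_super_arm(index, n_venues, prices, budget):
    """Pick at most one (price, qty) per venue, using at most `budget` lots in total."""
    best = [[0.0] * (budget + 1)]  # best[v][b]: value of first v venues with b lots
    choice = []
    for v in range(n_venues):
        row, pick = list(best[v]), [None] * (budget + 1)  # default: send nothing to v
        for b in range(1, budget + 1):
            for q in range(1, b + 1):
                for p in prices:
                    value = best[v][b - q] + index[(v, p, q)]
                    if value > row[b]:
                        row[b], pick[b] = value, (p, q)
        best.append(row)
        choice.append(pick)
    allocation, b = {}, budget
    for v in reversed(range(n_venues)):  # backtrack through the stored choices
        if choice[v][b] is not None:
            allocation[v] = choice[v][b]
            b -= choice[v][b][1]
    return allocation


def run(rounds=500, budget=6, seed=0):
    """Play the bandit and return the average dollar volume per round."""
    rng = random.Random(seed)
    stats = {(v, p, q): [0, 0.0]  # arm -> [pulls, running mean reward]
             for v in range(len(VENUES)) for p in PRICES for q in range(1, budget + 1)}
    total = 0.0
    for t in range(1, rounds + 1):
        index = {arm: optimistic_value(mean, pulls, t, arm[1], arm[2])
                 for arm, (pulls, mean) in stats.items()}
        allocation = select_super_arm(index, len(VENUES), PRICES, budget)
        for venue, (price, qty) in allocation.items():
            reward = price * simulate_fill(venue, price, qty, rng)
            pulls, mean = stats[(venue, price, qty)]
            stats[(venue, price, qty)] = [pulls + 1, mean + (reward - mean) / (pulls + 1)]
            total += reward
    return total / rounds


if __name__ == "__main__":
    print(f"average dollar volume per round: {run():.2f}")
```

Capping the index at price times quantity encodes one piece of domain knowledge (a venue cannot fill more than was sent); other index rules, such as Thompson sampling, could be swapped into `optimistic_value` without changing the allocation step.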

Cited By

Ji J, Xu R, Zhu R. Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4178738

Index Terms

Dark-Pool Smart Order Routing: a Combinatorial Multi-armed Bandit Approach
1. Theory of computation
  1. Design and analysis of algorithms
    1. Online algorithms
      1. Online learning algorithms
  2. Theory and algorithms for application domains
    1. Machine learning theory
      1. Online learning theory
      2. Reinforcement learning
        Sequential decision making

Published In

ICAIF '22: Proceedings of the Third ACM International Conference on AI in Finance

November 2022

527 pages

ISBN:9781450393768

DOI:10.1145/3533271

Editors:
Daniele Magazzeni (J.P. Morgan AI Research),
Senthil Kumar (Capital One),
Rahul Savani (University of Liverpool),
Renyuan Xu (University of Southern California),
Carmine Ventre (King's College London),
Blanka Horvath (University of Oxford),
Ruimeng Hu (University of California Santa Barbara),
Tucker Balch (J.P. Morgan AI Research),
Francesca Toni (Imperial College London)

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

ACM: Association for Computing Machinery

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICAIF '22

Sponsor:

ACM

ICAIF '22: 3rd ACM International Conference on AI in Finance

November 2 - 4, 2022

New York, NY, USA

Bibliometrics

Article Metrics

Total Citations: 1
Total Downloads: 495
Downloads (Last 12 months): 247
Downloads (Last 6 weeks): 26

Reflects downloads up to 04 Feb 2025
