research-article

Towards generalizable neural solvers for vehicle routing problems via ensemble with transferrable local policy

AUTHORs:

Chao QianAuthors Info & Claims

IJCAI '24: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

Article No.: 764, Pages 6914 - 6922

https://doi.org/10.24963/ijcai.2024/764

Published: 03 August 2024 Publication History

Abstract

Machine learning has been adapted to help solve NP-hard combinatorial optimization problems. One prevalent way is learning to construct solutions by deep neural networks, which has been receiving more and more attention due to the high efficiency and less requirement for expert knowledge. However, many neural construction methods for Vehicle Routing Problems (VRPs) focus on synthetic problem instances with specified node distributions and limited scales, leading to poor performance on real-world problems which usually involve complex and unknown node distributions together with large scales. To make neural VRP solvers more practical, we design an auxiliary policy that learns from the local transferable topological features, named local policy, and integrate it with a typical construction policy (which learns from the global information of VRP instances) to form an ensemble policy. With joint training, the aggregated policies perform cooperatively and complementarily to boost generalization. The experimental results on two well-known benchmarks, TSPLIB and CVRPLIB, of travelling salesman problem and capacitated VRP show that the ensemble policy significantly improves both cross-distribution and cross-scale generalization performance, and even performs well on real-world problems with several thousand nodes.

References

[1]

David Applegate, Robert Bixby, Vasek Chvatal, and William Cook. Concorde TSP solver. http://www.math.uwaterloo.ca/tsp/concorde/m, 2006.

[2]

Florian Arnold, Michel Gendreau, and Kenneth Sörensen. Efficiently solving very large-scale routing problems. Computers & Operations Research, 107:32-42, 2019.

Digital Library

[3]

Irwan Bello, Hieu Pham, Quoc V Le, Mohammad Norouzi, and Samy Bengio. Neural combinatorial optimization with reinforcement learning. In Proceedings of the 5th ICLR, Toulon, France, 2017.

[4]

Yoshua Bengio, Andrea Lodi, and Antoine Prouvost. Machine learning for combinatorial optimization: a methodological tour d'horizon. European Journal of Operational Research, 290(2):405-421, 2021.

[5]

Jieyi Bi, Yining Ma, Jiahai Wang, Zhiguang Cao, Jinbiao Chen, Yuan Sun, and Yeow Meng Chee. Learning generalizable models for vehicle routing problems via knowledge distillation. In Advances in Neural Information Processing Systems 35 (NeurIPS), pages 31226-31238, New Orleans, LA, 2022.

[6]

Jakob Bossek, Pascal Kerschke, Aneta Neumann, Markus Wagner, Frank Neumann, and Heike Trautmann. Evolving diverse TSP instances by means of novel and creative mutation operators. In Proceedings of the 15th ACM/SIGEVO Conference on Foundations of Genetic Algorithms (FOGA), pages 58-71, Potsdam, Germany, 2019.

Digital Library

[7]

Quentin Cappart, Didier Chételat, Elias B Khalil, Andrea Lodi, Christopher Morris, and Petar Veličković. Combinatorial optimization and reasoning with graph neural networks. Journal of Machine Learning Research, 24(130):1-61, 2023.

[8]

Wenjie Chen, Shengcai Liu, Yew-Soon Ong, and Ke Tang. Neural influence estimator: Towards real-time solutions to influence blocking maximization. arXiv:2308.14012, 2023.

[9]

Hanni Cheng, Haosi Zheng, Ya Cong, Weihao Jiang, and Shiliang Pu. Select and optimize: Learning to aolve large-scale TSP instances. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS), pages 1219-1231, Valencia, Spain, 2023.

[10]

Jan Christiaens and Greet Vanden Berghe. Slack induction by string removals for vehicle routing problems. Transportation Science, 54(2):417-433, 2020.

Digital Library

[11]

George B Dantzig and John H Ramser. The truck dispatching problem. Management Science, 6(1):80-91, 1959.

Digital Library

[12]

Darko Drakulic, Sofia Michel, Florian Mai, Arnaud Sors, and Jean-Marc Andreoli. BQNCO: Bisimulation quotienting for generalizable neural combinatorial optimization. In Advances in Neural Information Processing Systems 36 (NeurIPS), New Orleans, LA, 2023.

[13]

Zhang-Hua Fu, Kai-Bin Qiu, and Hongyuan Zha. Generalize a small pre-trained model to arbitrarily large TSP instances. In Proceedings of the 35th AAAI, pages 7474-7482, Virtual, 2021.

[14]

Qiming Fu, Zhechao Wang, Nengwei Fang, Bin Xing, Xiao Zhang, and Jianping Chen. Maml2: meta reinforcement learning via meta-learning for task categories. Frontiers of Computer Science, 17(4):174325, 2023.

Digital Library

[15]

Keld Helsgaun. An effective implementation of the lin-kernighan traveling salesman heuristic. European Journal of Operational Research, 126(1):106-130, 2000.

[16]

Keld Helsgaun. An extension of the linkernighan-helsgaun TSP solver for constrained traveling salesman and vehicle routing problems. Technical report, 2017.

[17]

Qingchun Hou, Jingwei Yang, Yiqiang Su, Xiaoqing Wang, and Yuming Deng. Generalize learned heuristics to solve large-scale vehicle routing problems in real-time. In Proceedings of the 11th ICLR, Kigali, Rwanda, 2023.

[18]

Yuan Jiang, Yaoxin Wu, Zhiguang Cao, and Jie Zhang. Learning to solve routing problems via distributionally robust optimization. In Proceedings of the 36th AAAI, pages 9786-9794, Virtual, 2022.

[19]

Yuan Jiang, Zhiguang Cao, Yaoxin Wu, Wen Song, and Jie Zhang. Ensemble-based deep reinforcement learning for vehicle routing problems under distribution shift. In Advances in Neural Information Processing Systems 36 (NeurIPS), pages 53112-53125, New Orleans, LA, 2023.

[20]

Yuan Jiang, Zhiguang Cao, Yaoxin Wu, and Jie Zhang. Multi-view graph contrastive learning for solving vehicle routing problems. In Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI), pages 984-994, pittsburgh, PA, 2023.

[21]

Yan Jin, Yuandong Ding, Xuanhao Pan, Kun He, Li Zhao, Tao Qin, Lei Song, and Jiang Bian. Pointerformer: Deep reinforced multi-pointer transformer for the traveling salesman problem. In Proceedings of the 37th AAAI, pages 8132-8140, Washington, DC, 2023.

Digital Library

[22]

Chaitanya K. Joshi, Thomas Laurent, and Xavier Bresson. An efficient graph convolutional network technique for the travelling salesman problem. arXiv:1906.01227, 2019.

[23]

Chaitanya K. Joshi, Quentin Cappart, Louis-Martin Rousseau, and Thomas Laurent. Learning the travelling salesperson problem requires rethinking generalization. Constraints, 27(1-2):70-98, 2022.

Digital Library

[24]

Elias B. Khalil, Hanjun Dai, Yuyu Zhang, Bistra Dilkina, and Le Song. Learning combinatorial optimization algorithms over graphs. In Advances in Neural Information Processing Systems 30 (NeurIPS), pages 6348-6358, Long Beach, CA, 2017.

[25]

Minsu Kim, Junyoung Park, and Jinkyoo Park. Sym-NCO: Leveraging symmetricity for neural combinatorial optimization. In Advances in Neural Information Processing Systems 35 (NeurIPS), pages 1936- 1949, New Orleans, LA, 2022.

[26]

Grigorios D Konstantakopoulos, Sotiris P Gayialis, and Evripidis P Kechagias. Vehicle routing problem and related algorithms for logistics distribution: A literature review and classification. Operational Research, 22(3):2033-2062, 2022.

[27]

Wouter Kool, Herke van Hoof, and Max Welling. Attention, learn to solve routing problems! In Proceedings of the 7th ICLR, New Orleans, LA, 2019.

[28]

Yeong-Dae Kwon, Jinho Choo, Byoungjip Kim, Iljoo Yoon, Youngjune Gwon, and Seungjai Min. POMO: Policy optimization with multiple optima for reinforcement learning. In Advances in Neural Information Processing Systems 33 (NeurIPS), pages 21188- 21198, Virtual, 2020.

[29]

Yang Li, Jinpei Guo, Runzhong Wang, and Junchi Yan. From distribution learning in training to gradient search in testing for combinatorial optimization. In Advances in Neural Information Processing Systems 36 (NeurIPS), New Orleans, LA, 2023.

[30]

Shengcai Liu, Yu Zhang, Ke Tang, and Xin Yao. How good is neural combinatorial optimization? A systematic evaluation on the traveling salesman problem. IEEE Computational Intelligence Magazine, 18(3):14-28, 2023.

Digital Library

[31]

Hao Lu, Xingwen Zhang, and Shuang Yang. A learning-based iterative method for solving vehicle routing problems. In Proceedings of the 7th ICLR, New Orleans, LA, 2019.

[32]

Fu Luo, Xi Lin, Fei Liu, Qingfu Zhang, and Zhenkun Wang. Neural combinatorial optimization with heavy decoder: Toward large scale generalization. In Advances in Neural Information Processing Systems 36 (NeurIPS), New Orleans, LA, 2023.

[33]

Yining Ma, Zhiguang Cao, and Yeow Meng Chee. Learning to search feasible and infeasible regions of routing problems with flexible neural k-opt. In Advances in Neural Information Processing Systems 36 (NeurIPS), New Orleans, LA, 2023.

[34]

Sahil Manchanda, Sofia Michel, Darko Drakulic, and Jean-Marc Andreoli. On the generalization of neural combinatorial optimization heuristics. In Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD), pages 426-442, Grenoble, France, 2022.

[35]

Gerhard Reinelt. TSPLIB - A traveling salesman problem library. ORSA Journal on Computing, 3(4):376-384, 1991.

[36]

Jiwoo Son, Minsu Kim, Hyeonah Kim, and Jinkyoo Park. Meta-sage: Scale meta-learning scheduled adaptation with guided exploration for mitigating scale shift on combinatorial optimization. In Proceedings of the 40th ICML, Honolulu, Hawaii, 2023.

[37]

Wen Song, Nan Mi, Qiqiang Li, Jing Zhuang, and Zhiguang Cao. Stochastic economic lot scheduling via self-attention based deep reinforcement learning. IEEE Transactions on Automation Science and Engineering, pages 1-12, 2023.

[38]

Zhiqing Sun and Yiming Yang. DIFUSCO: Graph-based diffusion solvers for combinatorial optimization. In Advances in Neural Information Processing Systems 36 (NeurIPS), New Orleans, LA, 2023.

[39]

Eduardo Uchoa, Diego Pecin, Artur Pessoa, Marcus Poggi, Thibaut Vidal, and Anand Subramanian. New benchmark instances for the capacitated vehicle routing problem. European Journal of Operational Research, 257(3):845-858, 2017.

[40]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in Neural Information Processing Systems 30 (NeurIPS), pages 5998-6008, Long Beach, CA, 2017.

[41]

Thibaut Vidal. Hybrid genetic search for the CVRP: Open-source implementation and swap* neighborhood. Computers & Operations Research, 140:105643, 2022.

Digital Library

[42]

Oriol Vinyals, Meire Fortunato, and Navdeep Jaitly. Pointer networks. In Advances in Neural Information Processing Systems 28 (NeurIPS), pages 2692-2700, Montreal, Canada, 2015.

[43]

Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3):229-256, 1992.

Digital Library

[44]

Cong Zhang, Wen Song, Zhiguang Cao, Jie Zhang, Puay Siew Tan, and Xu Chi. Learning to dispatch for job shop scheduling via deep reinforcement learning. In Advances in Neural Information Processing Systems 33 (NeurIPS), pages 1621-1632, Vancouver, Canada, 2020.

[45]

Jianan Zhou, Yaoxin Wu, Wen Song, Zhiguang Cao, and Jie Zhang. Towards omni-generalizable neural methods for vehicle routing problems. In Proceedings of the 40th ICML, pages 42769- 42789, Honolulu, HI, 2023.

Index Terms

Index terms have been assigned to the content through auto-classification.

Recommendations

A general heuristic for vehicle routing problems

We present a unified heuristic which is able to solve five different variants of the vehicle routing problem: the vehicle routing problem with time windows (VRPTW), the capacitated vehicle routing problem (CVRP), the multi-depot vehicle routing problem (...
Towards omni-generalizable neural methods for vehicle routing problems
ICML'23: Proceedings of the 40th International Conference on Machine Learning

Learning heuristics for vehicle routing problems (VRPs) has gained much attention due to the less reliance on hand-crafted rules. However, existing methods are typically trained and tested on the same task with a fixed size and distribution (of nodes), ...
An iterated local search algorithm for latency vehicle routing problems with multiple depots
Abstract
The Multi-depot Cumulative Capacitated Vehicle Routing Problem (MDCCVRP) extends the recently proposed Cumulative Capacitated Vehicle Routing Problem (CCVRP). The aim is to minimize the sum of the arrival times at the customers considering a ...
Highlights
- A novel metaheuristic algorithm for the MDCCVRP, the MDk-TRP, and the LLRP.
- Valid lower bounds are proposed for the MDCCVRP and the MDk-TRP.
- Proven optimal solution values for the MDCCVRP, the MDk-TRP, and the LLRP.
- New best-...

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings

IJCAI '24: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence

August 2024

8859 pages

ISBN:978-1-956792-04-1

Editor:
Kate Larson

Copyright © 2024 International Joint Conferences on Artificial Intelligence.

Sponsors

International Joint Conferences on Artifical Intelligence (IJCAI)

Publisher

Unknown publishers

Publication History

Published: 03 August 2024

Qualifiers

Research-article
Research
Refereed limited

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 18 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

View Table of Contents