Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3539024acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Three-Stage Root Cause Analysis for Logistics Time Efficiency via Explainable Machine Learning

Published: 14 August 2022 Publication History

Abstract

The performance of logistics highly depends on the time efficiency, and hence, plenty of efforts have been devoted to ensuring the on-time delivery in modern logistics industry. However, the delay in logistics transportation and delivery can still happen due to various practical issues, which significantly impact the quality of logistics service. In order to address this issue, this work investigates the root causes impacting the time efficiency, thereby facilitating the operation of logistics systems such that resources can be appropriately allocated to improve the performance. The proposed solution comprises three stages, where statistical methods are employed in the first stage to analyze the pattern of on-time delivery rate and detect the abnormalities induced by non-ideality of operations. Subsequently, a machine learning model is trained to capture the underlying correlations between time efficiency and potential impacting factors. Finally, explainable machine learning techniques are utilized to quantify the contributions of the impacting factors to the time efficiency, thereby recognizing the root causes. The proposed method is comprehensively studied on the real JD Logistics data through experiments, where it can identify the root causes that impact the time efficiency of logistics delivery with high accuracy. Furthermore, it is also demonstrated to outperform the baselines including a recent state-of-the-art method.

Supplemental Material

MP4 File
This paper leverages root cause analysis techniques to investigate the key impacting factors for the time efficiency of logistics. The proposed root cause analysis framework is constructed using explainable machine learning, where a machine learning model is trained to capture the global correlations between the impacting factors and time efficiency. Subsequently, the global statistics and the local characteristics of each event are strategically combined to achieve an accurate recognition. In the proposed framework, multiple mechanisms are incorporated to tackle the practical challenges. Based on the experimental results on real-world datasets, the proposed method outperforms a set of baselines including a recent state-of-the-art approach.

References

[1]
Kjersti Aas, Martin Jullum, and Anders Løland. 2021. Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. Artificial Intelligence, Vol. 298 (2021), 103502.
[2]
Ashraf Abdul, Jo Vermeulen, Danding Wang, Brian Y. Lim, and Mohan Kankanhalli. 2018. Trends and Trajectories for Explainable, Accountable and Intelligible Systems: An HCI Research Agenda. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--18.
[3]
Francis R. Bach and Michael I. Jordan. 2003. Learning spectral clustering. In Proceedings of International Conference on Neural Information Processing Systems. 305--312.
[4]
Sahar El Barky and Ahmed Hussein. 2015. Root Cause Analysis for the Overstock of Goods in Arab Marine Ports. International Journal of Operations and Logistics Management, Vol. 4, 3 (2015), 229--241.
[5]
Christopher M Bishop. 2006. Pattern recognition and machine learning .Springer.
[6]
Tadeusz Cali'ski and Harabasz JA. 1974. A dendrite method for cluster analysis. Communications in Statistics - Theory and Methods, Vol. 3 (01 1974), 1--27.
[7]
Ana Gómez-Andrades, Pablo Muñoz, Inmaculada Serrano, and Raquel Barco. 2016. Automatic Root Cause Analysis for LTE Networks Based on Unsupervised Techniques. IEEE Transactions on Vehicular Technology, Vol. 65, 4 (2016), 2369--2386.
[8]
Hao Huang, Shinjae Yoo, and Yunwen Xu. 2019. System Deterioration Detection and Root Cause Learning on Time Series Graphs. In Proceedings of ACM International Conference on Information and Knowledge Management. 2537--2545.
[9]
Jianfang Jiao, Weiting Zhen, Wenxiang Zhu, and Guang Wang. 2021. Quality-Related Root Cause Diagnosis Based on Orthogonal Kernel Principal Component Regression and Transfer Entropy. IEEE Transactions on Industrial Informatics, Vol. 17, 9 (2021), 6347--6356.
[10]
Bhavya Kailkhura, Brian Gallagher, Sookyung Kim, Anna Hiszpanski, and T. Yong-Jin Han. 2019. Reliable and explainable machine-learning methods for accelerated material discovery. npj Computational Materials, Vol. 5, 1 (2019).
[11]
Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of Advances in Neural Information Processing Systems, Vol. 30. 3149--3157.
[12]
Yaguo Lei, Feng Jia, Jing Lin, Saibo Xing, and Steven X. Ding. 2016. An Intelligent Fault Diagnosis Method Using Unsupervised Feature Learning Towards Mechanical Big Data. IEEE Transactions on Industrial Electronics, Vol. 63, 5 (2016), 3137--3147.
[13]
Yang Liu and Xin Li. 2018. Predictive Modeling for Advanced Virtual Metrology: A Tree-Based Approach. In Proceedings of IEEE International Conference on Emerging Technologies and Factory Automation, Vol. 1. 845--852.
[14]
Yang Liu, Liang Yan, Sheng Liu, Ting Jiang, Feng Zhang, Yu Wang, and Shengnan Wu. 2020. Enhancing Input Parameter Estimation by Machine Learning for The Simulation of Large-Scale Logistics Networks. In Proceedings of Winter Simulation Conference. 608--619.
[15]
Scott M. Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In Proceedings of the International Conference on Neural Information Processing Systems. 4768--4777.
[16]
Diana López-Soto, Soumaya Yacout, and Francisco Angel-Bello. 2016. Root cause analysis of familiarity biases in classification of inventory items based on logical patterns recognition. Computers & Industrial Engineering, Vol. 93 (2016), 121--130.
[17]
Liang Ma, Jie Dong, Kaixiang Peng, and Chuanfang Zhang. 2019. Hierarchical Monitoring and Root-Cause Diagnosis Framework for Key Performance Indicator-Related Multiple Faults in Process Industries. IEEE Transactions on Industrial Informatics, Vol. 15, 4 (2019), 2091--2100.
[18]
Ricards Marcinkevics and Julia E. Vogt. 2020. Interpretability and Explainability: A Machine Learning Zoo Mini-tour. ArXiv, Vol. abs/2012.01805 (2020).
[19]
Alexander Marx and Jilles Vreeken. 2017. Telling Cause from Effect Using MDL-Based Local and Global Regression. In Proceedings of IEEE International Conference on Data Mining. 307--316.
[20]
Alexander Marx and Jilles Vreeken. 2019. Identifiability of Cause and Effect Using Regularized Regression. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 852--861.
[21]
D. C. Montgomery and G. C. Runger. 2013. Applied Statistics and Probability for Engineers 6th ed.). WILEY.
[22]
Renjian Pan, Zhaobo Zhang, Xin Li, Krishnendu Chakrabarty, and Xinli Gu. 2021. Unsupervised Two-Stage Root-Cause Analysis for Integrated Systems. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2021), 1--1.
[23]
Zhongang Qi, Saeed Khorram, and Li Fuxin. 2021. Embedding deep networks into visual explanations. Artificial Intelligence, Vol. 292 (2021), 103435.
[24]
Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. "Why Should I Trust You?": Explaining the Predictions of Any Classifier. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1135--1144.
[25]
Cyriana M.A. Roelofs, Marc-Alexander Lutz, Stefan Faulstich, and Stephan Vogt. 2021. Autoencoder-based anomaly root cause analysis for wind turbines. Energy and AI, Vol. 4 (2021), 100065.
[26]
S. Salvador and P. Chan. 2004. Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In Proceedings of IEEE International Conference on Tools with Artificial Intelligence. 576--584.
[27]
Matthias Schmidt, Janine Tatjana Maier, and Lasse Härtel. 2019. Data based root cause analysis for improving logistic key performance indicators of a company's internal supply chain. Procedia CIRP, Vol. 86 (2019), 276--281.
[28]
Avanti Shrikumar, Peyton Greenside, and Anshul Kundaje. 2017. Learning Important Features Through Propagating Activation Differences. In Proceedings of the International Conference on Machine Learning, Vol. 70. 3145--3153.
[29]
Marc Solé, Victor Munté s-Mulero, Annie Ibrahim Rana, and Giovani Estrada. 2017. Survey on Models and Techniques for Root-Cause Analysis. CoRR, Vol. abs/1701.08546 (2017). showeprint[arXiv]1701.08546 http://arxiv.org/abs/1701.08546
[30]
Ilaria Tiddi and Stefan Schlobach. 2022. Knowledge graphs as tools for explainable machine learning: A survey. Artificial Intelligence, Vol. 302 (2022), 103627.
[31]
Joel A. Tropp and Anna C. Gilbert. 2007. Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit. IEEE Transactions on Information Theory, Vol. 53, 12 (2007), 4655--4666.
[32]
Danding Wang, Qian Yang, Ashraf Abdul, and Brian Y. Lim. 2019. Designing Theory-Driven User-Centric Explainable AI. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--15.
[33]
Rui Wang, Xiaoqian Wang, and David I. Inouye. 2021 b. Shapley Explanation Networks. In International Conference on Learning Representations .
[34]
Yuan Wang, Shiqi Hao, Yang Liu, Shiyan Hu, and Wenming Zhe. 2021 a. Delay Aware Dynamic Risk Assessment for Logistics Delivery. In Proceedings of IEEE International Symposium on System Engineering. 1--8.
[35]
Fangming Ye, Zhaobo Zhang, Krishnendu Chakrabarty, and Xinli Gu. 2013. Board-Level Functional Fault Diagnosis Using Artificial Neural Networks, Support-Vector Machines, and Weighted-Majority Voting. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 32, 5 (2013), 723--736.
[36]
Fangming Ye, Zhaobo Zhang, Krishnendu Chakrabarty, and Xinli Gu. 2015. Information-Theoretic Syndrome Evaluation, Statistical Root-Cause Analysis, and Correlation-Based Feature Selection for Guiding Board-Level Fault Diagnosis. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 34, 6 (2015), 1014--1026.
[37]
Fangming Ye, Zhaobo Zhang, Krishnendu Chakrabarty, and Xinli Gu. 2016. Adaptive Board-Level Functional Fault Diagnosis Using Incremental Decision Trees. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 35, 2 (2016), 323--336.
[38]
K. Zhang, M. Kalander, M. Zhou, X. Zhang, and J. Ye. 2021. An Influence-Based Approach for Root Cause Alarm Discovery in Telecom Networks. In Proceedings of Service-Oriented Computing - ICSOC Workshops, Vol. 12632.
[39]
Zhaobo Zhang, Krishnendu Chakrabarty, Zhanglei Wang, Zhiyuan Wang, and Xinli Gu. 2011. Smart diagnosis: Efficient board-level diagnosis and repair using artificial neural networks. In Proceedings of IEEE International Test Conference. 1--9.
[40]
Zhaobo Zhang, Xinli Gu, Yaohui Xie, Zhiyuan Wang, Zhanglei Wang, and Krishnendu Chakrabarty. 2012. Diagnostic system based on support-vector machines for board-level functional diagnosis. In Proceedings of IEEE European Test Symposium. 1--6.

Cited By

View all
  • (2024)Illuminating the Gray Zone: Non-intrusive Gray Failure Localization in Server Operating SystemsCompanion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering10.1145/3663529.3663834(126-137)Online publication date: 10-Jul-2024
  • (2024)Status, challenges, and promises of data‐driven battery lifetime prediction under cyber‐physical system contextIET Cyber-Physical Systems: Theory & Applications10.1049/cps2.120869:3(207-217)Online publication date: 31-Jan-2024
  • (2024)Survival Factors Analysis of Out-of-Hospital Cardiac Arrest Patients via Effective Data Cleaning Techniques and Explainable Machine LearningTechnologies and Applications of Artificial Intelligence10.1007/978-981-97-1714-9_10(116-130)Online publication date: 28-Mar-2024
  • Show More Cited By

Index Terms

  1. Three-Stage Root Cause Analysis for Logistics Time Efficiency via Explainable Machine Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
    August 2022
    5033 pages
    ISBN:9781450393850
    DOI:10.1145/3534678
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 August 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. diagnosis
    2. explainable machine learning
    3. logistics operation
    4. root cause analysis

    Qualifiers

    • Research-article

    Conference

    KDD '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)115
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Illuminating the Gray Zone: Non-intrusive Gray Failure Localization in Server Operating SystemsCompanion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering10.1145/3663529.3663834(126-137)Online publication date: 10-Jul-2024
    • (2024)Status, challenges, and promises of data‐driven battery lifetime prediction under cyber‐physical system contextIET Cyber-Physical Systems: Theory & Applications10.1049/cps2.120869:3(207-217)Online publication date: 31-Jan-2024
    • (2024)Survival Factors Analysis of Out-of-Hospital Cardiac Arrest Patients via Effective Data Cleaning Techniques and Explainable Machine LearningTechnologies and Applications of Artificial Intelligence10.1007/978-981-97-1714-9_10(116-130)Online publication date: 28-Mar-2024
    • (2023)Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human FeedbackProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599934(5116-5125)Online publication date: 6-Aug-2023
    • (2023)Towards Equitable Assignment: Data-Driven Delivery Zone Partition at Last-mile LogisticsProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599915(4078-4088)Online publication date: 6-Aug-2023
    • (2023)Opportunities and Challenges of Scheduling in Logistics Industrial Park Cyber-Physical SystemsIEEE Transactions on Industrial Cyber-Physical Systems10.1109/TICPS.2023.33301181(322-334)Online publication date: 2023
    • (2023)Artificial Intelligence in Smart Logistics Cyber-Physical Systems: State-of-The-Arts and Potential ApplicationsIEEE Transactions on Industrial Cyber-Physical Systems10.1109/TICPS.2023.32832301(1-20)Online publication date: 2023
    • (2023)Counterfactual Root Cause Analysis via Anomaly Detection and Causal Graphs2023 IEEE 21st International Conference on Industrial Informatics (INDIN)10.1109/INDIN51400.2023.10218245(1-7)Online publication date: 18-Jul-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media