Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3437984.3458834acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

AutoAblation: Automated Parallel Ablation Studies for Deep Learning

Published: 26 April 2021 Publication History

Abstract

Ablation studies provide insights into the relative contribution of different architectural and regularization components to machine learning models' performance. In this paper, we introduce AutoAblation, a new framework for the design and parallel execution of ablation experiments. AutoAblation provides a declarative approach to defining ablation experiments on model architectures and training datasets, and enables the parallel execution of ablation trials. This reduces the execution time and allows more comprehensive experiments by exploiting larger amounts of computational resources. We show that AutoAblation can provide near-linear scalability by performing an ablation study on the modules of the Inception-v3 network trained on the TenGeoPSAR dataset.

References

[1]
M. Abadi et al. 2016. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 265--283.
[2]
D. Berthelot et al. 2019. MixMatch: A Holistic Approach to Semi-Supervised Learning. arXiv preprint arXiv:1905.02249 (2019).
[3]
N. Carlson et al. 2009. Psychology: the Science of Behavior. Pearson.
[4]
B. Chambers and M. Zaharia. 2018. Spark: The Definitive Guide: Big Data Processing Made Simple. O'Reilly Media, Inc.
[5]
F. Chollet et al. 2015. Keras.
[6]
J. Deng et al. 2009. Imagenet: A Large-Scale Hierarchical Image Database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 248--255.
[7]
A. Erdem et al. 2019. Leave One Feature Out Importance. https://github.com/aerdem4/lofo-importance.
[8]
W. A. Falcon et al. 2019. PyTorch Lightning. GitHub. https://github.com/PyTorchLightning/pytorch-lightning 3 (2019).
[9]
R. Girshick et al. 2014. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 580--587.
[10]
M. Hessel et al. 2018. Rainbow: Combining improvements in deep reinforcement learning. In 33 AAAI Conference on Artificial Intelligence.
[11]
E. Horvitz et al. 2003. Learning and reasoning about interruption. In Proceedings of the 5th International Conference on Multimodal Interfaces. ACM, 20--27.
[12]
Y. LeCun. 1998. The MNIST Database of Handwritten Digits. http://yann.lecun.com/exdb/mnist/.
[13]
Z. C. Lipton and J. Steinhardt. 2018. Troubling trends in machine learning scholarship. arXiv preprint arXiv:1807.03341 (2018).
[14]
S. M. Lundberg and SI. Lee. 2017. A Unified Approach to Interpreting Model Predictions. Advances in Neural Information Processing Systems 30 (2017), 4765--4774.
[15]
M. Meister et al. 2020. Maggy: Scalable Asynchronous Parallel Hyperparameter Search. In Workshop on Distributed Machine Learning. 28--33.
[16]
M. Meister et al. 2020. Towards Distribution Transparency for Supervised ML With Oblivious Training Functions. In Workshop on MLOps Systems.
[17]
R. Meyes et al. 2019. Ablation Studies in Artificial Neural Networks. arXiv preprint arXiv:1901.08644 (2019).
[18]
P. Moritz et al. 2018. Ray: A Distributed Framework for Emerging AI Applications. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). 561--577.
[19]
T. O'Malley et al. 2019. Keras Tuner. https://github.com/keras-team/keras-tuner.
[20]
A. Paszke et al. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. Advances in Neural Information Processing Systems 32 (2019), 8026--8037.
[21]
M. T. Ribeiro et al. 2016. "Why Should I Trust You?": Explaining the Predictions of Any Classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1135--1144.
[22]
M. Richardson et al. 2006. Beyond PageRank: Machine Learning for Static Ranking. In Proceedings of the 15th International Conference on World Wide Web. ACM, 707--715.
[23]
T. Sellam et al. 2019. DeepBase: Deep Inspection of Neural Networks. In Proceedings of the 2019 International Conference on Management of Data. 1117--1134.
[24]
S. Sheikholeslami. 2019. Ablation Programming for Machine Learning. Master's thesis.
[25]
C. Szegedy et al. 2016. Rethinking the Inception Architecture for Computer Vision. In IEEE Conference on Computer Vision and Pattern Recognition. 2818--2826.
[26]
C. Wang et al. 2019. A labelled ocean SAR imagery dataset of ten geophysical phenomena from Sentinel-1 wave mode. Geoscience Data Journal 6, 2 (2019), 105--115.
[27]
J. Wexler et al. 2019. The What-If Tool: Interactive Probing of Machine Learning Models. arXiv preprint arXiv:1907.04135 (2019).
[28]
L. Yang et al. 2017. Open Sourcing TensorFlowOnSpark: Distributed Deep Learning on Big-Data Clusters.
[29]
M. Zaharia et al. 2010. Spark: Cluster Computing with Working Sets. HotCloud 10, 10--10 (2010), 95.

Cited By

View all
  • (2024)Energy Demand in AR Applications—A Reverse Ablation Study of the HoloLens 2 DeviceEnergies10.3390/en1703055317:3(553)Online publication date: 23-Jan-2024
  • (2024)Insights and Considerations in Development and Performance Evaluation of Generative Adversarial Networks (GANs): What Radiologists Need to KnowDiagnostics10.3390/diagnostics1416175614:16(1756)Online publication date: 13-Aug-2024
  • (2024)Discriminative spatial-temporal feature learning for modeling network intrusion detection systemsJournal of Computer Security10.3233/JCS-22003132:1(1-30)Online publication date: 2-Feb-2024
  • Show More Cited By

Index Terms

  1. AutoAblation: Automated Parallel Ablation Studies for Deep Learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      EuroMLSys '21: Proceedings of the 1st Workshop on Machine Learning and Systems
      April 2021
      130 pages
      ISBN:9781450382984
      DOI:10.1145/3437984
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 26 April 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Ablation Studies
      2. Deep Learning
      3. Feature Ablation
      4. Model Ablation
      5. Parallel Trial Execution

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      Conference

      EuroSys '21
      Sponsor:

      Acceptance Rates

      EuroMLSys '21 Paper Acceptance Rate 18 of 26 submissions, 69%;
      Overall Acceptance Rate 18 of 26 submissions, 69%

      Upcoming Conference

      EuroSys '25
      Twentieth European Conference on Computer Systems
      March 30 - April 3, 2025
      Rotterdam , Netherlands

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)203
      • Downloads (Last 6 weeks)24
      Reflects downloads up to 30 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Energy Demand in AR Applications—A Reverse Ablation Study of the HoloLens 2 DeviceEnergies10.3390/en1703055317:3(553)Online publication date: 23-Jan-2024
      • (2024)Insights and Considerations in Development and Performance Evaluation of Generative Adversarial Networks (GANs): What Radiologists Need to KnowDiagnostics10.3390/diagnostics1416175614:16(1756)Online publication date: 13-Aug-2024
      • (2024)Discriminative spatial-temporal feature learning for modeling network intrusion detection systemsJournal of Computer Security10.3233/JCS-22003132:1(1-30)Online publication date: 2-Feb-2024
      • (2024)Leveraging graph neural networks for supporting automatic triage of patientsScientific Reports10.1038/s41598-024-63376-214:1Online publication date: 31-May-2024
      • (2024)Manufacturing Line Ablation, an approach to perform reliable early predictionProcedia Computer Science10.1016/j.procs.2024.01.075232:C(752-765)Online publication date: 2-Jul-2024
      • (2024)Optimal performance of Binary Relevance CNN in targeted multi-label text classificationKnowledge-Based Systems10.1016/j.knosys.2023.111286284:COnline publication date: 17-Apr-2024
      • (2024)Rolling the dice for better deep learning performanceInformation Sciences: an International Journal10.1016/j.ins.2024.120500667:COnline publication date: 1-May-2024
      • (2024)What do end-to-end speech models learn about speaker, language and channel information? A layer-wise and neuron-level analysisComputer Speech and Language10.1016/j.csl.2023.10153983:COnline publication date: 1-Jan-2024
      • (2024)HPExplorer: XAI Method to Explore the Relationship Between Hyperparameters and Model PerformanceMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70378-2_20(319-334)Online publication date: 22-Aug-2024
      • (2024)An Exploration of Diabetic Foot Osteomyelitis X-ray Data for Deep Learning ApplicationsArtificial Intelligence in Medicine10.1007/978-3-031-66535-6_4(30-39)Online publication date: 25-Jul-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media