Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3217197.3217206acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

High-Throughput Neuroanatomy and Trigger-Action Programming: A Case Study in Research Automation

Published: 11 June 2018 Publication History

Abstract

Exponential increases in data volumes and velocities are overwhelming finite human capabilities. Continued progress in science and engineering demands that we automate a broad spectrum of currently manual research data manipulation tasks, from data transfer and sharing to acquisition, publication, and analysis. These needs are particularly evident in large-scale experimental science, in which researchers are typically granted short periods of instrument time and must maximize experiment efficiency as well as output data quality and accuracy. To address the need for automation, which is pervasive across science and engineering, we present our experiences using Trigger-Action-Programming to automate a real-world scientific workflow. We evaluate our methods by applying them to a neuroanatomy application in which a synchrotron is used to image cm-scale mouse brains with sub-micrometer resolution. In this use case, data is acquired in real-time at the synchrotron and are automatically passed through a complex automation flow that involves reconstruction using HPC resources, human-in-the-loop coordination, and finally data publication and visualization. We describe the lessons learned from these experiences and outline the design for a new research automation platform.

References

[1]
{n. d.}. Airflow. ({n. d.}). https://airflow.apache.org/. Accessed April 1, 2018.
[2]
{n. d.}. Amazon Simple Workflow Service. ({n. d.}). https://aws.amazon.com/swf/. Accessed April 1, 2018.
[3]
{n. d.}. Amazon Step Functions. ({n. d.}). https://aws.amazon.com/step-functions. Accessed April 1, 2018.
[4]
{n. d.}. Conductor. ({n. d.}). https://netflix.github.io/conductor/. Accessed April 1, 2018.
[5]
{n. d.}. If This Then That. ({n. d.}). https://platform.ifttt.com/docs. Accessed April 1, 2018.
[6]
{n. d.}. Neuroglancer. ({n. d.}). https://github.com/google/neuroglancer. Accessed April 1, 2018.
[7]
Moustafa AbdelBaky, Javier Diaz-Montes, and Manish Parashar. 2017. Software-defined environments for science and engineering. The International Journal of High Performance Computing Applications (2017), 1094342017710706.
[8]
Rachana Anathankrishnan, Kyle Chard, Ian Foster, Mattias Lidman, Brendan McCollam, Stephen Rosen, and Steven Tuecke. 2016. Globus Auth: A Research Identity and Access Management Platform.
[9]
Yadu Babuji, Alison Brizius, Kyle Chard, Ian Foster, Daniel S. Katz, Michael Wilde, and Justin Wozniak. 2017. Introducing Parsl: A Python Parallel Scripting Library. (Aug. 2017).
[10]
R. Chard, K. Chard, J. Alt, D. Y. Parkinson, S. Tuecke, and I. Foster. 2017. Ripple: Home Automation for Research Data Management. In 37th IEEE International Conference on Distributed Computing Systems Workshops (ICDCSW). 389--394.
[11]
R. Chard, K. Chard, S. Tuecke, and I. Foster. 2017. Software Defined Cyberinfrastructure for Data Management. In 13th IEEE International Conference on e-Science (e-Science). 456--457.
[12]
Francesco De Carlo, Doga Gürsoy, Federica Marone, Mark Rivers, Dilworth Y Parkinson, Faisal Khan, Nicholas Schwarz, David J Vine, Stefan Vogt, S-C Gleber, et al. 2014. Scientific data exchange: a schema for HDF5-based storage of raw and analyzed data. Journal of synchrotron radiation 21, 6 (2014), 1224--1230.
[13]
E. Deelman, K. Vahi, G. Juve, M. Rynge, S. Callaghan, P.J. Maechling, R. Mayani, W. Chen, R.F. da Silva, M. Livny, et al. 2015. Pegasus, a workflow management system for science automation. Future Generation Computer Systems 46 (2015), 17--35.
[14]
Ming Du, Rafael Vescovi, Ryan Chard, Narayanan Kasthuri, Chris Jacobsen, Eva Dyer, and Doga Gursoy. 2018. An Automated Pipeline for the Collection, Transfer, and Processing of Large-scale Tomography Data. In Biophotonics Congress: Biomedical Optics Congress 2018 (Microscopy/Translational/Brain/OTS). Optical Society of America, BF4C.2.
[15]
I. Foster, B. Blaiszik, K. Chard, and R. Chard. 2017. Software Defined Cyberinfrastructure. In 37th IEEE International Conference on Distributed Computing Systems (ICDCS). 1808--1814.
[16]
Doga Gürsoy, Francesco De Carlo, Xianghui Xiao, and Chris Jacobsen. 2014. TomoPy: A framework for the analysis of synchrotron tomographic data. Journal of Synchrotron Radiation 21, 5 (2014), 1188--1193.
[17]
Justin Huang and Maya Cakmak. 2015. Supporting Mental Model Accuracy in Trigger-action Programming. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp '15). ACM, New York, NY, USA, 215--225.
[18]
Narayanan Kasthuri, Kenneth Jeffrey Hayworth, Daniel Raimund Berger, Richard Lee Schalek, José Angel Conchello, Seymour Knowles-Barley, Dongil Lee, Amelio Vázquez-Reina, Verena Kaynig, Thouis Raymond Jones, et al. 2015. Saturated reconstruction of a volume of neocortex. Cell 162, 3 (2015), 648--661.
[19]
Thomas Leibovici. 2015. Taking back control of HPC file systems with Robinhood Policy Engine. arXiv preprint arXiv:1505.01448 (2015).
[20]
Michael J Litzkow, Miron Livny, and Matt W Mutka. 1988. Condor--a hunter of idle workstations. In 8th International Conference on Distributed Computing Systems. IEEE, 104--111.
[21]
Bertram Ludäscher, Ilkay Altintas, Chad Berkley, Dan Higgins, Efrat Jaeger, Matthew Jones, Edward A Lee, Jing Tao, and Yang Zhao. 2006. Scientific workflow management and the Kepler system. Concurrency and Computation: Practice and Experience 18, 10 (2006), 1039--1065.
[22]
Ayman Meidan, Julián Alberto García-García, MJ Escalona, and I Ramos. 2017. A survey on business processes management suites. Computer Standards & Interfaces 51 (2017), 71--86.
[23]
Tom Oinn, Matthew Addis, Justin Ferris, Darren Marvin, Martin Senger, Mark Greenwood, Tim Carver, Kevin Glover, Matthew R Pocock, Anil Wipat, et al. 2004. Taverna: A tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20, 17 (2004), 3045--3054.
[24]
Arnab K. Paul, Steven Tuecke, Ryan Chard, Ali R. Butt, Kyle Chard, and Ian Foster. 2017. Toward Scalable Monitoring on Large-scale Storage for Software Defined Cyberinfrastructure. In 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems (PDSW-DISCS '17). ACM, New York, NY, USA, 49--54.
[25]
Arcot Rajasekar, Reagan Moore, Chien-yi Hou, Christopher A Lee, Richard Marciano, Antoine de Torcy, Michael Wan, Wayne Schroeder, Sheau-Yen Chen, Lucas Gilbert, Paul Tooby, and Bing Zhu. 2010. iRODS Primer: Integrated rule-oriented data system. Synthesis Lectures on Information Concepts, Retrieval, and Services 2, 1 (2010), 1--143.
[26]
Mark L Rivers. 2012. tomoRecon: High-speed tomography reconstruction on workstations using multi-threading. In Developments in X-Ray Tomography VIII, Vol. 8506. International Society for Optics and Photonics, 85060U.
[27]
RFC Vescovi, MB Cardoso, and EX Miqueles. 2017. Radiography registration for mosaic tomography. Journal of synchrotron radiation 24, 3 (2017).
[28]
M. Wilde, M. Hategan, J.M. Wozniak, B. Clifford, D.S. Katz, and I. Foster. 2011. Swift: A language for distributed parallel scripting. Parallel Comput. 37, 9 (2011), 633--652.

Cited By

View all
  • (2023)Globus automation servicesFuture Generation Computer Systems10.1016/j.future.2023.01.010142:C(393-409)Online publication date: 1-May-2023
  • (2022)RecipeGen++: an automated trigger action programs generatorProceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3540250.3558913(1672-1676)Online publication date: 7-Nov-2022
  • (2022)WRS: Workflow Retrieval System for Cloud Automatic RemediationNOMS 2022-2022 IEEE/IFIP Network Operations and Management Symposium10.1109/NOMS54207.2022.9789843(1-10)Online publication date: 25-Apr-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
AI-Science'18: Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science
June 2018
53 pages
ISBN:9781450358620
DOI:10.1145/3217197
Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Neuroanatomy
  2. Research Automation
  3. Ripple

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

HPDC '18
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)2
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Globus automation servicesFuture Generation Computer Systems10.1016/j.future.2023.01.010142:C(393-409)Online publication date: 1-May-2023
  • (2022)RecipeGen++: an automated trigger action programs generatorProceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3540250.3558913(1672-1676)Online publication date: 7-Nov-2022
  • (2022)WRS: Workflow Retrieval System for Cloud Automatic RemediationNOMS 2022-2022 IEEE/IFIP Network Operations and Management Symposium10.1109/NOMS54207.2022.9789843(1-10)Online publication date: 25-Apr-2022
  • (2021)Understanding Trigger-Action Programs Through Novel Visualizations of Program DifferencesProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445567(1-17)Online publication date: 6-May-2021
  • (2020)Visualizing Differences to Improve End-User Understanding of Trigger-Action ProgramsExtended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems10.1145/3334480.3382940(1-10)Online publication date: 25-Apr-2020
  • (2020)Learning to Recommend Trigger-Action Rules for End-User DevelopmentReuse in Emerging Software Engineering Practices10.1007/978-3-030-64694-3_12(190-207)Online publication date: 1-Dec-2020
  • (2019)ParslProceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing10.1145/3307681.3325400(25-36)Online publication date: 17-Jun-2019
  • (2019)DLHub: Model and Data Serving for Science2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2019.00038(283-292)Online publication date: May-2019

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media