Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3569951.3597599acmconferencesArticle/Chapter ViewAbstractPublication PagespearcConference Proceedingsconference-collections
short-paper
Open access

Scalable and Reproducible Virtual Screening through an API-Integrated Workflow

Published: 10 September 2023 Publication History

Abstract

Virtual screening is a key step of the drug discovery process which utilizes computational resources to simulate the behavior of small molecules in the binding site of a target protein. [13] Researchers often test millions of molecules when searching for an early hit compound, requiring significant CPU hours. An accessible, convenient, fast, and computationally efficient means for virtual screening is desirable in order for researchers to conserve resources in the early phase of drug discovery. We developed an application programming interface (API) integrated workflow that allows researchers to submit virtual screening batch jobs to the Lonestar6 supercomputer through a web portal. The containerized [7] workflow employs parallelized Python scripting using mpi4py [2] to efficiently distribute molecular docking tasks performed by AutoDock Vina. [3] The Texas Advanced Computing Center (TACC) API (TAPIS) framework [16], a REST API framework for research computing, was used to integrate the workflow into the University of Texas System Research Cyberinfrastructure (UTRC) web portal. [12] Five large libraries representing commercially-available small molecules or fragments were prepared and are available for screening. Here, we discuss our experience developing this service, as well as the results of extensive internal benchmarks to determine the most efficient parallelization scheme to employ for each molecule library when submitting batch jobs. Regardless of the chosen ligand library, the core, node, and parallel task specifications allow the user to run a virtual drug screening and receive their resulting top docking scores in 24 hours. The service is available to registered academic users, and more information can be found at the Drug Discovery at TACC website. [18]

References

[1]
Jason B Cross, Brajesh K Rai David C Thompson, J Christian Baber, Yongbo Hu Kristi Yi Fan, and Christine Humblet. 2009. Comparison of Several Molecular Docking Programs: Pose Prediction and Virtual Screening Accuracy. Journal of Chemical Information and Modeling 49, 6 (June 2009), 1455–1474. https://doi.org/10.1021/ci900056c
[2]
Lisandro Dalcin and Yao-Lung L. Fang. 2021. mpi4py: Status Update After 12 Years of Development. Computing in Science & Engineering 23, 4 (2021), 47–54. https://doi.org/10.1109/MCSE.2021.3083216
[3]
Jerome Eberhardt, Diogo Santos-Martins, Andreas F Tillack, and Stefano Forli. 2021. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. Journal of Chemical Information and Modeling. 61, 8 (June 2021), 3891–3898. https://doi.org/10.1021/acs.jcim.1c00203
[4]
Niu Huang, Brian K Shoichet, and John J Irwin. 2006. Benchmarking Sets for Molecular Docking. Journal of Medical Chemistry 49 (Nov. 2006), 6789–6801. https://doi.org/10.1021/jm0608356
[5]
John J. Irwin, Khanh G. Tang, Jennifer Young, Chinzorig Dandarchuluun, Benjamin R. Wong, Munkhzul Khurelbaatar, Yurii S. Moroz, John Mayfield, and Roger A. Sayle. 2020. ZINC20—A Free Ultralarge-Scale Chemical Database for Ligand Discovery. Journal of Chemical Information and Modeling 60, 12 (Oct. 2020), 6065–6073. https://doi.org/10.1021/acs.jcim.0c00675
[6]
Brian P. Kelley, Scott P. Brown, Gregory L. Warren, and Steven W. Muchmore. 2015. POSIT: Flexible Shape-Guided Docking For Pose Prediction.Journal of Chemical Information and Modeling 55 (2015), 1771–1780. https://doi.org/10.1021/acs.jcim.5b00142
[7]
Gregory M. Kurtzer, Cclerget, Michael Bauer, Ian Kaneshiro, David Trudgian, and David Godlove. 2021. hpcng/singularity: Singularity 3.7.3. https://doi.org/10.5281/ZENODO.4667718
[8]
Katherine A. Lawrence, Michael Zentner, Nancy Wilkins-Diehr, Julie A. Wernert, Marlon Pierce, Suresh Marru, and Scott Michael. 2015. Science gateways today and tomorrow: positive perspectives of nearly 5000 members of the research community. Concurrency and Computation: Practice and Experience 27, 16 (May 2015), 4252–4268. https://doi.org/10.1002/cpe.3526
[9]
Enamine Ltd.2023. Screening Collection. https://enamine.net/compound-collections/screening-collection.
[10]
Garrett M Morris, Ruth Huey, William Lindstrom, Michel F Sanner, Richard K Belew, David S Goodsell, and Arthur J Olson. 2009. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. Journal of Computational Chemistry 30, 16 (Dec. 2009), 2785–2791. https://doi.org/10.1002/jcc.21256
[11]
Bhushan Nagar, William G. Bornmann, Patricia Pellicena, Thomas Schindler, Darren R. Veach, W. Todd Miller, Bayard Clarkson, and John Kuriyan. 2002. Crystal Structures of the Kinase Domain of c-Abl in Complex with the Small Molecule Inhibitors PD173955 and Imatinib (STI-571)1. Cancer Research 62, 15 (08 2002), 4236–4243. arXiv:https://aacrjournals.org/cancerres/article-pdf/62/15/4236/2496540/ch1502004236.pdf
[12]
The University of Texas. 2020. University of Texas Research Cyberinfrastructure Portal. https://utrc.tacc.utexas.edu/.
[13]
Nataraj S. Pagadala, Khajamohiddin Syed, and Jack Tuszynski. 2017. Software for molecular docking: a review. Biophysical Reviews 9, 2 (Jan. 2017), 91–102. https://doi.org/10.1007/s12551-016-0247-1
[14]
Eric F Pettersen, Conrad C Huang Thomas D Goddard, Elaine C Meng, Gregory S Couch, Tristan I Croll, John H Morris, and Thomas E Ferrin. 2021. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 30, 1 (Jan. 2021), 70–82. https://doi.org/10.1002/pro.3943
[15]
Thomas Schindler, William Bornmann, Patricia Pellicena, W. Todd Miller, Bayard Clarkson, and John Kuriyan. 2000. Structural Mechanism for STI-571 Inhibition of Abelson Tyrosine Kinase. Science 289, 5486 (Sept. 2000), 1938–1942. https://doi.org/10.1126/science.289.5486.1938
[16]
Joe Stubbs, Richard Cardone, Mike Packard, Anagha Jamthe, Smruti Padhy, Steve Terry, Julia Looney, Joseph Meiring, Steve Black, Maytal Dahan, Sean Cleveland, and Gwen Jacobs. 2021. Tapis: An API Platform for Reproducible, Distributed Computational Research. In Advances in Intelligent Systems and Computing. Springer International Publishing, Vancouver, Canada, 878–900. https://doi.org/10.1007/978-3-030-73100-7_61
[17]
TACC. 2023. Autodock-Vina-App. https://github.com/TACC/autodock-vina-app.
[18]
TACC. 2023. Drug Discovery at TACC. https://drugdiscovery.tacc.utexas.edu/home.
[19]
Oleg Trott and Arthur J Olson. 2010. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of computational chemistry 31, 2 (Jan. 2010), 455–461. https://doi.org/10.1002/jcc.21334
[20]
Guido Van Rossum and Fred L. Drake. 2009. Python 3 Reference Manual. CreateSpace, Scotts Valley, CA.
[21]
Usha Viswanathan, Suzanne M. Tomlinson, John M. Fonner, Stephen A. Mock, and Stanley J. Watowich. 2014. Identification of a Novel Inhibitor of Dengue Virus Protease through Use of a Virtual Screening Drug Discovery Web Portal. Journal of Chemical Information Modeling 54 (Sept. 2014), 2816–2825. https://doi.org/10.1021/ci500531r

Cited By

View all
  • (2023)Perspectives and Experiences Supporting Containers for Research Computing at the Texas Advanced Computing CenterProceedings of the SC '23 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624587(155-164)Online publication date: 12-Nov-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PEARC '23: Practice and Experience in Advanced Research Computing 2023: Computing for the Common Good
July 2023
519 pages
ISBN:9781450399852
DOI:10.1145/3569951
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 September 2023

Check for updates

Author Tags

  1. API
  2. Apptainer
  3. Bioinformatics
  4. Containers
  5. Datasets
  6. High-performance computing
  7. MPI
  8. Molecular Docking
  9. Visualization

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

PEARC '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 133 of 202 submissions, 66%

Upcoming Conference

PEARC '25
Practice and Experience in Advanced Research Computing
July 20 - 24, 2025
Columbus , OH , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)226
  • Downloads (Last 6 weeks)32
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Perspectives and Experiences Supporting Containers for Research Computing at the Texas Advanced Computing CenterProceedings of the SC '23 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624587(155-164)Online publication date: 12-Nov-2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media