Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1143997.1144042acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
Article

Genetic programming for human oral bioavailability of drugs

Published: 08 July 2006 Publication History

Abstract

Automatically assessing the value of bioavailability from the chemical structure of a molecule is a very important issue in biomedicine and pharmacology. In this paper, we present an empirical study of some well known Machine Learning techniques, including various versions of Genetic Programming, which have been trained to this aim using a dataset of molecules with known bioavailability. Genetic Programming has proven the most promising technique among the ones that have been considered both from the point of view of the accurateness of the solutions proposed, of the generalization capabilities and of the correlation between predicted data and correct ones. Our work represents a first answer to the demand for quantitative bioavailability estimation methods proposed in literature, since the previous contributions focus on the classification of molecules into classes with similar bioavailability.

References

[1]
Accelrys Inc., the world leader in cheminformatics for drug development. See www.accelrys.com.]]
[2]
Andrews, C. W., Bennett, L. & Yu, L. X. Predicting human oral bioavailability of a compound: development of a novel quantitative structure-bioavailability relationship. Pharm. Res. 17, 639--644 (2000).]]
[3]
Akaike, H., 2nd International Symposium on Information Theory, Chapter Information theory and an extension of maximum likelihood principle, pp. 267--281, 1973. Akademia Kiado.]]
[4]
Bains W, Gilbert R, Sriridenko L, et al., Evolutionary computational methods to predict oral bioavailability QSPRs, Curr Opin Drug Discov. Devel., 2002, Jan 5(1):44--51.]]
[5]
Cagnoni S., D. Rivero and L. Vanneschi. A purely-evolutionary memetic algorithm as a first step towards symbiotic coevolution. In Proceedings of the 2005 IEEE Congress on Evolutionary Computation (CEC 2005), pages 1156--1163, Edinburgh, Scotland, 2005. IEEE Press, Piscataway, NJ.]]
[6]
Chemical Information Systems Inc., the company that introduced SMILES molecule representation. See http://www.daylight.com/dayhtml/smiles.]]
[7]
Drug Bank, a recently developed database of FDA approved and experimental drugs. See http://redpoll.pharmacy.ualberta.ca/drugbank/.]]
[8]
Fröhlich, J. Wegner, F. Sieker, A. Zell: Kernel Functions for Attributed Molecular Graphs - A New Similarity Based Approach To ADME Prediction in Classification and Regression, QSAR & Combinatorial Science, 2005.]]
[9]
Hall, M. A. 1998. Correlation-based Feature Selection for Machine Learning. Ph.D diss. Hamilton, NZ: Waikato University, Department of Computer Science.]]
[10]
Haykin S., Neural Networks: a comprehensive foundation. Prentice Hall, London, UK, 1999.]]
[11]
I.T. Jolliffe, Principal Component Analysis, Second edition, Springer series in statistics.]]
[12]
Keijzer M., Improving symbolic regression with interval arithmetic and linear scaling. In C. Ryan et al. editors, Genetic Programming, Proceedings of the 6th European Conference, EuroGP 2003, volume 2610, of LNCS, pages 71--83, Essex, 2003. Springer, Berlin, Heidelberg, New York.]]
[13]
Kennedy, T. Managing the drug discovery development interface. Drug Disc. Today 2, 436--444 (1997).]]
[14]
Koza J.R. Genetic Programming. The MIT Press, Cambridge, Massachusetts, 1992.]]
[15]
Langdon W. B. and S. J. Barrett, Genetic Programming in data mining for drug discovery, in Evolutionary computing in data mining, p. 211--235, 2004 Springer.]]
[16]
Lipinsky et al., Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings, Adv. Drug Deliv. Rev., 23:3--25, (1997)]]
[17]
Pharma Algorithms, a company active in the field of ADMET predictions. See www.ap-algorithms.com.]]
[18]
Pintore, M., Van de Waterbeemd, H., Piclin, N. & Chréétien, J. R., Prediction of oral bioavailability by adaptive fuzzy partitioning, Eur. J. Med. Chem. 2003, Apr 38(4):427--31.]]
[19]
Rousseeuw, Peter J, Robust regression and outlier detection / Peter J. Rousseeuw, Annick M. Leroy, New York: Wiley, 1987.]]
[20]
Simulation Plus Inc., company that use both statistical methods and differential equations based simulations for ADME parameter estimation. www.simulationsplus.com.]]
[21]
Smola Alex J., Bernhard Scholkopf (1998). A Tutorial on Support Vector Regression. NeuroCOLT2 Technical Report Series - NC2-TR-1998-030.]]
[22]
Todeschini, R. & Consonni, V. Handbook of Molecular Descriptors (Wiley-VCH, Weinheim, 2000).]]
[23]
Topchy A. and W. F. Punch. Faster genetic programming based on local gradient search of numeric leaf values. In L. Spector et al. editors, Proceedings of the Genetic and Evolutionary Computation Conference, GECCO 2001, pages 155--162. San Francisco, CA, 2001. Morgan Kaufmann.]]
[24]
Van de Waterbeemd H. and Eric Gifford, ADMET in silico modeling: towards prediction paradise? Nature Reviews Drug Discovery, MARCH 2003,Vol. 2.]]
[25]
Van de Waterbeemd, H. & Rose, S. In The Practice of Medicinal Chemistry 2nd (ed Wermuth, L. G.) 1367--1385(Academic Press, 2003).]]
[26]
Veber, D. F., Johnson, S. R., Cheng, H. Y., Smith, B. R., Ward, K. W. & Kopple, K. D. Molecular properties that influence the oral bioavailability of drug candidates. J. Med. Chem. 45, 2615--2623 (2002).]]
[27]
Weka, a multi-task machine learning software developed by Waikato University. See www.cs.waikato.ac.nz/ml/weka/.]]
[28]
Yoshida, F. & Topliss, J. G. QSAR model for drug human oral bioavailability, J. Med. Chem. 43, 2575--2585 (2000).]]
[29]
Zupan J, Gasteiger, Neural Networks in chemistry and drug design: an introduction, 2nd edition, Wiley 1999.]]

Cited By

View all
  • (2024)On the Nature of the Phenotype in Tree Genetic ProgrammingProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654129(868-877)Online publication date: 14-Jul-2024
  • (2024)Implementation of Hybrid Bat Algorithm-Ensemble on Human Oral Bioavailability Prediction of Drug Candidate2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS)10.1109/ICETSIS61505.2024.10459424(1663-1667)Online publication date: 28-Jan-2024
  • (2023)Dynamic Grammar Pruning for Program Size Reduction in Symbolic RegressionSN Computer Science10.1007/s42979-023-01840-y4:4Online publication date: 17-May-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '06: Proceedings of the 8th annual conference on Genetic and evolutionary computation
July 2006
2004 pages
ISBN:1595931864
DOI:10.1145/1143997
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 July 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bioavailability
  2. bioinformatics
  3. genetic programming
  4. molecular descriptors

Qualifiers

  • Article

Conference

GECCO06
Sponsor:
GECCO06: Genetic and Evolutionary Computation Conference
July 8 - 12, 2006
Washington, Seattle, USA

Acceptance Rates

GECCO '06 Paper Acceptance Rate 205 of 446 submissions, 46%;
Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)On the Nature of the Phenotype in Tree Genetic ProgrammingProceedings of the Genetic and Evolutionary Computation Conference10.1145/3638529.3654129(868-877)Online publication date: 14-Jul-2024
  • (2024)Implementation of Hybrid Bat Algorithm-Ensemble on Human Oral Bioavailability Prediction of Drug Candidate2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS)10.1109/ICETSIS61505.2024.10459424(1663-1667)Online publication date: 28-Jan-2024
  • (2023)Dynamic Grammar Pruning for Program Size Reduction in Symbolic RegressionSN Computer Science10.1007/s42979-023-01840-y4:4Online publication date: 17-May-2023
  • (2022)The Effect of Multi-Generational Selection in Geometric Semantic Genetic ProgrammingApplied Sciences10.3390/app1210483612:10(4836)Online publication date: 10-May-2022
  • (2022)Automated grammar-based feature selection in symbolic regressionProceedings of the Genetic and Evolutionary Computation Conference10.1145/3512290.3528852(902-910)Online publication date: 8-Jul-2022
  • (2021)Multi-region symbolic regression: combining functions under a multi-objective approachNatural Computing: an international journal10.1007/s11047-021-09851-520:4(753-773)Online publication date: 1-Dec-2021
  • (2020)Rule-centred genetic programming (RCGP): an imperialist competitive approachApplied Intelligence10.1007/s10489-019-01601-650:8(2589-2609)Online publication date: 1-Aug-2020
  • (2018)Comparison of semantic-based local search methods for multiobjective genetic programmingGenetic Programming and Evolvable Machines10.1007/s10710-018-9325-419:4(535-563)Online publication date: 1-Dec-2018
  • (2017)Sensitivity-like analysis for feature selection in genetic programmingProceedings of the Genetic and Evolutionary Computation Conference10.1145/3071178.3071338(401-408)Online publication date: 1-Jul-2017
  • (2017)Geometric semantic genetic programming for biomedical applications: A state of the art upgrade2017 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC.2017.7969311(177-184)Online publication date: 5-Jun-2017
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media