Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

KenDTI: An Ensemble Model for Predicting Drug-Target Interaction by Integrating Multi-Source Information

IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1305-1314. doi: 10.1109/TCBB.2021.3074401. Epub 2021 Aug 6.

Abstract

The identification of drug-target interactions (DTIs) is an essential step in the process of drug discovery. As experimental validation suffers from high cost and low success rate, various computational models have been exploited to infer potential DTIs. The performance of DTI prediction depends heavily on the features extracted from drugs and target proteins. The existing predictors vary in input information and each has its own advantages. Therefore, combining the advantages of individual models and generating high-quality representations for drug-target pairs are effective ways to improve the performance of DTI prediction. In this study, we exploit both biochemical characteristics of drugs via network integration and molecular sequences via word embeddings, then we develop an ensemble model, KenDTI, based on two types of methods, i.e., network-based and classification-based. We assess the performance of KenDTI on two large-scale datasets, The experimental results show that KenDTI outperforms the state-of-the-art DTI predictors by a large margin. Moreover, KenDTI is robust against missing data in input networks and lack of prior knowledge. It is able to predict for drug-candidate chemical compounds with scarce information.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Pharmaceutical
  • Databases, Protein
  • Drug Development / methods*
  • Humans
  • Natural Language Processing*
  • Neural Networks, Computer*
  • Pharmaceutical Preparations / chemistry
  • Pharmaceutical Preparations / metabolism
  • Proteins / chemistry
  • Proteins / metabolism
  • Software

Substances

  • Pharmaceutical Preparations
  • Proteins