Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1015330.1015447acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

Learning first-order rules from data with multiple parts: applications on mining chemical compound data

Published: 04 July 2004 Publication History

Abstract

Inductive learning of first-order theory based on examples has serious bottleneck in the enormous hypothesis search space needed, making existing learning approaches perform poorly when compared to the propositional approach. Moreover, in order to choose the appropiate candidates, all Inductive Logic Programming (ILP) systems only use quantitive information, e.g. number of examples covered and length of rules, which is insufficient for search space having many similar candidates. This paper introduces a novel approach to improve ILP by incorporating the qualitative information into the search heuristics by focusing only on a kind of data where one instance consists of several parts, as well as relations among parts. This approach aims to find the hypothesis describing each class by using both individual and relational characteristics of parts of examples. This kind of data can be found in various domains, especially in representing chemical compound structure. Each compound is composed of atoms as parts, and bonds as relations between two atoms. We apply the proposed approach for discovering rules describing the activity of compounds from their structures from two real-world datasets: mutagenicity in nitroaromatic compounds and dopamine antagonist compounds. The results were compared to the existing method using ten-fold cross validation, and we found that the proposed method significantly produced more accurate results in prediction.

References

[1]
Chevaleyre, Y., & Zucker, J.-D. (2001). A framework for learning rules from multiple instance data. Proc. 12th European Conf. on Machine Learning (pp. 49--60). Springer.]]
[2]
Dietterich, T. G., Lathrop, R. H., & Lozano-Perez, T. (1997). Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence, 89, 31--71.]]
[3]
Finn, P., Muggleton, S., Page, D., & Srinivasan, A. (1998). Pharmacophore discovery using the inductive logic programming system PROGOL. Machine Learning, 30, 241--270.]]
[4]
Gärtner, T., Flach, P. A., Kowalczyk, A., & Smola, A. J. (2002). Multi-instance kernels. Proc. 19th International Conf. on Machine Learning (pp. 179--186). Morgan Kaufmann.]]
[5]
King, R. D., Sternberg, M. J. E., & Srinivasan, A. (1995). Relating chemical activity to structure: An examination of ILP successes. New Generation Computing, 13, 411--433.]]
[6]
Marchand-Geneste, N., Watson, K. A., Alsberg, B. K., & King, R. D. (2002). New approach to pharmacophore mapping and qsar analysis using inductive logic programming. application to thermolysin inhibitors and glycogen phosphorylase b inhibitors. Journal of Medicinal Chemistry, 45, 399--409.]]
[7]
Maron, O., & Lozano-Péérez, T. (1998). A framework for multiple-instance learning. Advances in Neural Information Processing Systems. The MIT Press.]]
[8]
Muggleton, S. (1995). Inverse entailment and Progol. New Generation Computing, Special issue on Inductive Logic Programming, 13, 245--286.]]
[9]
Okada, T. (2002). Active user's response: Lessons from the structure-activity relationship analysis of dopamine antagonists. Proc. International Workshop on Active Mining (AM-2002 in ICML 2002) (pp. 103--107).]]
[10]
Quinlan, J. R. (1990). Learning logical definitions from relations. Machine Learning, 5, 239--266.]]
[11]
Selman, B., Levesque, H. J., & Mitchell, D. (1992). A new method for solving hard satisfiability problems. Proceedings 10th National Conference on Artificial Intelligence (pp. 440--446).]]
[12]
Srinivasan, A. (2001). The Aleph manual. http://web.comlab.ox.ac.uk/oucl/research/areas-/machlearn/Aleph/.]]
[13]
Srinivasan, A., Muggleton, S., King, R., & Sternberg, M. (1994). Mutagenesis: ILP experiments in a non-determinate biological domain. Proc. 4th International Workshop on Inductive Logic Programming (pp. 217--232).]]
[14]
Srinivasan, A., Muggleton, S., Sternberg, M. J. E., & King, R. D. (1996). Theories for mutagenicity: A study in first-order and feature-based induction. Artificial Intelligence, 85, 277--299.]]

Cited By

View all
  • (2018)Modelling affective-based music compositional intelligence with the aid of ANS analysesKnowledge-Based Systems10.1016/j.knosys.2007.11.01021:3(200-208)Online publication date: 31-Dec-2018
  • (2009)Constructive Adaptive User Interfaces Based on Brain WavesProceedings of the 13th International Conference on Human-Computer Interaction. Part II: Novel Interaction Methods and Techniques10.1007/978-3-642-02577-8_65(596-605)Online publication date: 14-Jul-2009
  • (2008)Modelling Affective-based Music Compositional Intelligence with the Aid of ANS AnalysesResearch and Development in Intelligent Systems XXIV10.1007/978-1-84800-094-0_8(95-108)Online publication date: 2008
  • Show More Cited By
  1. Learning first-order rules from data with multiple parts: applications on mining chemical compound data

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ICML '04: Proceedings of the twenty-first international conference on Machine learning
      July 2004
      934 pages
      ISBN:1581138385
      DOI:10.1145/1015330
      • Conference Chair:
      • Carla Brodley
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 July 2004

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 140 of 548 submissions, 26%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)5
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 16 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Modelling affective-based music compositional intelligence with the aid of ANS analysesKnowledge-Based Systems10.1016/j.knosys.2007.11.01021:3(200-208)Online publication date: 31-Dec-2018
      • (2009)Constructive Adaptive User Interfaces Based on Brain WavesProceedings of the 13th International Conference on Human-Computer Interaction. Part II: Novel Interaction Methods and Techniques10.1007/978-3-642-02577-8_65(596-605)Online publication date: 14-Jul-2009
      • (2008)Modelling Affective-based Music Compositional Intelligence with the Aid of ANS AnalysesResearch and Development in Intelligent Systems XXIV10.1007/978-1-84800-094-0_8(95-108)Online publication date: 2008
      • (2007)Music compositional intelligence with an affective flavorProceedings of the 12th international conference on Intelligent user interfaces10.1145/1216295.1216335(216-224)Online publication date: 28-Jan-2007

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media