Abstract
We present the concept of a functional programming language called VML (View Modeling Language), providing facilities to increase the efficiency of the iterative, trial-and-error cycle which frequently appears in any knowledge discovery process. In VML, functions can be specified so that returning values implicitly “remember”, with a special internal representation, that it was calculated from the corresponding function. VML also provides facilities for “matching” the remembered representation so that one can easily obtain, from a given value, the functions and/or parameters used to create the value. Further, we describe, as VML programs, successful knowledge discovery tasks which we have actually experienced in the biological domain, and argue that computational knowledge discovery experiments can be efficiently developed and conducted using this language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
H. Bannai, Y. Tamada, O. Maruyama, K. Nakai, and S. Miyano. Views: Fundamental building blocks in the process of knowledge discovery. In Proceedings of the 14th International FLAIRS Conference, pages 233–238. AAAI Press, 2001.
P. Cheeseman and J. Stutz. Bayesian classification (AutoClass): Theory and results. In Advances in Knowledge Discoveryand Data Mining. AAAI Press/MIT Press, 1996.
O. Emanuelsson, H. Nielsen, S. Brunak, and G. von Heijne. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol., 300(4):1005–1016, July 2000.
U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. From data mining to knowledge discovery in databases. AI Magazine, 17(3):37–54, 1996.
J. D. Helmann. Compilation and analysis of bacillus subtilis σA-dependent promoter sequences: evidence for extended contact between RNA polymerase and upstream promoter DNA. Nucleic Acids Res., 23(13):2351–2360, 1995.
J. Hughes. Why functional programming matters. Computer Journal, 32(2):98–107, 1989.
S. Kawashima and M. Kanehisa. AAindex: Amino Acid index database. Nucleic Acids Res., 28(1):374, 2000.
T. Khabaza and C. Shearer. Data mining with Clementine. IEE Colloquium on ‘Knowledge Discovery in Databases’, 1995. IEE Digest No. 1995/021(B), London.
P. Langley. The computer-aided discovery of scientific knowledge. In Lecture Notes in Artificial Intelligence, volume 1532, pages 25–39, 1998.
P. Langley and H. A. Simon. Applications of machine learning and rule induction. Communications of the ACM, 38(11):54–64, 1995.
X. Liu, D. L. Brutlag, and J. S. Liu. BioProspector: Discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes. In Pacific Symposium on Biocomputing 2001, volume 6, pages 127–138, 2001.
O. Maruyama and S. Miyano. Design aspects of discovery systems. IEICE Transactions on Information and Systems, E83–D:61–70, 2000.
O. Maruyama, T. Uchida, T. Shoudai, and S. Miyano. Toward genomic hypothesis creator: View designer for discovery. In Discovery Science, volume 1532 of Lecture Notes in Artificial Intelligence, pages 105–116, 1998.
O. Maruyama, T. Uchida, K. L. Sim, and S. Miyano. Designing views in HypothesisCreator: System for assisting in discovery. In Discovery Sciene, volume 1721 of Lecture Notes in Artificial Intelligence, pages 115–127, 1999.
B. W. Matthews. Comparison of predicted and observed secondary structure of t4 phage lysozyme. Biochim. Biophys. Acta, 405:442–451, 1975.
R. Milner, M. Tofte, R. Harper, and D. MacQueen. The Definition of Standard ML (Revised). MIT Press, 1997.
K. Nakai. Protein sorting signals and prediction of subcellular localization. In P. Bork, editor, Analysis of Amino Acid Sequences, volume 54 of Advances in Protein Chemistry, pages 277–344. Academic Press, San Diego, 2000.
J. Quinlan. Induction of decision trees. Machine Learning 1, 1:81–106, 1986.
S. Shimozono. Alphabet indexing for approximating features of symbols. Theor. Comput. Sci., 210:245–260, 1999.
E. Sumii and H. Bannai. VMlambda: A functional calculus for scientific discovery. http://www.yl.is.s.u-tokyo.ac.jp/~sumii/pub/, 2001.
S. Wrobel, D. Wettschereck, E. Sommer, and W. Emde. Extensibility in data mining systems. In Proceedings of the 2nd International Conference on Knowledge Discoveryand Data Mining (KDD-96), pages 214–219, 1996.
S. Wu and U. Manber. Fast text searching allowing errors. Commun. ACM, 35:83–91, 1992.
Camlp4-http://caml.inria.fr/camlp4/.
GenBank-http://www.ncbi.nlm.nih.gov/Genbank.
HypothesisCreator-http://www.hypothesiscreator.net/.
Objective Caml-http://caml.inria.fr/ocaml/.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bannai, H., Tamada, Y., Maruyama, O., Miyano, S. (2001). VML: A View Modeling Language for Computational Knowledge Discovery. In: Jantke, K.P., Shinohara, A. (eds) Discovery Science. DS 2001. Lecture Notes in Computer Science(), vol 2226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45650-3_6
Download citation
DOI: https://doi.org/10.1007/3-540-45650-3_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42956-2
Online ISBN: 978-3-540-45650-6
eBook Packages: Springer Book Archive