Abstract
Administering service-oriented architecture (SOA) systems could require sophisticated rules to decide for instance whether to add or remove servers and when. Rule construction often necessitates experts to study patterns that contribute to changes or events. This is a time consuming and error-prone process for complex software systems. In this paper we test the feasibility of automating this process by mining historical data such as past service requests (in time series) and server change events that the administrator committed. We propose a new method to relate frequent patterns in a given time series to changes recorded in the event’s history. We implemented and tested our method on a simulation system for SOA applications. First, we use Euclidean distance, DTW, and FastDTW to identify frequent patterns in a time series that represents performance metric of a SOA simulation system. Then, we calculate the confidence and support of frequent patterns that contribute to changes to identify a set of rules for automating changes. We tested rules that are generated using the proposed method in a training set on a testing set. The average accuracy of generated rules for the change event “remove” exceeded 80% in our experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lin, J., Keogh, E., Patel, P., Lonardi, S.: Finding motifs in time series. In: 8th ACM International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 53–68 (2002)
Arita, D., Yoshimatsu, H., Taniguchi, R.: Frequent motion pattern extraction for motion recognition in real-time human proxy. In: JSAI Workshop on Conversational Informatics, pp. 25–30 (2005)
Guyet, T., Garbay, C., Dojat, M.: Knowledge construction from time series data using a collaborative exploration system. Journal of Biomedical Informatics 40, 672–687 (2007)
Androulakis, I.P., Wu, J., Vitolo, J., Roth, C.: Selecting maximally informative genes to enable temporal expression profiling analysis. FoSystems Biology in Engineering (2005)
McGovern, A., Rosendahl, D., Kruger, A., Beaton, M., Brown, R., Droegemeier, K.: Understanding the formation of tornadoes through data mining. In: 5th Conference on Artificial Intelligence and its Applications to Environmental Sciences at the American Meteorological Society (2007)
Keogh, E., Kasetty, S.: On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 102–111 (2002)
Post, A.R., Harrison, J.H.: Temporal Data Mining. Clinics in Laboratory Medicine 28, 83–100 (2008)
Morchen, F.: Time Series Knowledge Mining, Dissertation: Philipps-University Marburg, Germany (2006)
Salvador, S., Chan, P.: FastDTW: Toward accurate dynamic time warping in linear time and space. In: 3rd Workshop on Mining Temporal and Sequential Data, ACM KDD 2004, Seattle, Washington, USA (2004)
Kruskall, J.B., Liberman, M.: The Symmetric Time Warping Problem: From Continuous to Discrete. In: Time Warps, String Edits and Macromolecules: The Theory and Practice of Sequence Comparison, pp. 125–161. Addison-Wesley Publishing Co., Reading (1983)
Smit, M., Nisbet, A., Stroulia, E., Iszlai, G., Edgar, A.: Toward a simulation-generated knowledge base of service performance. In: 4th International Workshop on Middleware for Service Oriented Computing, New York, USA, pp. 19–24 (2009)
Rockwell, G.: Tapor: Building a portal for text analysis. In: Siemens, R., Moorman, D. (eds.) Mind Technologies, Humanities Computing and the Canadian Academic Community, pp. 285–299. University of Calgary Press, Calgary (2006)
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions Acoustics, Speech, and Signal Processing 26 (1978)
Itakura, S.: Minimum Prediction Residual Principle Applied to Speech Recognition. IEEE Transactions Acoustics, Speech, and Signal 23, 5–72 (1975)
Keogh, E., Pazzani, M.: Scaling up Dynamic Time Warping for Datamining Applications. In: 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, Massachuseetts, pp. 285–289 (2000)
Kim, S., Park, S., Chu, W.: An Index-based Approach for Similarity Search Supporting Time Warping in Large Sequence Databases. In: 17th International Conference on Data Engineering, Heidelberg, Germany, pp. 607–614 (2001)
Agrawal, R., Psaila, G., Wimmers, E.L., Zait, M.: Querying Shapes of Histories. In: 21st International Conference on Very Large Databases, Zurich, Switzerland, pp. 502–514 (1995)
Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. In: Proceedings of ACM SIGMOD Conference on Management of Data, Santa Barbara, CA, USA, pp. 151–162 (2001)
Kalpakis, K., Gada, D., Puttagunta, V.: Distance Measures for Effective Clustering of ARIMA Time Series. In: Proceedings of the 2001 IEEE International Conference on Data Mining, San Jose, CA, pp. 273–280 (2001)
Fayyad, U., Reina, C., Bradley, P.: Initialization of Iterative Refinement Clustering Algorithms. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, New York, NY, pp. 194–198 (1998)
Hetland, L., Saetrom, P.: Evolutionary rule mining in time series databases. Journal of Machine Learning 58, 107–125 (2005)
Freitas, A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, Heidelberg (2002)
Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of frequent episodes in event sequences. Journal of Data Mining and Knowledge Discovery 1, 259–289 (1997)
Höppner, F., Klawonn, F.: Finding informative rules in interval sequences. In: Hoffmann, F., Adams, N., Fisher, D., Guimarães, G., Hand, D.J. (eds.) IDA 2001. LNCS, vol. 2189, pp. 125–134. Springer, Heidelberg (2001)
Weiss, G.M., Hirsh, H.: Learning to predict rare events in event sequences. In: Agrawal, R., Stolorz, P., Piatetsky-Shapiro, G. (eds.) 4th International Conference on Knowledge Discovery and Data Mining (KDD), Menlo Park, USA, pp. 359–363 (1998)
Hetland, L., Saetrom, P.: Temporal rule discovery using genetic programming and specialized hardware. In: 4th International Conference on Recent Advances in Soft Computing (RASC), pp. 182–188 (2002)
Hipp, J., Guntzer, U., Nakhaeizadeh, G.: Algorithms for association rule mining – a general survey and comparison. SIGKDD Explorations 2, 58–64 (2000)
Agrawal, R., Srikant, R.: Mining sequential patterns. In: ICDE, Taiwan (1995)
Lo, D., Khoo, S.C., Liu, C.: Efficient mining of iterative patterns for software specification discovery. In: 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), San Jose, USA, pp. 460–469 (2007)
Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of frequent episodes in event sequences. Journal of Data Mining and Knowledge Discovery (DMKD) 1, 259–289 (1997)
Lo, D., Cheng, H., Han, J., Khoo, S.C.: Classification of software behaviors for failure detection: A discriminative pattern mining approach. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 557–566 (2009)
Hatonen, K., Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H.: Knowledge discovery from telecomunication network alarm databases. In: 12th International Conference of Data Engineering, ICDE (1996)
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 307–328. AAAI Press, MIT Press, Menlo Park, Cambridge (1996)
Savasere, Omiecinski, E., Navathe, S.: An efficient algorithm for mining association rules in large databases. In: VLDB Conference, Zurich, Switzerland (1995)
Zaki, M.J., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: Proceedings of 3rd International Conference on Knowledge Discovery and Data Mining (1997)
Keogh, E., Lonardi, S., Chiu, W.: Finding Surprising Patterns in a Time Series Database in Linear Time and Space. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, pp. 550–556 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Golmohammadi, K., Smit, M., Zaiane, O.R. (2011). Learning Actions in Complex Software Systems. In: Cuzzocrea, A., Dayal, U. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2011. Lecture Notes in Computer Science, vol 6862. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23544-3_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-23544-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23543-6
Online ISBN: 978-3-642-23544-3
eBook Packages: Computer ScienceComputer Science (R0)