Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
A Novel Approach on Designing Augmented Fuzzy Cognitive Maps Using Fuzzified Decision Trees Elpiniki I. Papageorgiou Department of Informatics & Computer Technology, Technological Educational Institute of Lamia, 3rd Km PEO Lamia-Athens, 35100 Lamia, Greece epapageorgiou@teilam.gr Abstract. This paper proposes a new methodology for designing Fuzzy Cognitive Maps using crisp decision trees that have been fuzzified. Fuzzy cognitive map is a knowledge-based technique that works as an artificial cognitive network inheriting the main aspects of cognitive maps and artificial neural networks. Decision trees, in the other hand, are well known intelligent techniques that extract rules from both symbolic and numeric data. Fuzzy theoretical techniques are used to fuzzify crisp decision trees in order to soften decision boundaries at decision nodes inherent in this type of trees. Comparisons between crisp decision trees and the fuzzified decision trees suggest that the later fuzzy tree is significantly more robust and produces a more balanced decision making. The approach proposed in this paper could incorporate any type of fuzzy decision trees. Through this methodology, new linguistic weights were determined in FCM model, thus producing augmented FCM tool. The framework is consisted of a new fuzzy algorithm to generate linguistic weights that describe the cause-effect relationships among the concepts of the FCM model, from induced fuzzy decision trees. 1 Introduction Nowadays, the knowledge acquisition and representation constitutes a major knowledge engineering bottleneck. A large number of techniques in the field of artificial intelligence used to represent knowledge: production rules, decision trees, rule-based architectures semantic nets, frameworks, fuzzy logic, causal cognitive maps, among others. The decision trees gained popularity because of their conceptual transparency. The well-developed design methodology comes with efficient design techniques supporting their construction, cf. [1-3]. The decision trees generated by these methods were found useful in building knowledge-based expert systems. Due to the character of continuous attributes as well as various facets of uncertainty one has to take into consideration, there has been a visible trend to cope with the factor of fuzziness when carrying out learning from examples in the case of tree induction. In a nutshell, this trend gave rise to the name of fuzzy decision trees and has resulted in a series of development alternatives; cf. [4-6]. The incorporation of fuzzy sets [7-10] into decision trees enables us to combine the uncertainty handling and approximate reasoning capabilities of the former with the comprehensibility and ease of application of the latter. Fuzzy decision trees [10,11] assume that all domain attributes Z. Cai et al. (Eds.): ISICA 2009, CCIS 51, pp. 266–275, 2009. © Springer-Verlag Berlin Heidelberg 2009 A Novel Approach on Designing Augmented Fuzzy Cognitive Maps 267 or linguistic variables have pre-defined fuzzy terms for each fuzzy attribute. Those could be determined in a data driven manner. The information gain measure, used for splitting a node, is modified for fuzzy representation and a pattern can have nonzero degree of matching to one or more leaves [12,13]. Fuzzy logic and causal cognitive maps, in the other hand, are some of the main topics of artificial intelligence on representation of knowledge and approximation of reasoning with uncertainty [14]. The choice of a particular technique is based on two main factors: the nature of the application and the user’s skills. The fuzzy logic theory, based on representation of knowledge and approximation of reasoning with uncertainty, is very close to the expert’s reasoning, and it is well known as artificial intelligence-based method, especially in the field of medical decision making. An outcome of this theory is fuzzy cognitive maps [15,16]. Fuzzy cognitive maps are diagrams used as causal representations between knowledge/data to represent events relations. They are modeling methods based on knowledge and experience for describing particular domains using concepts (variables, states, inputs, outputs) and the relationships between them. FCM can describe any system using a model having signed causality (that indicates positive or negative relationship), strengths of the causal relationships (that take fuzzy values), and causal links that are dynamic (i.e. the effect of a change in one concept/node affects other nodes, which in turn may affect other nodes). Most decision tree induction methods used for extracting knowledge in classification problems do not deal with cognitive uncertainties such as vagueness and ambiguity associated with human thinking and perception. Fuzzy decision trees represent classification knowledge more naturally to the way of human thinking and are more robust in tolerating imprecise, conflict, and missing information. In this work, a new algorithm for constructing fuzzy cognitive maps by using pregenerated fuzzy decision trees is proposed. The methodology is partly data driven and knowledge driven so some expert knowledge of the domain is required. The fuzzy decision tree approach is used to implement the fuzzy algorithmic methodology in order to assign new linguistic weights among the FCM nodes as well as new paths between FCM nodes that enhance their structure and improve their operational ability to handle with complex modeling processes. This naturally enhances the representative power of FCMs with the knowledge component inherent in fuzzy decision trees rule induction. 2 Main aspects of fuzzy decision trees Fuzzy decision trees are an extension of the classical artificial intelligence concept of decision trees. The main fundamental difference between fuzzy and crisp trees is that with fuzzy trees, gradual transitions exist between attribute values [7]. The reasoning process within the tree allows all rules to be fired to some degree, with the final crisp classification being the result of combining all membership grades. Recent approaches to developing such trees were through modifications to the ID3 algorithm [3,5,6,8,18]. Sison and Chong [3] proposed a fuzzy version of ID3 which automatically generated a fuzzy rule base for a plant controller from a set of input– 268 E.I. Papageorgiou output data. Umano et al. [5] also proposed a new fuzzy ID3 algorithm. This algorithm generates an understandable fuzzy decision tree using fuzzy sets defined by the user. The fuzzy tree methodologies proposed by [3,5] require the data to have been pre-fuzzified before the fuzzy decision trees are induced. More recent work by Janikow involves the induction of fuzzy decision trees directly from data sets by the FID algorithm [10,11]. The [10] takes a detailed introduction about the non fuzzy rules and the different kind of fuzzy rules. In this point it is essential to refer that the data (real values) are partitioned into fuzzy sets by experts. This approach consists on the following steps: Step 1: A fuzzy clustering algorithm is used for input domain partition. The supervised method takes into account the class labels during the clustering. Therefore the resulted partitions, the fuzzy membership functions (fuzzy sets) represent not only the distribution of data, but the distribution of the classes too. Step 2: During a pre-pruning method the resulted partitions could analyze and combine the unduly overlapped fuzzy sets. Step 3: The results of the pre-pruning step are input parameters (beside data) for the tree induction algorithm. The applied tree induction method is the FID (Fuzzy Induction on Decision Tree) algorithm by C. Z. Janikow. Step 4: The fuzzy ID3 is used to extract rules which are then used for generating fuzzy rule base. Step 5: While the FID algorithm could generate larger and complex decision tree as it is necessary, therefore a post pruning method is applied. The rule which yields the maximal fulfillment degree in the least number of cases is deleted. This method provides compact fuzzy rule base that can be used for building FCMDSS. 2.1 Fuzzy Cognitive Mapping causal algebra Fuzzy cognitive maps are an intelligent modeling methodology for complex decision systems, which originated from the combination of Fuzzy Logic and Neural Networks [14]. An FCM describes the behavior of an intelligent system in terms of concepts; each concept represents an entity, a state, a variable, or a characteristic of the system [15]. FCM nodes are named by such concepts forming the set of concepts C = {C1,C2, . . . ,Cn}. Arcs (Cj,Ci) are oriented and represent causal links between concepts; that is how concept Cj causes concept Ci Weights of arcs are associated with a weight value matrix Wn·n, where each element of the matrix wji taking values in [-1, . . .,1] . Kosko has developed a fuzzy causal algebra that describes the causal propagation and combination of concepts in an FCM. The algebra depends only on the partial ordering P, the range set of the fuzzy causal edge function e, and on general fuzzy-graph properties (e.g., path connectivity). Kosko notes that this algebra can be used on any digraph knowledge representation scheme. A causal path from some concept node Ci to concept node Cj, say Ci--~Ck1, Ckl-~… Ckn, Ckn --~Cj, can be indicated by the sequence (i, k, . . . . . kn,j). Then the indirect effect of Ci on Cj is the causality C~I imparts to Cj via the path (i, kl . . . . . kn,j). The total effect of Ci on Cj is the composite of all the indirect-effect causalities A Novel Approach on Designing Augmented Fuzzy Cognitive Maps 269 C~ imparts to Cj. If there is only one causal path from Ci to Cj, the total effect C~ imparts to Cj reduces to the indirect effect. The indeterminacy can be removed with a numeric weighting scheme. A fuzzy causal algebra, and hence FCMs, bypasses the knowledge acquisition processing tradeoff. weak C1 C2 strong v.strong C3 weak v.strong C4 medium C5 Fig. 1. A cognitive map with fuzzy labels at the edges A simple fuzzy causal algebra is created by interpreting the indirect effect operator I as the minimum operator (min) and the total effect operator T as the maximum operator (max) on a partially ordered set P of causal values. Formally, let ~ be a causal concept space, and let e: ~ × ~ P be a fuzzy causal edge function, and assume that there are m-many causal paths from Ci to Cj: (i, k~ ..... k~, j) for 1 ~< r ~< m. Then let Ir(Ci, Cj) denote the indirect effect of concept Ci on concept Cj via the rth causal path, and let T( i, Cj) denote the total effect of Ci on Cj over all m causal paths. Then I~(Ci,Cj)=min{e(Cp,Cp+,):(p,p+ 1) ~ (i,k~ . . . . . k,~,j) } T(Ci,Cj)= max( Ir(Ci,Cj)) , where l <~r<~m where p and p + 1 are contiguous left-to right path indices. The indirect effect operation amounts to specifying the weakest causal link in a path and the total effect operation amounts to specifying the strongest of the weakest links. For example, suppose the causal values are given by P = {none, weak, medium, strong, v.strong} and the FCM is defined as in Figure 1. There are three causal paths from C1 to C5: (C1, C3, C5) , (C1, C3, C4, C5) , (C1, C2, C4, C5). The three indirect effects of C1 on C5 are: I1(C1,C5) = min {el3 , e35 ) = min {strong, v.strong} = strong 12 (C1 ,C5) = min{e13,e34,e45}= weak, I3(C1,C5) = min{e12,e24,e45}= weak. Hence, the total effect of C 1 on C5 is: T(C1,C5) = max {I1,(C1,C5), I2(C1,C5), I3(C1,C5) } = max {strong, weak, weak} = strong. In words, C1 can be said to impart strong causality to C5. 270 E.I. Papageorgiou 3 Novel Approach on Designing Augmented Fuzzy Cognitive Maps There is a necessity to develop a framework extracting fuzzy interconnections among attributes from available data using knowledge extraction techniques and then insert these fuzzy linguistic interconnections to restructure the fuzzy cognitive map model producing a new augmented FCM tool for medical decision making. The framework can incorporate any decision tree algorithm, but for the purpose of this work C4.5 has been chosen as it is a well-known and well-tested decision tree induction algorithm for classification problems [17]. As it has already been stated, the central idea of the proposed method is to combine a fuzzy decision tree to extract the available knowledge of data and to generate fuzzy linguistic weights. The resulted fuzzy relationships among leaf nodes are applied to restructure the FCM model. Among the different fuzzy inference techniques we selected for our approach the Zadeh’s union and intersection parameters. The derived FCM model is subsequently trained using an unsupervised learning algorithm to achieve improved decision accuracy. The inference algorithm of FCMs remains the same and only the weight matrix multiplied with previous concept values was changed. Figure 2 illustrates the proposed framework with the corresponding steps and final decision. The algorithmic approach for the restructure of FCM using fuzzy decision trees is consisting of the following steps: Step 1: For all the M experts, set credibility weight bk = 1 Step 2: Each of the M experts is asked to suggest and describe each of the N concepts that comprise the FCM. Step 3: For all the ordered pair of concepts (Ci and Cj) each kth of the M experts is asked to make the following statement (using an if-then rule): IF the value of concept Ci {increases, decreases,is stable} THEN causes value of concept Cj to {increase, decrease, nothing} THUS the influence of concept Ci on concept Cj is T(influence) Through this step a number of linguistic weights have been assigned by experts. Step 4: If quantitative data (numeric or symbolic) are available, the approach of using fuzzified crisp decision trees (presented in above section 2.1) is implemented into the data set to derive the available structure of fuzzy decision trees and the fuzzy labels in the branches Di. Step 5: From the created fuzzy decision trees, a number of causal paths among the branches i, connecting leaf nodes Di to Dj, is determined. These causal paths transferred in FCM model as causal paths interconnecting concepts Ci to Cj, through a number of direct positive relationships. Step 6: Using the fuzzy causal algebra, an indirect effect operator I used as the minimum operator (min) on an ordered set P of causal values. The simple fuzzy causal algebra is created by interpreting the indirect effect operator I as the minimum operator (min) on the set P of fuzzy values, corresponding to the above designed causal paths among the FCM concepts. Then the max operator T is applied to the resulted effect operators I, and a new linguistic weight produced among Ci and Cj. The overall linguistic weight is the sum of the path products. Thus a new linguistic weight is assigned between the concepts Ci and Cj. Step 7: IF for one interconnection between the concepts Ci and Cj, more than 3M/4 different linguistic weights are suggested THEN ask experts to reassign weights for this particular interconnection and go to step 3. A Novel Approach on Designing Augmented Fuzzy Cognitive Maps 271 Fig. 2. Framework for constructing augmented FCMs by complementary use of fuzzy decision trees Step 8: Aggregate all the linguistic weights proposed for every interconnection using the SUM method where the membership function μ suggested by kth expert is multiplied by the corresponding credibility weight bk. Use the COG defuzzification method to calculate the numerical weight Wij for every interconnection. Step 9: IF there is an ordered concept pair not examined go to step 3, ELSE construct the weight matrix W whose are the defuzzified weights Wij END. Using the above algorithm, someone could use fuzzy decision trees to pass available knowledge into FCM reconstructed them by paths. Experts construct fuzzy sets and fuzzy membership functions for each problem and these fuzzy sets are used into the fuzzy decision tree algorithm due to compatibility reasons. This happens in the case of FCMs to derive the respective linguistic variables and then make the necessary comparisons. The causal paths of the leaf nodes used to determine new causal paths in the FCM model. Thus the FCM model was augmented as new direct and indirect relationships among concepts determined. 272 E.I. Papageorgiou low C1 C3 med low C2 low med C5 C4 med v.low v.low C6 high C8 low C7 C8:high C8:low Fig. 3. Example FCM model with initial linguistic labels on interconnections (weights) 4 An illustrative generic example An illustrative example, of FCM model with eight concepts and eleven interconnections among concepts, with fuzzy labels at the edges of connections, is presented in Figure 3. This initial FCM will be restructured using the proposed methodology and the available knowledge from fuzzified decision trees. Only for implementation reason, we consider that using the fuzzified decision trees on the available data which have been fuzzified, the following tree is produced in Figure 4. The produced tree has a number of three paths for C1 to C8, two paths for C2 to C8, and one path of each one of the other concepts to C8, thus defining new interconnections and/or update the initial ones of the FCM model. C1 low high C3 C2 med med low C5 C4 v.low low C6 high C8:low med C7 C8:hig Fig. 4. Example Fuzzy decision tree induced from the data showing membership grades at each branch Here, the causal effect of C1 to C8 is determined by taking the minimum of the attached labels of the individual paths. Let I1, I2 and I3 denote the effect of C1 to C8 through the paths 1 to 3 respectively, and eij be the label attached with edge from node ith to node jth. Then, to determine the total effect of C1 to C8, we take the maximum of A Novel Approach on Designing Augmented Fuzzy Cognitive Maps 273 paths I1 through I3 causal paths. Path 1 from C1 to C8: c1Æc3Æ c6Æ c8 I1(C1 to C8)=min(low, med, high)=low Path 2 from C1 to C8:c1Æc2Æ c5Æ c7 Æc8 I2(C1 to C8)=min(high, low, v.low, med)=v.low Path 3 from C1 to C8: c1Æc2Æ c4 Æc8 I3(C1 to C8)=min(high, med, low)=low Thus total effect of C1 to C8, denoted by T(C1,C8) is computed below: T(c1,c8)=max{I1,I2,I3}=max{low,v. low,low}=low In words, c1 imparts low causality to c8. To determine the total effect of C2 to C8, we take the maximum of paths I4 through I5. Path 4 from C2 to C8: c2Æ c5Æ c7 Æc8 I4(C2 to C8)=min(low, v.low, med)=v. low Path 5 from C2 to C8: c2Æ c4 Æc8 I5(C2 to C8)=min(med, low)=low Thus total effect of C2 to C8, denoted by T(C2,C8) is: T(c2,c8)=max{I4,I5}=max{low, v. low}=low In words, c2 imparts low causality to c8. Path 6 from C6 to C8: c6 Æc8: I6=high To determine the total effect of C6 to C8, we take the maximum of path I6. In words, c6 imparts high causality to c8. Path 7 from C4 to C8: c4 Æc8: I7=low The total effect of C4 to C8 is determined by taking the maximum of path I7. In words, c4 imparts low causality to c8. To determine the total effect of C5 to C8, we take the maximum of path I8. Path 8 from C5 to C8: C5Æ C7 ÆC8: I8(C5 to C8)=min(v.low, med)=v.low Thus total effect of C5 to C8, denoted by T(C5,C8) is computed: T(C5,C8)=max{I8}=v. low In words, C5 imparts v.low causality to C8. Summarizing, new causal paths describing the interconnections among concepts as well as some interconnections have updated their initial values due to the above paths. After the implementation of the investigating methodology, the FCM model was restructured and a new FCM model was produced illustrated in Figure 5. Where each branch has fuzzy labels, fuzzy values derived from corresponding fuzzy sets as they have been initially prescribed by experts. Some of the important points in the proposed approach are: ¾ Each attribute is represented by a fuzzy set. ¾ All branches will fire to some degree. ¾ Multiple input-single output fuzzy if-then rules. ¾ Each case passes through the tree fires all rules to some degree. 274 E.I. Papageorgiou low C1 C3 med low C2 low low med C5 C4 low v.low low C6 high C8 med C7 high low Fig. 5. The new restructured FCM model using the proposed framework Some of the limitations of the proposed approach are the way of data fuzzification which has to be done automatically from data and without the experts’ intervention. The proposed algorithm and the methodology for constructing FCMs using complementary the fuzzy decision trees as knowledge extraction methods can be used for decision making tasks especially in the medical field. In medical decision making there is enough knowledge hidden in data and the experts-physicians have difficulty to recognize and suggest this knowledge. Thus though the complementary use of fuzzy decision trees as knowledge extraction algorithm and the knowledge representation model of FCMs, an advanced decision making tool with sufficient accuracy and interpretability can be produced. This tool keeps the advantages of FCMs and FDTs coming to a promising task. 5 Conclusion In this study, it was shown the role of the fuzzy decision tree framework in the design and analysis of augmented fuzzy cognitive maps. We discussed the role of the fuzzy decision tree in the determination of fuzzy linguistic weights and causal paths of FCM. It was stressed that this technique takes advantage of the available experimental data. We proposed a detailed design algorithm producing augmented FCMs that offer a comprehensive interpretation of the cognitive model. In particular, the formalism of fuzzified crisp decision trees helped us come up with endowing the medical decision making results with meaningful models. A Novel Approach on Designing Augmented Fuzzy Cognitive Maps 275 Acknowledgment The research was supported in part by the European Commission’s Seventh Framework Information Society Technologies (IST) Programme, Unit ICT for Health, project DEBUGIT (no. 217139). References 1. J. Quinlan, Induction of Decision Trees, Machine Learning, vol. 1, Kluwer Academic Press, Dordrecht, 1986 pp. 81–106. 2. S. Sestino, T. Dillon, Using single-layered neural networks for the extraction of conjunctive rules and hierarchical classifications, J. Appl. Intell. 1 (1991) 157–173. 3. L. Sison, E. Chong, Fuzzy modeling by induction and pruning of decision trees, IEEE Symposium on Intelligent Control U.S.A., 1994, pp. 166–171. 4. S.Mitra, K.M. Konwar, K.P. Sankar, “Fuzzy decision tree, linguistic rules and fuzzy knowledge-based network: generation and evaluation”, IEEE Trans. Syst. Man Cybern. Part C: Appl. Rev. 32 (4) (2002) 328–339. 5. M. Umano, H. Okamoto, I. Hatono, H. Tamura, Generation of fuzzy decision trees by fuzzy ID3 algorithm and its application to diagnosis by gas in oil, Japan–U.S.A. Symposium, 1994, pp. 1445–1450. 6. C.W. Olaru, A complete fuzzy decision tree technique, Fuzzy Sets and Systems 138 (2003) 221–254. 7. W. Pedrycz, A. Sosnowski, Designing decision trees with the use of fuzzy granulation, IEEE Trans. Syst. Man Cybern. A 30, (2000) 151–159. 8. K. Crockett, Z. Bandar, D. Mclean, J. O’Shea, On constructing a fuzzy inference framework using crisp decision trees, Fuzzy Sets and Systems 157 (2006) 2809 – 2832. 9. H. Ishibuchi, K. Nozaki, N. Yamamoto, N. Tanaka, Selecting fuzzy if–then rules for classification problems using genetic algorithms, IEEE Trans. Fuzzy Systems 3 (3) (1995) 260–270. 10. C.Z. Janikow, Fuzzy decision trees: issues and methods, IEEE Trans. Systems Man and Cybernetics, 28 (1) (1998) 1–14. 11. C.Z. Janikow, Fuzzy partitioning with FID3.1, Proceedings of the 18th International Conference of the North American Fuzzy Information Society, 1999, pp. 467–471. 12. Y. Yuan, M.J. Shaw, Induction of fuzzy decision trees, Fuzzy Sets Systems 69 (1995) 125– 139. 13. R. Weber, Fuzzy ID3: a class of methods for automatic knowledge acquisition, Second International Conference on Fuzzy Logic and Neural Networks, Iizuka, Japan, 1992, pp. 265–268. 14. B. Kosko, 1992. Neural Networks and Fuzzy Systems: A Dynamical Systems Approach to Machine Intelligence. Prentice-Hall, New Jersey. 15. Kosko, B.: Fuzzy Cognitive Maps, International Journal of Man-Machine Studies, vol. 24, pp. 65-75, 1986. 16. Papageorgiou E.I., Stylios C.D., Groumpos P.: An Integrated Two-Level Hierarchical Decision Making System based on Fuzzy Cognitive Maps (FCMs), IEEE Trans Biomed Engin, 2003; 50(12): 1326-1339. 17. J.R. Quinlan, Is C5.0 better than C4.5, Available: http://www.rulequest.com/see5comparison.html, 2002. 18. I. Hayashi, T. Maeda, A. Bastian, L.C. Jain, Generation of fuzzy decision trees by fuzzy ID3 with adjusting mechanism of and/or operators, in: Proc. Int. Conf. Fuzzy Syst., 1998, 681– 685.