Engelbert Mephu Nguifo

Publication Date: Oct 7, 2014

Publication Name: F1000posters

Download (.pdf)

Publication Date: 1993

Publication Date: Dec 17, 2014

Download (.pdf)

Publication Date: 1990

Publication Date: 1993

Publication Name: Http Www Theses Fr

Publication Date: 2000

Publication Name: IEEE Transactions on Learning Technologies

Publication Date: 2007

Publication Date: Aug 25, 2011

Publication Name: F1000posters

Download (.pdf)

Publication Date: 2007

Publication Name: F Egc

Publication Date: 2007

Publication Name: F Egc

In the last few years, the amount of collected data, in various computer science applications, has grown considerably. These large volumes of data need to be analyzed in order to extract useful hidden knowledge. This work focuses on... more

In the last few years, the amount of collected data, in various computer science applications, has grown considerably. These large volumes of data need to be analyzed in order to extract useful hidden knowledge. This work focuses on association rule extraction. This technique is one of the most popular in data mining. Nevertheless, the number of extracted association rules is often very high, and many of them are redundant. In this paper, we propose a new algorithm, called PRINCE. Its main feature is the construction of a partially ordered structure for extracting subsets of association rules, called generic bases. Without loss of information these subsets form representation of the whole association rule set. To reduce the cost of such a construction, the partially ordered structure is built thanks to the minimal generators associated to frequent closed patterns. The closed ones are simultaneously derived with generic bases thanks to a simple bottom-up traversal of the obtained structure. The experimentations we carried out in benchmark and "worst case" contexts showed the efficiency of the proposed algorithm, compared to algorithms like CLOSE, A-CLOSE and TITANIC.

Publication Date: Dec 5, 2013

Download (.pdf)

... {tarek.hamrouni@fst.rnu.tn, hamrouni@cril.univ-artois.fr} ... On the other hand, in many real-life applications like market basket analysis, medical data analysis, social network anal-ysis and bioinformatics, etc., the disjunctive... more

... {tarek.hamrouni@fst.rnu.tn, hamrouni@cril.univ-artois.fr} ... On the other hand, in many real-life applications like market basket analysis, medical data analysis, social network anal-ysis and bioinformatics, etc., the disjunctive connector link-ing items can bring key information as ...

Publication Date: 2010

Publication Name: Twenty Third International Flairs Conference

Research Interests:
Algorithm

Publication Date: 2007

Publication Name: F Egc

Publication Date: 2015

Publication Name: 2015 IEEE Trustcom/BigDataSE/ISPA

Publication Date: Jul 1, 2013

Research Interests:
Computer Graphics, Data Mining, Computational Biology, Biological Sciences, Mathematical Sciences, and 3 moreProteins, Amino Acids, and Protein Conformation

Download (.pdf)

Publication Date: Dec 17, 2014

Download (.pdf)

Publication Date: 2007

Publication Name: Cla

Research Interests:
Data Mining, Neural Network, Concept lattice, Network Architecture, Data mining application, and 2 moreCla and Artificial Neural Network

Download (.pdf)

Publication Date: 2005

Publication Name: Cap

Research Interests:
CAP

Download (.pdf)

One of the most powerful techniques to study protein structures is to look for recurrent fragments (also called substructures or spatial motifs), then use them as patterns to characterize the proteins under study. An emergent trend... more

One of the most powerful techniques to study protein structures is to look for recurrent fragments (also called substructures or spatial motifs), then use them as patterns to characterize the proteins under study. An emergent trend consists in parsing proteins three-dimensional (3D) structures into graphs of amino acids. Hence, the search of recurrent spatial motifs is formulated as a process of frequent subgraph discovery where each subgraph represents a spatial motif. In this scope, several efficient approaches for frequent subgraph discovery have been proposed in the literature. However, the set of discovered frequent subgraphs is too large to be efficiently analyzed and explored in any further process. In this paper, we propose a novel pattern selection approach that shrinks the large number of discovered frequent subgraphs by selecting the representative ones. Existing pattern selection approaches do not exploit the domain knowledge. Yet, in our approach we incorporate the evolutionary information of amino acids defined in the substitution matrices in order to select the representative subgraphs. We show the effectiveness of our approach on a number of real datasets. The results issued from our experiments show that our approach is able to considerably decrease the number of motifs while enhancing their interestingness.

Publication Date: Mar 8, 2013

Download (.pdf)

Publication Date: 2009

Research Interests:
Semantic similarity, Ontology Matching, and Ontology Alignment

Download (.pdf)

Publication Date: 2011

Download (.pdf)

Publication Date: Jun 21, 2012

Download (.pdf)

With the emergence of graph databases, the task of frequent subgraph discovery has been extensively addressed. Although the proposed approaches in the literature have made this task feasible, the number of discovered frequent subgraphs is... more

With the emergence of graph databases, the task of frequent subgraph discovery has been extensively addressed. Although the proposed approaches in the literature have made this task feasible, the number of discovered frequent subgraphs is still very high to be efficiently used in any further exploration. Feature selection for graph data is a way to reduce the high number of frequent subgraphs based on exact or approximate structural similarity. However, current structural similarity strategies are not efficient enough in many real-world applications, besides, the combinatorial nature of graphs makes it computationally very costly. In order to select a smaller yet structurally irredundant set of subgraphs, we propose a novel approach that mines the top-k topological representative subgraphs among the frequent ones. Our approach allows detecting hidden structural similarities that existing approaches are unable to detect such as the density or the diameter of the subgraph. In addition, it can be easily extended using any user defined structural or topological attributes depending on the sought properties. Empirical studies on real and synthetic graph datasets show that our approach is fast and scalable.

Publication Date: Jul 28, 2013

Download (.pdf)

Publication Date: Jul 14, 2011

Publication Name: F1000posters

Download (.pdf)

ABSTRACT

CiteSeerX - Document Details (Isaac Councill, Lee Giles): APRESW workshop represented a meeting point for individuals working on adaptive, personalized and recommender systems for the Social-semantic Web. The main objectives of this... more

CiteSeerX - Document Details (Isaac Councill, Lee Giles): APRESW workshop represented a meeting point for individuals working on adaptive, personalized and recommender systems for the Social-semantic Web. The main objectives of this meeting were to gather state of the art ...

Research Interests:
Biological Sciences, Humans, Female, Animals, Male, and Glycoproteins

Download (.pdf)

Publication Date: 2011

Publication Name: Extraction et Gestion des Connaissances

Formal Concept Analysis "FCA" is a data analysis method which enables to discover hidden knowledge existing in data. A kind of hidden knowledge extracted from data is association rules. Different quality measures were reported... more

Formal Concept Analysis "FCA" is a data analysis method which enables to discover hidden knowledge existing in data. A kind of hidden knowledge extracted from data is association rules. Different quality measures were reported in the literature to extract only relevant association rules. Given a dataset, the choice of a good quality measure remains a challenging task for a user.

Publication Date: 2010

Publication Name: Computing Research Repository

Research Interests:
Formal Concept Analysis, FCA, Knowledge Extraction, Clustering Method, data analysis methods in fMRI, and 4 moreClustering Interestingness Measures, K Means, Association Rule, and Quality Measures

Download (.pdf)

Multistrategy learning (MSL) consists of combining at least two different learning strategies to bring out a powerful system, where the drawbacks of the basic algorithms are avoided. In this scope, instance-based learning (IBL) techniques... more

Multistrategy learning (MSL) consists of combining at least two different learning strategies to bring out a powerful system, where the drawbacks of the basic algorithms are avoided. In this scope, instance-based learning (IBL) techniques are often used as the basic component. However, one of the major drawbacks of IBL is the prototype selection problem which consists in selecting a subset

Publication Date: 1999

Publication Name: Proceedings 11th International Conference on Tools with Artificial Intelligence

Research Interests:
Power System, Instance-based learning, Learning Strategies, Learning System, Standard Ml, and Prototype Selection

Publication Date: 2005

Publication Name: IFIP International Federation for Information Processing

Download (.pdf)

Publication Date: 2004

Publication Name: Extraction et Gestion des Connaissances

The increasing growth of databases raises an urgent need for more accurate methods to better understand the stored data. In this scope, association rules were extensively used for the analysis and the comprehension of huge amounts of... more

The increasing growth of databases raises an urgent need for more accurate methods to better understand the stored data. In this scope, association rules were extensively used for the analysis and the comprehension of huge amounts of data. However, the number of generated rules is too large to be efficiently analyzed and explored in any further process. Association rules selection is a classical topic to address this issue, yet, new innovated approaches are required in order to provide help to decision makers. Hence, many interesting- ness measures have been defined to statistically evaluate and filter the association rules. However, these measures present two major problems. On the one hand, they do not allow eliminating irrelevant rules, on the other hand, their abun- dance leads to the heterogeneity of the evaluation results which leads to confusion in decision making. In this paper, we propose a two-winged approach to select statistically in- teresting and semantically incompara...

Download (.pdf)

Durant ces dernières années, l’utilisation de graphes a fait l’objet de nombreux travaux, notamment en bases de données, apprentissage automatique, bioinformatique et en analyse des réseaux sociaux. La fouille de sous-graphes fréquents... more

Durant ces dernières années, l’utilisation de graphes a fait l’objet de nombreux travaux, notamment en bases de données, apprentissage automatique, bioinformatique et en analyse des réseaux sociaux. La fouille de sous-graphes fréquents constitue un défi majeur dans le contexte de très grandes bases de graphes. Dans ce papier, nous présentons une nouvelle approche basée sur le paradigme MapReduce pour la fouille de sous-graphes fréquents à grande échelle. L’approche proposée offre une nouvelle technique de partitionnement qui tient compte des caractéristiques des données et qui améliore le partitionnement par défaut de MapReduce. L’étude des performances de notre approche réalisée en utilisant un nuage privé a montré son efficacité.

Publication Date: 2007

Publication Name: Extraction et Gestion des Connaissances

Multi-layer neural networks have been successfully applied in a wide range of supervised and unsupervised learning applications. As they often produce incomprehensible models they are not widely used in data mining applications. To avoid... more

Multi-layer neural networks have been successfully applied in a wide range of supervised and unsupervised learning applications. As they often produce incomprehensible models they are not widely used in data mining applications. To avoid such limitations, comprehensive models have been previously introduced making use of an apriori knowl- edge to build the network architecture. They permit to neural network methods

Publication Date: 2007

Publication Name: Concept Lattices and their Applications

Research Interests:
Data Mining, Neural Network, Unsupervised Learning, Concept lattice, Supervised Classification, and 4 moreNetwork Architecture, Data mining application, Cla, and Artificial Neural Network

Download (.pdf)

This paper concerns the use of an object-oriented database for the analysis of protein sequences. We describe proteins either by bibliographic information or by prediction function such as Prosite patterns [2, 5]. We propose to use... more

This paper concerns the use of an object-oriented database for the analysis of protein sequences. We describe proteins either by bibliographic information or by prediction function such as Prosite patterns [2, 5]. We propose to use concept lattices---a tool used in information retrieval to build thesauruses---to classify protein sequences. This classification of proteins may help finding sequence alignments, or discussing about them.

Download (.pdf)

Publication Date: 2009

Research Interests:
Ontology Alignment

Download (.pdf)

Publication Date: 2008

Publication Name: Lecture Notes in Computer Science

Research Interests:
Experimental Study, Information Loss, and Association Rule

Download (.pdf)

Publication Date: 2011

Publication Name: Mathématiques et sciences humaines

Research Interests:
Formal Concept Analysis, Lattice, and Association Rule

Download (.pdf)

This paper describes a new approach to problem solving by splitting up problem component parts between software and hardware. Our main idea arises from the combination of two previously published works. The first one proposed a conceptual... more

This paper describes a new approach to problem solving by splitting up problem component parts between software and hardware. Our main idea arises from the combination of two previously published works. The first one proposed a conceptual environment of concept modelling in which the machine and the human expert interact. The second one reported an algorithm based on reconfigurable hardware system which outperforms any kind of previously published genetic data base scanning hardware or algorithms. Here we show how efficient the interaction between the machine and the expert is when the concept modelling is based on reconfigurable hardware system. Their cooperation is thus achieved with an real time interaction speed. The designed system has been partially applied to the recognition of primate splice junctions sites in genetic sequences.

Publication Date: 1999

Publication Name: Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Research Interests:
Algorithms, Biocomputing, DNA, Software, Computers, and 5 moreHumans, Computer Simulation, Computer User Interface Design, Base Sequence, and Nucleic Acid Conformation

Download (.pdf)

We propose a cooperative conceptual modelling environment in which two agents interact: the machine and the human expert. The former is able to extract knowledge from data using a symbolic-numeric machine learning system, and the latter... more

We propose a cooperative conceptual modelling environment in which two agents interact: the machine and the human expert. The former is able to extract knowledge from data using a symbolic-numeric machine learning system, and the latter is able to control the learning process by accepting and validating the machine results, or by criticizing those results or the explanation that the system produces on them. The improvement of the conceptual modelling relies on the cooperation between the two agents. Results obtained with our method on prediction of primate splice junctions sites in genetic sequences are far better than those reported in the literature with other symbolic machine learning systems, and are as better as those obtained with some artificial neural networks methods reported at present. But in opposite to neural networks which lack of argumentation, our system provides the user a plausible explanation of its prediction.

Publication Date: 1993

Publication Name: Proceedings / ... International Conference on Intelligent Systems for Molecular Biology ; ISMB. International Conference on Intelligent Systems for Molecular Biology

Research Interests:
Algorithms, Artificial Intelligence, Expert Systems, Forecasting, Primates, and 5 moreAnimals, Knowledge Acquisition, Computer User Interface Design, Introns, and Base Sequence

Download (.pdf)

Publication Date: 2007

Publication Name: Lecture Notes in Computer Science

Research Interests:
Association Rule Mining, Profitability, and Association Rule

Download (.pdf)

Publication Date: 2012

Publication Name: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine - BCB '12

Download (.pdf)

Research Interests:
CAP

Download (.pdf)

... 1060 Tunis, Tunisie 62307 Lens Cedex, France {tarek.hamrouni, sadok.benyahia}@fst.rnu.tn mephu@cril.univ-artois.fr ... On the other hand, a dense context has many frequently occurring items and/or strong correlations between several... more

... 1060 Tunis, Tunisie 62307 Lens Cedex, France {tarek.hamrouni, sadok.benyahia}@fst.rnu.tn mephu@cril.univ-artois.fr ... On the other hand, a dense context has many frequently occurring items and/or strong correlations between several items and/or many items in each object. ...

Publication Date: Oct 7, 2014

Publication Name: F1000posters

Publication Date: 1993

Publication Date: Dec 17, 2014

Publication Date: 1990

Publication Date: 1993

Publication Name: Http Www Theses Fr

Publication Date: 2000

Publication Name: IEEE Transactions on Learning Technologies

Publication Date: 2007

Publication Date: Aug 25, 2011

Publication Name: F1000posters

Publication Date: 2007

Publication Name: F Egc

Publication Date: 2007

Publication Name: F Egc

Publication Date: Dec 5, 2013

Publication Date: 2010

Publication Name: Twenty Third International Flairs Conference

Research Interests: Algorithm<div>()</div>

Publication Date: 2007

Publication Name: F Egc

Publication Date: 2015

Publication Name: 2015 IEEE Trustcom/BigDataSE/ISPA

Publication Date: Jul 1, 2013

Publication Date: Dec 17, 2014

Publication Date: 2007

Publication Name: Cla

Publication Date: 2005

Publication Name: Cap

Research Interests: CAP<div>()</div>

Publication Date: Mar 8, 2013

Publication Date: 2009

Research Interests: Semantic similarity, Ontology Matching, and Ontology Alignment<div>()</div>

Publication Date: 2011

Publication Date: Jun 21, 2012

Publication Date: Jul 28, 2013

Publication Date: Jul 14, 2011

Publication Name: F1000posters

Research Interests: Biological Sciences, Humans, Female, Animals, Male, and Glycoproteins<div>()</div>

Publication Date: 2011

Publication Name: Extraction et Gestion des Connaissances

Publication Date: 2010

Publication Name: Computing Research Repository

Publication Date: 1999

Publication Name: Proceedings 11th International Conference on Tools with Artificial Intelligence

Research Interests: Power System, Instance-based learning, Learning Strategies, Learning System, Standard Ml, and Prototype Selection<div>()</div>

Publication Date: 2005

Publication Name: IFIP International Federation for Information Processing

Publication Date: 2004

Publication Name: Extraction et Gestion des Connaissances

Publication Date: 2007

Publication Name: Extraction et Gestion des Connaissances

Publication Date: 2007

Publication Name: Concept Lattices and their Applications

Publication Date: 2009

Research Interests: Ontology Alignment<div>()</div>

Publication Date: 2008

Publication Name: Lecture Notes in Computer Science

Research Interests: Experimental Study, Information Loss, and Association Rule<div>()</div>

Publication Date: 2011

Publication Name: Mathématiques et sciences humaines

Research Interests: Formal Concept Analysis, Lattice, and Association Rule<div>()</div>

Publication Date: 1999

Publication Name: Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Publication Date: 1993

Publication Name: Proceedings / ... International Conference on Intelligent Systems for Molecular Biology ; ISMB. International Conference on Intelligent Systems for Molecular Biology

Publication Date: 2007

Publication Name: Lecture Notes in Computer Science

Research Interests: Association Rule Mining, Profitability, and Association Rule<div>()</div>

Publication Date: 2012

Publication Name: Proceedings of the ACM Conference on Bioinformatics, Computational Biology and Biomedicine - BCB '12

Research Interests: CAP<div>()</div>

Log In

Research Interests:
Algorithm

Research Interests:
CAP

Research Interests:
Semantic similarity, Ontology Matching, and Ontology Alignment

Research Interests:
Biological Sciences, Humans, Female, Animals, Male, and Glycoproteins

Research Interests:
Power System, Instance-based learning, Learning Strategies, Learning System, Standard Ml, and Prototype Selection

Research Interests:
Ontology Alignment

Research Interests:
Experimental Study, Information Loss, and Association Rule

Research Interests:
Formal Concept Analysis, Lattice, and Association Rule

Research Interests:
Association Rule Mining, Profitability, and Association Rule

Research Interests:
CAP