Frequent item set mining
Frequent item set mining is one of the best-known and most popular data mining methods. Originally developed for market basket analysis, it is now used for almost any task that requires discovering regularities between (nominal) variables. This ...
On new emerging concepts of petroleum digital ecosystem
A petroleum-bearing sedimentary basin is an emerging digital ecosystem. The petroleum system and its elements are narrated for every oil and gas field in every petroleum-bearing sedimentary basin. A new concept of ecosystem and its ...
Seeing beyond reading: a survey on visual text analytics
We review recent visualization techniques aimed at supporting tasks that require the analysis of text documents, from approaches targeted at visually summarizing the relevant content of a single document to those aimed at assisting exploratory ...
Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics
The random forest (RF) algorithm by Leo Breiman has become a standard data analysis tool in bioinformatics. It has shown excellent performance in settings where the number of variables is much larger than the number of observations, can cope with ...