Review of statistical network analysis: models, algorithms, and software
The analysis of network data is an area that is rapidly growing, both within and outside of the discipline of statistics.
This review provides a concise summary of methods and models used in the statistical analysis of network data, including the Erdős–...
Effective graph classification based on topological and label attributes
Graph classification is an important data mining task, and various graph kernel methods have been proposed recently for this task. These methods have proven to be effective, but they tend to have high computational overhead. In this paper, we propose an ...
From black and white to full color: extending redescription mining outside the Boolean world
Redescription mining is a powerful data analysis tool that is used to find multiple descriptions of the same entities. Consider geographical regions as an example. They can be characterized by the fauna that inhabits them on one hand and by their ...
A resampling approach for interval-valued data regression
We consider interval-valued data that frequently appear with advanced technologies in current data collection processes. Interval-valued data refer to the data that are observed as ranges instead of single values. In the last decade, several approaches ...
Nearest-neighbors medians clustering
We propose a nonparametric cluster algorithm based on local medians. Each observation is substituted by its local median and this new observation moves toward the peaks and away from the valleys of the distribution. The process is repeated until each ...