JMLR: Vol 16, No 1

Volume 16, Issue 1January 2015

Volume 16, Issue 1

January 2015

Editor:

Kevin Murphy
Google
,
Bernhard Schölkopf
MPI for Intelligent Systems

Publisher:

JMLR.org

ISSN:1532-4435

EISSN:1533-7928

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

article

Free

Statistical decision making for optimal budget allocation in crowd labeling

Pages 1–46

It has become increasingly popular to obtain machine learning labels through commercial crowdsourcing services. The crowdsourcing workers or annotators are paid for each label they provide, but the task requester usually has only a limited amount of the ...

article

Free

Simultaneous pursuit of sparseness and rank structures for matrix decomposition

Pages 47–75

In multi-response regression, pursuit of two different types of structures is essential to battle the curse of dimensionality. In this paper, we seek a sparsest decomposition representation of a parameter matrix in terms of a sum of sparse and low rank ...

article

Free

Statistical topological data analysis using persistence landscapes

Peter Bubenik

Pages 77–102

We define a new topological summary for data that we call the persistence landscape. Since this summary lies in a vector space, it is easy to combine with tools from statistics and machine learning, in contrast to the standard topological summaries. ...

article

Free

Links between multiplicity automata, observable operator models and predictive state representations: a unified learning framework

Pages 103–147

Stochastic multiplicity automata (SMA) are weighted finite automata that generalize probabilistic automata. They have been used in the context of probabilistic grammatical inference. Observable operator models (OOMs) are a generalization of hidden ...

article

Free

SAMOA: scalable advanced massive online analysis

Pages 149–153

SAMOA (SCALABLE ADVANCED MASSIVE ONLINE ANALYSIS) is a platform for mining big data streams. It provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and ...

article

Free

Online learning via sequential complexities

Pages 155–186

We consider the problem of sequential prediction and provide tools to study the minimax value of the associated game. Classical statistical learning theory provides several useful complexity measures to study learning with i.i.d. data. Our proposed ...

article

Free

Learning transformations for clustering and classification

Pages 187–225

A low-rank transformation learning framework for subspace clustering and classification is proposed here. Many high-dimensional data, such as face images and motion sequences, approximately lie in a union of low-dimensional subspaces. The corresponding ...

article

Free

Multi-layered gesture recognition with Kinect

Pages 227–254

This paper proposes a novel multi-layered gesture recognition method with Kinect. We explore the essential linguistic characters of gestures: the components concurrent character and the sequential organization character, in a multi-layered framework, ...

article

Free

Multimodal gesture recognition via multiple hypotheses rescoring

Pages 255–284

We present a new framework for multimodal gesture recognition that is based on a multiple hypotheses rescoring fusion scheme. We specifically deal with a demanding Kinect-based multimodal data set, introduced in a recent gesture recognition challenge (...

article

Free

An asynchronous parallel stochastic coordinate descent algorithm

Pages 285–322

We describe an asynchronous parallel stochastic coordinate descent algorithm for minimizing smooth unconstrained or separably constrained functions. The method achieves a linear convergence rate on functions that satisfy an essential strong convexity ...

article

Free

Geometric intuition and algorithms for Ev-SVM

Pages 323–369

In this work we address the Ev-SVM model proposed by Pérez-Cruz et al. as an extension of the traditional v support vector classification model (v-SVM). Through an enhancement of the range of admissible values for the regularization parameter v, the Ev-...

article

Free

Composite self-concordant minimization

Pages 371–416

We propose a variable metric framework for minimizing the sum of a self-concordant function and a possibly non-smooth convex function, endowed with an easily computable proximal operator. We theoretically establish the convergence of our framework ...

article

Free

Network granger causality with inherent grouping structure

Pages 417–453

The problem of estimating high-dimensional network models arises naturally in the analysis of many biological and socio-economic systems. In this work, we aim to learn a network structure from temporal panel data, employing the framework of Granger ...

article

Free

Iterative and active graph clustering using trace norm minimization without cluster size constraints

Pages 455–490

This paper investigates graph clustering under the planted partition model in the presence of small clusters. Traditional results dictate that for an algorithm to provably correctly recover the underlying clusters, all clusters must be sufficiently ...

article

Free

A classification module for genetic programming algorithms in JCLEC

Pages 491–494

JCLEC-Classification is a usable and extensible open source library for genetic programming classification algorithms. It houses implementations of rule-based methods for classification based on genetic programming, supporting multiple model ...

article

Free

AD³: alternating directions dual decomposition for MAP inference in graphical models

Pages 495–545

We present AD³, a new algorithm for approximate maximum a posteriori (MAP) inference on factor graphs, based on the alternating directions method of multipliers. Like other dual decomposition algorithms, AD³ has a modular architecture, where local ...

article

Free

Introducing CURRENNT: the Munich open-source CUDA recurrent neural network toolkit

Pages 547–551

In this article, we introduce CURRENNT, an open-source parallel implementation of deep recurrent neural networks (RNNs) supporting graphics processing units (GPUs) through NVIDIA's Computed Unified Device Architecture (CUDA). CURRENNT supports uni- and ...

article

Free

The flare package for high dimensional linear regression and precision matrix estimation in R

Pages 553–557

This paper describes an R package named flare, which implements a family of new high dimensional regression methods (LAD Lasso, SQRT Lasso, l_q Lasso, and Dantzig selector) and their extensions to sparse precision matrix estimation (TIGER and CLIME). ...

article

Free

Regularized M-estimators with nonconvexity: statistical and algorithmic theory for local optima

Pages 559–616

We provide novel theoretical results regarding local optima of regularized M-estimators, allowing for nonconvexity in both loss and penalty functions. Under restricted strong convexity on the loss and suitable regularity conditions on the penalty, we ...

article

Free

Generalized hierarchical kernel learning

Pages 617–652

This paper generalizes the framework of Hierarchical Kernel Learning (HKL) and illustrates its utility in the domain of rule learning. HKL involves Multiple Kernel Learning over a set of given base kernels assumed to be embedded on a directed acyclic ...

article

Free

Discrete restricted Boltzmann machines

Pages 653–672

We describe discrete restricted Boltzmann machines: probabilistic graphical models with bipartite interactions between visible and hidden discrete variables. Examples are binary restricted Boltzmann machines and discrete naïve Bayes models. We detail ...

article

Free

Evolving GPU machine code

Pages 673–712

Parallel Graphics Processing Unit (GPU) implementations of GP have appeared in the literature using three main methodologies: (i) compilation, which generates the individuals in GPU code and requires compilation; (ii) pseudo-assembly, which generates ...

article

Free

A compression technique for analyzing disagreement-based active learning

Pages 713–745

We introduce a new and improved characterization of the label complexity of disagreement-based active learning, in which the leading quantity is the version space compression set size. This quantity is defined as the size of the smallest subset of the ...

article

Free

Response-based approachability with applications to generalized no-regret problems

Pages 747–773

Blackwell's theory of approachability provides fundamental results for repeated games with vector-valued payoffs, which have been usefully applied in the theory of learning in games, and in devising online learning algorithms in the adversarial setup. A ...

article

Free

Strong consistency of the prototype based clustering in probabilistic space

Vladimir Nikulin

Pages 775–785

In this paper we formulate in general terms an approach to prove strong consistency of the Empirical Risk Minimisation inductive principle applied to the prototype or distance based clustering. This approach was motivated by the Divisive Information-...

article

Free

Risk bounds for the majority vote: from a PAC-Bayesian analysis to a learning algorithm

Pages 787–860

We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average ...

article

Free

A statistical perspective on algorithmic leveraging

Pages 861–911

One popular method for dealing with large-scale data sets is sampling. For example, by using the empirical statistical leverage scores as an importance sampling distribution, the method of algorithmic leveraging samples and rescales rows/columns of data ...

article

Free

Distributed matrix completion and robust factorization

Pages 913–960

If learning methods are to scale to the massive sizes of modern data sets, it is essential for the field of machine learning to embrace parallel and distributed computing. Inspired by the recent development of matrix factorization methods with rich ...

article

Free

Combined l₁ and greedy l₀ penalized least squares for linear model selection

Pages 961–992

We introduce a computationally effective algorithm for a linear model selection consisting of three steps: screening-ordering-selection (SOS). Screening of predictors is based on the thresholded Lasso that is l₁ penalized least squares. The screened ...

article

Free

Learning with the maximum correntropy criterion induced losses for regression

Pages 993–1034

Within the statistical learning framework, this paper studies the regression model associated with the correntropy induced losses. The correntropy, as a similarity measure, has been frequently employed in signal processing and pattern recognition. ...

The Journal of Machine Learning Research

Sections

Statistical decision making for optimal budget allocation in crowd labeling

Simultaneous pursuit of sparseness and rank structures for matrix decomposition

Statistical topological data analysis using persistence landscapes

Links between multiplicity automata, observable operator models and predictive state representations: a unified learning framework

SAMOA: scalable advanced massive online analysis

Online learning via sequential complexities

Learning transformations for clustering and classification

Multi-layered gesture recognition with Kinect

Multimodal gesture recognition via multiple hypotheses rescoring

An asynchronous parallel stochastic coordinate descent algorithm

Geometric intuition and algorithms for Ev-SVM

Composite self-concordant minimization

Network granger causality with inherent grouping structure

Iterative and active graph clustering using trace norm minimization without cluster size constraints

A classification module for genetic programming algorithms in JCLEC

AD³: alternating directions dual decomposition for MAP inference in graphical models

Introducing CURRENNT: the Munich open-source CUDA recurrent neural network toolkit

The flare package for high dimensional linear regression and precision matrix estimation in R

Regularized M-estimators with nonconvexity: statistical and algorithmic theory for local optima

Generalized hierarchical kernel learning

Discrete restricted Boltzmann machines

Evolving GPU machine code

A compression technique for analyzing disagreement-based active learning

Response-based approachability with applications to generalized no-regret problems

Strong consistency of the prototype based clustering in probabilistic space

Risk bounds for the majority vote: from a PAC-Bayesian analysis to a learning algorithm

A statistical perspective on algorithmic leveraging

Distributed matrix completion and robust factorization

Combined l₁ and greedy l₀ penalized least squares for linear model selection

Learning with the maximum correntropy criterion induced losses for regression

Sections

Save to Binder

Subjects

Comments