Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleJune 2006
Kernel Predictive Linear Gaussian models for nonlinear stochastic dynamical systems
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 1017–1024https://doi.org/10.1145/1143844.1143972The recent Predictive Linear Gaussian model (or PLG) improves upon traditional linear dynamical system models by using a predictive representation of state, which makes consistent parameter estimation possible without any loss of modeling power and ...
- ArticleJune 2006
Probabilistic inference for solving discrete and continuous state Markov Decision Processes
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 945–952https://doi.org/10.1145/1143844.1143963Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. A particularly interesting aspect of the approach is that any existing ...
- ArticleJune 2006
Fast and space efficient string kernels using suffix arrays
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 929–936https://doi.org/10.1145/1143844.1143961String kernels which compare the set of all common substrings between two given strings have recently been proposed by Vishwanathan & Smola (2004). Surprisingly, these kernels can be computed in linear time and linear space using annotated suffix trees. ...
- ArticleJune 2006
Multiclass reduced-set support vector machines
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 921–928https://doi.org/10.1145/1143844.1143960There are well-established methods for reducing the number of support vectors in a trained binary support vector machine, often with minimal impact on accuracy. We show how reduced-set methods can be applied to multiclass SVMs made up of several binary ...
- ArticleJune 2006
Deterministic annealing for semi-supervised kernel machines
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 841–848https://doi.org/10.1145/1143844.1143950An intuitive approach to utilizing unlabeled data in kernel-based classification algorithms is to simply treat unknown labels as additional optimization variables. For margin-based loss functions, one can view this approach as attempting to learn low-...
-
- ArticleJune 2006
Permutation invariant SVMs
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 817–824https://doi.org/10.1145/1143844.1143947We extend Support Vector Machines to input spaces that are sets by ensuring that the classifier is invariant to permutations of sub-elements within each input. Such permutations include reordering of scalars in an input vector, re-orderings of tuples in ...
- ArticleJune 2006
The support vector decomposition machine
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 689–696https://doi.org/10.1145/1143844.1143931In machine learning problems with tens of thousands of features and only dozens or hundreds of independent training examples, dimensionality reduction is essential for good learning performance. In previous work, many researchers have treated the ...
- ArticleJune 2006
Concept boundary detection for speeding up SVMs
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 681–688https://doi.org/10.1145/1143844.1143930Support Vector Machines (SVMs) suffer from an O(n2) training cost, where n denotes the number of training instances. In this paper, we propose an algorithm to select boundary instances as training data to substantially reduce n. Our proposed algorithm ...
- ArticleJune 2006
Pruning in ordered bagging ensembles
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 609–616https://doi.org/10.1145/1143844.1143921We present a novel ensemble pruning method based on reordering the classifiers obtained from bagging and then selecting a subset for aggregation. Ordering the classifiers generated in bagging makes it possible to build subensembles of increasing size by ...
- ArticleJune 2006
Nonstationary kernel combination
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 553–560https://doi.org/10.1145/1143844.1143914The power and popularity of kernel methods stem in part from their ability to handle diverse forms of structured inputs, including vectors, graphs and strings. Recently, several methods have been proposed for combining kernels from heterogeneous data ...
- ArticleJune 2006
A probabilistic model for text kernels
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 537–544https://doi.org/10.1145/1143844.1143912This paper explores several kernels in the context of text classification. A novel view of how documents might have been created is introduced and kernels are derived from this framework. The relations between these kernels as well as to the Gaussian ...
- ArticleJune 2006
Simpler knowledge-based support vector machines
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 521–528https://doi.org/10.1145/1143844.1143910If appropriately used, prior knowledge can significantly improve the predictive accuracy of learning algorithms or reduce the amount of training data needed. In this paper we introduce a simple method to incorporate prior knowledge in support vector ...
- ArticleJune 2006
Learning low-rank kernel matrices
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 505–512https://doi.org/10.1145/1143844.1143908Kernel learning plays an important role in many machine learning tasks. However, algorithms for learning a kernel matrix often scale poorly, with running times that are cubic in the number of data points. In this paper, we propose efficient algorithms ...
- ArticleJune 2006
Learning a kernel function for classification with small training samples
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 401–408https://doi.org/10.1145/1143844.1143895When given a small sample, we show that classification with SVM can be considerably enhanced by using a kernel function learned from the training data prior to discrimination. This kernel is also shown to enhance retrieval based on data similarity. ...
- ArticleJune 2006
Fast transpose methods for kernel learning on sparse data
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 385–392https://doi.org/10.1145/1143844.1143893Kernel-based learning algorithms, such as Support Vector Machines (SVMs) or Perceptron, often rely on sequential optimization where a few examples are added at each iteration. Updating the kernel matrix usually requires matrix-vector multiplications. We ...
- ArticleJune 2006
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 369–376https://doi.org/10.1145/1143844.1143891Many real-world sequence learning tasks require the prediction of sequences of labels from noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is transcribed into words or sub-word units. Recurrent neural networks (RNNs)...
- ArticleJune 2006
Learning user preferences for sets of objects
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 273–280https://doi.org/10.1145/1143844.1143879Most work on preference learning has focused on pairwise preferences or rankings over individual items. In this paper, we present a method for learning preferences over sets of items. Our learning method takes as input a collection of positive examples--...
- ArticleJune 2006
Discriminative cluster analysis
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 241–248https://doi.org/10.1145/1143844.1143875Clustering is one of the most widely used statistical tools for data analysis. Among all existing clustering techniques, k-means is a very popular method because of its ease of programming and because it accomplishes a good trade-off between achieved ...
- ArticleJune 2006
The relationship between Precision-Recall and ROC curves
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 233–240https://doi.org/10.1145/1143844.1143874Receiver Operator Characteristic (ROC) curves are commonly used to present results for binary decision problems in machine learning. However, when dealing with highly skewed datasets, Precision-Recall (PR) curves give a more informative picture of an ...
- ArticleJune 2006
Trading convexity for scalability
ICML '06: Proceedings of the 23rd international conference on Machine learningPages 201–208https://doi.org/10.1145/1143844.1143870Convex learning algorithms, such as Support Vector Machines (SVMs), are often seen as highly desirable because they offer strong practical properties and are amenable to theoretical analysis. However, in this work we show how non-convexity can provide ...