Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–21 of 21 results for author: Király, F J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2101.04938  [pdf, other

    cs.SE cs.LG

    Designing Machine Learning Toolboxes: Concepts, Principles and Patterns

    Authors: Franz J. Király, Markus Löning, Anthony Blaom, Ahmed Guecioueur, Raphael Sonabend

    Abstract: Machine learning (ML) and AI toolboxes such as scikit-learn or Weka are workhorses of contemporary data scientific practice -- their central role being enabled by usable yet powerful designs that allow to easily specify, train and validate complex modeling pipelines. However, despite their universal success, the key design principles in their construction have never been fully analyzed. In this pa… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  2. arXiv:2008.08080  [pdf, ps, other

    stat.CO cs.LG stat.ML

    mlr3proba: An R Package for Machine Learning in Survival Analysis

    Authors: Raphael Sonabend, Franz J. Király, Andreas Bender, Bernd Bischl, Michel Lang

    Abstract: As machine learning has become increasingly popular over the last few decades, so too has the number of machine learning interfaces for implementing these models. Whilst many R libraries exist for machine learning, very few offer extended support for survival analysis. This is problematic considering its importance in fields like medicine, bioinformatics, economics, engineering, and more. mlr3prob… ▽ More

    Submitted 14 December, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: Submitted to Bioinformatics

  3. arXiv:2004.08545  [pdf, other

    stat.ML cs.LG

    Kernels for time series with irregularly-spaced multivariate observations

    Authors: Ahmed Guecioueur, Franz J. Király

    Abstract: Time series are an interesting frontier for kernel-based methods, for the simple reason that there is no kernel designed to represent them and their unique characteristics in full generality. Existing sequential kernels ignore the time indices, with many assuming that the series must be regularly-spaced; some such kernels are not even psd. In this manuscript, we show that a "series kernel" that is… ▽ More

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: 11 pages

  4. arXiv:1909.07872  [pdf, ps, other

    cs.LG stat.ML

    sktime: A Unified Interface for Machine Learning with Time Series

    Authors: Markus Löning, Anthony Bagnall, Sajaysurya Ganesh, Viktor Kazakov, Jason Lines, Franz J. Király

    Abstract: We present sktime -- a new scikit-learn compatible Python library with a unified interface for machine learning with time series. Time series data gives rise to various distinct but closely related learning tasks, such as forecasting and time series classification, many of which can be solved by reducing them to related simpler tasks. We discuss the main rationale for creating a unified interface,… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

  5. arXiv:1901.03678  [pdf, other

    cs.LG cs.AI stat.ML

    Machine Learning Automation Toolbox (MLaut)

    Authors: Viktor Kazakov, Franz J. Király

    Abstract: In this paper we present MLaut (Machine Learning AUtomation Toolbox) for the python data science ecosystem. MLaut automates large-scale evaluation and benchmarking of machine learning algorithms on a large number of datasets. MLaut provides a high-level workflow interface to machine algorithm algorithms, implements a local back-end to a database of dataset collections, trained algorithms, and expe… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

  6. arXiv:1812.10404  [pdf

    cs.CY cs.LG stat.AP stat.ML

    Machine learning and AI research for Patient Benefit: 20 Critical Questions on Transparency, Replicability, Ethics and Effectiveness

    Authors: Sebastian Vollmer, Bilal A. Mateen, Gergo Bohner, Franz J Király, Rayid Ghani, Pall Jonsson, Sarah Cumbers, Adrian Jonas, Katherine S. L. McAllister, Puja Myles, David Granger, Mark Birse, Richard Branson, Karel GM Moons, Gary S Collins, John P. A. Ioannidis, Chris Holmes, Harry Hemingway

    Abstract: Machine learning (ML), artificial intelligence (AI) and other modern statistical methods are providing new opportunities to operationalize previously untapped and rapidly growing sources of data for patient benefit. Whilst there is a lot of promising research currently being undertaken, the literature as a whole lacks: transparency; clear reporting to facilitate replicability; exploration for pote… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 25 pages, 2 boxes, 1 figure

    MSC Class: 68T01

  7. arXiv:1812.07519  [pdf, other

    cs.LG cs.AI stat.ML

    NIPS - Not Even Wrong? A Systematic Review of Empirically Complete Demonstrations of Algorithmic Effectiveness in the Machine Learning and Artificial Intelligence Literature

    Authors: Franz J Király, Bilal Mateen, Raphael Sonabend

    Abstract: Objective: To determine the completeness of argumentative steps necessary to conclude effectiveness of an algorithm in a sample of current ML/AI supervised learning literature. Data Sources: Papers published in the Neural Information Processing Systems (NeurIPS, née NIPS) journal where the official record showed a 2017 year of publication. Eligibility Criteria: Studies reporting a (semi-)super… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

  8. arXiv:1801.00753  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Probabilistic supervised learning

    Authors: Frithjof Gressmann, Franz J. Király, Bilal Mateen, Harald Oberhauser

    Abstract: Predictive modelling and supervised learning are central to modern data science. With predictions from an ever-expanding number of supervised black-box strategies - e.g., kernel methods, random forests, deep learning aka neural networks - being employed as a basis for decision making processes, it is crucial to understand the statistical uncertainty associated with these predictions. As a genera… ▽ More

    Submitted 7 May, 2019; v1 submitted 2 January, 2018; originally announced January 2018.

  9. arXiv:1711.05869  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Predictive Independence Testing, Predictive Conditional Independence Testing, and Predictive Graphical Modelling

    Authors: Samuel Burkart, Franz J Király

    Abstract: Testing (conditional) independence of multivariate random variables is a task central to statistical inference and modelling in general - though unfortunately one for which to date there does not exist a practicable workflow. State-of-art workflows suffer from the need for heuristic or subjective manual choices, high computational complexity, or strong parametric assumptions. We address these pr… ▽ More

    Submitted 28 April, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

  10. arXiv:1701.08055  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Modelling Competitive Sports: Bradley-Terry-Élő Models for Supervised and On-Line Learning of Paired Competition Outcomes

    Authors: Franz J. Király, Zhaozhi Qian

    Abstract: Prediction and modelling of competitive sports outcomes has received much recent attention, especially from the Bayesian statistics and machine learning communities. In the real world setting of outcome prediction, the seminal Élő update still remains, after more than 50 years, a valuable baseline which is difficult to improve upon, though in its original form it is a heuristic and not a proper st… ▽ More

    Submitted 27 January, 2017; originally announced January 2017.

  11. Machine Learning in Falls Prediction; A cognition-based predictor of falls for the acute neurological in-patient population

    Authors: Bilal A. Mateen, Matthias Bussas, Catherine Doogan, Denise Waller, Alessia Saverino, Franz J Király, E Diane Playford

    Abstract: Background Information: Falls are associated with high direct and indirect costs, and significant morbidity and mortality for patients. Pathological falls are usually a result of a compromised motor system, and/or cognition. Very little research has been conducted on predicting falls based on this premise. Aims: To demonstrate that cognitive and motor tests can be used to create a robust predict… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

  12. arXiv:1601.08169  [pdf, ps, other

    stat.ML cs.DM cs.LG math.ST stat.ME

    Kernels for sequentially ordered data

    Authors: Franz J Király, Harald Oberhauser

    Abstract: We present a novel framework for kernel learning with sequential data of any kind, such as time series, sequences of graphs, or strings. Our approach is based on signature features which can be seen as an ordered variant of sample (cross-)moments; it allows to obtain a "sequentialized" version of any static kernel. The sequential kernels are efficiently computable for discrete sequences and are sh… ▽ More

    Submitted 29 January, 2016; originally announced January 2016.

  13. arXiv:1411.7817  [pdf, other

    stat.ML cs.LG math.ST

    Learning with Algebraic Invariances, and the Invariant Kernel Trick

    Authors: Franz J. Király, Andreas Ziehe, Klaus-Robert Müller

    Abstract: When solving data analysis problems it is important to integrate prior knowledge and/or structural invariances. This paper contributes by a novel framework for incorporating algebraic invariance structure into kernels. In particular, we show that algebraic properties such as sign symmetries in data, phase independence, scaling etc. can be included easily by essentially performing the kernel trick… ▽ More

    Submitted 28 November, 2014; originally announced November 2014.

  14. arXiv:1406.2646  [pdf, other

    cs.LG math.AC stat.ML

    Learning with Cross-Kernels and Ideal PCA

    Authors: Franz J Király, Martin Kreuzer, Louis Theran

    Abstract: We describe how cross-kernel matrices, that is, kernel matrices between the data and a custom chosen set of `feature spanning points' can be used for learning. The main potential of cross-kernels lies in the fact that (a) only one side of the matrix scales with the number of data points, and (b) cross-kernels, as opposed to the usual kernel matrices, can be used to certify for the data manifold. O… ▽ More

    Submitted 10 June, 2014; originally announced June 2014.

  15. arXiv:1403.0873  [pdf, ps, other

    math.ST cs.DM cs.LG stat.ME stat.ML

    Matroid Regression

    Authors: Franz J Király, Louis Theran

    Abstract: We propose an algebraic combinatorial method for solving large sparse linear systems of equations locally - that is, a method which can compute single evaluations of the signal without computing the whole signal. The method scales only in the sparsity of the system and not in its size, and allows to provide error estimates for any solution method. At the heart of our approach is the so-called regr… ▽ More

    Submitted 4 March, 2014; originally announced March 2014.

  16. arXiv:1402.4053  [pdf, other

    math.FA cs.CV cs.IT math.AG stat.ML

    The Algebraic Approach to Phase Retrieval and Explicit Inversion at the Identifiability Threshold

    Authors: Franz J Király, Martin Ehler

    Abstract: We study phase retrieval from magnitude measurements of an unknown signal as an algebraic estimation problem. Indeed, phase retrieval from rank-one and more general linear measurements can be treated in an algebraic way. It is verified that a certain number of generic rank-one or generic linear measurements are sufficient to enable signal reconstruction for generic signals, and slightly more gener… ▽ More

    Submitted 17 February, 2014; originally announced February 2014.

  17. arXiv:1402.0099  [pdf, other

    stat.ML cs.LG math.AC math.AG math.ST

    Dual-to-kernel learning with ideals

    Authors: Franz J. Király, Martin Kreuzer, Louis Theran

    Abstract: In this paper, we propose a theory which unifies kernel learning and symbolic algebraic methods. We show that both worlds are inherently dual to each other, and we use this duality to combine the structure-awareness of algebraic methods with the efficiency and generality of kernels. The main idea lies in relating polynomial rings to feature space, and ideals to manifolds, then exploiting this gene… ▽ More

    Submitted 1 February, 2014; originally announced February 2014.

    Comments: 15 pages, 1 figure

  18. arXiv:1309.3233  [pdf, ps, other

    stat.ML cs.LG math.ST

    Efficient Orthogonal Tensor Decomposition, with an Application to Latent Variable Model Learning

    Authors: Franz J. Király

    Abstract: Decomposing tensors into orthogonal factors is a well-known task in statistics, machine learning, and signal processing. We study orthogonal outer product decompositions where the factors in the summands in the decomposition are required to be orthogonal across summands, by relating this orthogonal decomposition to the singular value decompositions of the flattenings. We show that it is a non-triv… ▽ More

    Submitted 12 September, 2013; originally announced September 2013.

  19. arXiv:1302.2767  [pdf, other

    cs.LG cs.IT math.AG stat.ML

    Coherence and sufficient sampling densities for reconstruction in compressed sensing

    Authors: Franz J. Király, Louis Theran

    Abstract: We give a new, very general, formulation of the compressed sensing problem in terms of coordinate projections of an analytic variety, and derive sufficient sampling rates for signal reconstruction. Our bounds are linear in the coherence of the signal space, a geometric parameter independent of the specific signal and measurement, and logarithmic in the ambient dimension where the signal is present… ▽ More

    Submitted 2 November, 2013; v1 submitted 12 February, 2013; originally announced February 2013.

    Comments: 16 pages, 1 figure. v2 streamlines the exposition

  20. arXiv:1211.7369  [pdf, other

    stat.ML cs.LG math.NA

    Approximate Rank-Detecting Factorization of Low-Rank Tensors

    Authors: Franz J. Király, Andreas Ziehe

    Abstract: We present an algorithm, AROFAC2, which detects the (CP-)rank of a degree 3 tensor and calculates its factorization into rank-one components. We provide generative conditions for the algorithm to work and demonstrate on both synthetic and real world data that AROFAC2 is a potentially outperforming alternative to the gold standard PARAFAC over which it has the advantages that it can intrinsically d… ▽ More

    Submitted 30 November, 2012; originally announced November 2012.

  21. arXiv:1211.4116  [pdf, other

    cs.LG math.AG math.CO math.NA stat.ML

    The Algebraic Combinatorial Approach for Low-Rank Matrix Completion

    Authors: Franz J. Király, Louis Theran, Ryota Tomioka

    Abstract: We present a novel algebraic combinatorial view on low-rank matrix completion based on studying relations between a few entries with tools from algebraic geometry and matroid theory. The intrinsic locality of the approach allows for the treatment of single entries in a closed theoretical and practical framework. More specifically, apart from introducing an algebraic combinatorial theory of low-ran… ▽ More

    Submitted 19 August, 2014; v1 submitted 17 November, 2012; originally announced November 2012.

    Comments: 37 pages, with an appendix by Takeaki Uno