Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 68 results for author: Niepert, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04489  [pdf, other

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  2. arXiv:2406.03919  [pdf, other

    cs.LG cs.AI cs.CV cs.NE physics.comp-ph

    Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations

    Authors: Jan Hagnberger, Marimuthu Kalimuthu, Daniel Musekamp, Mathias Niepert

    Abstract: Transformer models are increasingly used for solving Partial Differential Equations (PDEs). Several adaptations have been proposed, all of which suffer from the typical problems of Transformers, such as quadratic memory and time complexity. Furthermore, all prevalent architectures for PDE solving lack at least one of several desirable properties of an ideal surrogate model, such as (i) generalizat… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted for publication at the 41st International Conference on Machine Learning (ICML) 2024

  3. arXiv:2405.17311  [pdf, other

    cs.LG

    Probabilistic Graph Rewiring via Virtual Nodes

    Authors: Chendi Qian, Andrei Manolache, Christopher Morris, Mathias Niepert

    Abstract: Message-passing graph neural networks (MPNNs) have emerged as a powerful paradigm for graph-based machine learning. Despite their effectiveness, MPNNs face challenges such as under-reaching and over-squashing, where limited receptive fields and structural bottlenecks hinder information flow in the graph. While graph transformers hold promise in addressing these issues, their scalability is limited… ▽ More

    Submitted 7 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.02156

  4. arXiv:2405.16148  [pdf, other

    cs.LG

    Accelerating Transformers with Spectrum-Preserving Token Merging

    Authors: Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen, Trung-Tin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Y. Zou, Binh T. Nguyen, Mathias Niepert

    Abstract: Increasing the throughput of the Transformer architecture, a foundational component used in numerous state-of-the-art models for vision and language tasks (e.g., GPT, LLaVa), is an important problem in machine learning. One recent and effective strategy is to merge token representations within Transformer models, aiming to reduce computational and memory requirements while maintaining accuracy. Pr… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Version 1

  5. arXiv:2405.15506  [pdf, other

    cs.LG

    Learning to Discretize Denoising Diffusion ODEs

    Authors: Vinh Tong, Anji Liu, Trung-Dung Hoang, Guy Van den Broeck, Mathias Niepert

    Abstract: Diffusion Probabilistic Models (DPMs) are powerful generative models showing competitive performance in various domains, including image synthesis and 3D point cloud generation. However, sampling from pre-trained DPMs involves multiple neural function evaluations (NFE) to transform Gaussian noise samples into images, resulting in higher computational costs compared to single-step generative models… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2405.14253  [pdf, other

    cs.LG physics.comp-ph

    Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing

    Authors: Viktor Zaverkin, Francesco Alesiani, Takashi Maruyama, Federico Errica, Henrik Christiansen, Makoto Takamoto, Nicolas Weber, Mathias Niepert

    Abstract: The ability to perform fast and accurate atomistic simulations is crucial for advancing the chemical sciences. By learning from high-quality data, machine-learned interatomic potentials achieve accuracy on par with ab initio and first-principles methods at a fraction of their computational cost. The success of machine-learned interatomic potentials arises from integrating inductive biases such as… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2402.01975  [pdf, other

    cs.LG

    Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks

    Authors: Duy M. H. Nguyen, Nina Lukashina, Tai Nguyen, An T. Le, TrungTin Nguyen, Nhat Ho, Jan Peters, Daniel Sonntag, Viktor Zaverkin, Mathias Niepert

    Abstract: A molecule's 2D representation consists of its atoms, their attributes, and the molecule's covalent bonds. A 3D (geometric) representation of a molecule is called a conformer and consists of its atom types and Cartesian coordinates. Every conformer has a potential energy, and the lower this energy, the more likely it occurs in nature. Most existing machine learning methods for molecular property p… ▽ More

    Submitted 10 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  8. arXiv:2401.03349  [pdf, other

    cs.CV cs.LG

    Image Inpainting via Tractable Steering of Diffusion Models

    Authors: Anji Liu, Mathias Niepert, Guy Van den Broeck

    Abstract: Diffusion models are the current state of the art for generating photorealistic images. Controlling the sampling process for constrained image generation tasks such as inpainting, however, remains challenging since exact conditioning on such constraints is intractable. While existing methods use various techniques to approximate the constrained posterior, this paper proposes to exploit the ability… ▽ More

    Submitted 28 November, 2023; originally announced January 2024.

  9. arXiv:2312.16560  [pdf, other

    cs.LG

    Adaptive Message Passing: A General Framework to Mitigate Oversmoothing, Oversquashing, and Underreaching

    Authors: Federico Errica, Henrik Christiansen, Viktor Zaverkin, Takashi Maruyama, Mathias Niepert, Francesco Alesiani

    Abstract: Long-range interactions are essential for the correct description of complex systems in many scientific fields. The price to pay for including them in the calculations, however, is a dramatic increase in the overall computational costs. Recently, deep graph networks have been employed as efficient, data-driven surrogate models for predicting properties of complex systems represented as graphs. The… ▽ More

    Submitted 20 March, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  10. arXiv:2311.11096  [pdf, other

    eess.IV cs.CV

    On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

    Authors: Duy Minh Ho Nguyen, Tan Ngoc Pham, Nghiem Tuong Diep, Nghi Quoc Phan, Quang Pham, Vinh Tong, Binh T. Nguyen, Ngan Hoang Le, Nhat Ho, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive sets of natural image and text data, have emerged as a promising approach. It showcases impressive learning abilities across different tasks with the need for… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2023, Workshop on robustness of zero/few-shot learning in foundation models

  11. arXiv:2310.13977  [pdf, other

    cs.LG cs.IT

    Continual Invariant Risk Minimization

    Authors: Francesco Alesiani, Shujian Yu, Mathias Niepert

    Abstract: Empirical risk minimization can lead to poor generalization behavior on unseen environments if the learned model does not capture invariant feature representations. Invariant risk minimization (IRM) is a recent proposal for discovering environment-invariant representations. IRM was introduced by Arjovsky et al. (2019) and extended by Ahuja et al. (2020). IRM assumes that all environments are avail… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Shorter version of this paper was presented at RobustML workshop of ICLR 2021

  12. arXiv:2310.02156  [pdf, other

    cs.LG cs.NE

    Probabilistically Rewired Message-Passing Neural Networks

    Authors: Chendi Qian, Andrei Manolache, Kareem Ahmed, Zhe Zeng, Guy Van den Broeck, Mathias Niepert, Christopher Morris

    Abstract: Message-passing graph neural networks (MPNNs) emerged as powerful tools for processing graph-structured input. However, they operate on a fixed input graph structure, ignoring potential noise and missing information. Furthermore, their local aggregation mechanism can lead to problems such as over-squashing and limited expressive power in capturing relevant graph structures. Existing solutions to t… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  13. arXiv:2308.06585  [pdf, other

    cs.LG cs.AI cs.DB cs.LO cs.NE

    Approximate Answering of Graph Queries

    Authors: Michael Cochez, Dimitrios Alivanistos, Erik Arakelyan, Max Berrendorf, Daniel Daza, Mikhail Galkin, Pasquale Minervini, Mathias Niepert, Hongyu Ren

    Abstract: Knowledge graphs (KGs) are inherently incomplete because of incomplete world knowledge and bias in what is the input to the KG. Additionally, world knowledge constantly expands and evolves, making existing facts deprecated or introducing new ones. However, we would still want to be able to answer queries as if the graph were complete. In this chapter, we will give an overview of several methods wh… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Preprint of Ch. 17 "Approximate Answering of Graph Queries" in "Compendium of Neurosymbolic Artificial Intelligence", https://ebooks.iospress.nl/ISBN/978-1-64368-406-2

  14. arXiv:2307.14193  [pdf, other

    cs.LG

    Efficient Learning of Discrete-Continuous Computation Graphs

    Authors: David Friede, Mathias Niepert

    Abstract: Numerous models for supervised and reinforcement learning benefit from combinations of discrete and continuous model components. End-to-end learnable discrete-continuous models are compositional, tend to generalize better, and are more interpretable. A popular approach to building discrete-continuous computation graphs is that of integrating discrete probability distributions into neural networks… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Journal ref: NeurIPS 34 (2021) 6720-6732

  15. arXiv:2307.14151  [pdf, other

    cs.LG stat.ML

    Learning Disentangled Discrete Representations

    Authors: David Friede, Christian Reimers, Heiner Stuckenschmidt, Mathias Niepert

    Abstract: Recent successes in image generation, model-based reinforcement learning, and text-to-image generation have demonstrated the empirical advantages of discrete latent representations, although the reasons behind their benefits remain unclear. We explore the relationship between discrete latent spaces and disentangled representations by replacing the standard Gaussian variational autoencoder (VAE) wi… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  16. arXiv:2306.11925  [pdf, other

    cs.CV

    LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching

    Authors: Duy M. H. Nguyen, Hoang Nguyen, Nghiem T. Diep, Tan N. Pham, Tri Cao, Binh T. Nguyen, Paul Swoboda, Nhat Ho, Shadi Albarqouni, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Obtaining large pre-trained models that can be fine-tuned to new tasks with limited annotated samples has remained an open challenge for medical imaging data. While pre-trained deep networks on ImageNet and vision-language foundation models trained on web-scale data are prevailing approaches, their effectiveness on medical tasks is limited due to the significant domain shift between natural and me… ▽ More

    Submitted 18 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted at NeurIPS 2023

  17. arXiv:2305.10544  [pdf, other

    cs.LG cs.AI

    Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks

    Authors: Federico Errica, Mathias Niepert

    Abstract: We introduce Graph-Induced Sum-Product Networks (GSPNs), a new probabilistic framework for graph representation learning that can tractably answer probabilistic queries. Inspired by the computational trees induced by vertices in the context of message-passing neural networks, we build hierarchies of sum-product networks (SPNs) where the parameters of a parent SPN are learnable transformations of t… ▽ More

    Submitted 16 February, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: The 12th International Conference on Learning Representations (ICLR 2024)

  18. arXiv:2304.14118  [pdf, other

    cs.LG cs.CE physics.comp-ph physics.flu-dyn physics.geo-ph

    Learning Neural PDE Solvers with Parameter-Guided Channel Attention

    Authors: Makoto Takamoto, Francesco Alesiani, Mathias Niepert

    Abstract: Scientific Machine Learning (SciML) is concerned with the development of learned emulators of physical systems governed by partial differential equations (PDE). In application domains such as weather forecasting, molecular dynamics, and inverse design, ML-based surrogate models are increasingly used to augment or replace inefficient and often non-differentiable numerical simulation algorithms. Whi… ▽ More

    Submitted 21 July, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: accepted for publication in ICML2023

  19. Learning Sparsity of Representations with Discrete Latent Variables

    Authors: Zhao Xu, Daniel Onoro Rubio, Giuseppe Serra, Mathias Niepert

    Abstract: Deep latent generative models have attracted increasing attention due to the capacity of combining the strengths of deep learning and probabilistic models in an elegant way. The data representations learned with the models are often continuous and dense. However in many applications, sparse representations are expected, such as learning sparse high dimensional embedding of data in an unsupervised… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  20. arXiv:2212.05178  [pdf, ps, other

    cs.LG

    State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions

    Authors: Cheng Wang, Carolin Lawrence, Mathias Niepert

    Abstract: Recurrent neural networks are a widely used class of neural architectures. They have, however, two shortcomings. First, they are often treated as black-box models and as such it is difficult to understand what exactly they learn as well as how they arrive at a particular prediction. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in prin… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: To appear at IEEE Transactions on Pattern Analysis and Machine Intelligence. The extended version of State-Regularized Recurrent Neural Networks [arXiv:1901.08817]

  21. arXiv:2210.08922  [pdf, other

    cs.CL

    Joint Multilingual Knowledge Graph Completion and Alignment

    Authors: Vinh Tong, Dat Quoc Nguyen, Trung Thanh Huynh, Tam Thanh Nguyen, Quoc Viet Hung Nguyen, Mathias Niepert

    Abstract: Knowledge graph (KG) alignment and completion are usually treated as two independent tasks. While recent work has leveraged entity and relation alignments from multiple KGs, such as alignments between multilingual KGs with common entities and relations, a deeper understanding of the ways in which multilingual KG completion (MKGC) can aid the creation of multilingual KG alignments (MKGA) is still l… ▽ More

    Submitted 18 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022 (Findings), to appear

  22. arXiv:2210.07182  [pdf, other

    cs.LG cs.CV physics.flu-dyn physics.geo-ph

    PDEBENCH: An Extensive Benchmark for Scientific Machine Learning

    Authors: Makoto Takamoto, Timothy Praditia, Raphael Leiteritz, Dan MacKinlay, Francesco Alesiani, Dirk Pflüger, Mathias Niepert

    Abstract: Machine learning-based modeling of physical systems has experienced increased interest in recent years. Despite some impressive progress, there is still a lack of benchmarks for Scientific ML that are easy to use but still challenging and representative of a wide range of problems. We introduce PDEBench, a benchmark suite of time-dependent simulation tasks based on Partial Differential Equations (… ▽ More

    Submitted 13 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 16 pages (main body) + 34 pages (supplemental material), accepted for publication in NeurIPS 2022 Track Datasets and Benchmarks

  23. arXiv:2210.01941  [pdf, other

    cs.LG cs.AI

    SIMPLE: A Gradient Estimator for $k$-Subset Sampling

    Authors: Kareem Ahmed, Zhe Zeng, Mathias Niepert, Guy Van den Broeck

    Abstract: $k$-subset sampling is ubiquitous in machine learning, enabling regularization and interpretability through sparsity. The challenge lies in rendering $k$-subset sampling amenable to end-to-end learning. This has typically involved relaxing the reparameterized samples to allow for backpropagation, with the risk of introducing high bias and high variance. In this work, we fall back to discrete $k… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: ICLR 2023; fixed typo in Theorem 1

  24. arXiv:2209.14402  [pdf, other

    cs.LG cs.AI

    L2XGNN: Learning to Explain Graph Neural Networks

    Authors: Giuseppe Serra, Mathias Niepert

    Abstract: Graph Neural Networks (GNNs) are a popular class of machine learning models. Inspired by the learning to explain (L2X) paradigm, we propose L2XGNN, a framework for explainable GNNs which provides faithful explanations by design. L2XGNN learns a mechanism for selecting explanatory subgraphs (motifs) which are exclusively used in the GNNs message-passing operations. L2XGNN is able to select, for eac… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

  25. arXiv:2209.04862  [pdf, other

    cs.LG cs.AI cs.CL cs.NE

    Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

    Authors: Pasquale Minervini, Luca Franceschi, Mathias Niepert

    Abstract: The integration of discrete algorithmic components in deep learning architectures has numerous applications. Recently, Implicit Maximum Likelihood Estimation (IMLE, Niepert, Minervini, and Franceschi 2021), a class of gradient estimators for discrete exponential family distributions, was proposed by combining implicit differentiation through perturbation with the path-wise gradient estimator. Howe… ▽ More

    Submitted 5 February, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

  26. arXiv:2206.11168  [pdf, other

    cs.LG cs.AI cs.DS cs.NE stat.ML

    Ordered Subgraph Aggregation Networks

    Authors: Chendi Qian, Gaurav Rattan, Floris Geerts, Christopher Morris, Mathias Niepert

    Abstract: Numerous subgraph-enhanced graph neural networks (GNNs) have emerged recently, provably boosting the expressive power of standard (message-passing) GNNs. However, there is a limited understanding of how these approaches relate to each other and to the Weisfeiler-Leman hierarchy. Moreover, current approaches either use all subgraphs of a given size, sample them uniformly at random, or use hand-craf… ▽ More

    Submitted 15 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2022. Fixed link to code repository

  27. arXiv:2110.08144  [pdf, other

    cs.CL cs.AI

    milIE: Modular & Iterative Multilingual Open Information Extraction

    Authors: Bhushan Kotnis, Kiril Gashteovski, Daniel Oñoro Rubio, Vanesa Rodriguez-Tembras, Ammar Shaker, Makoto Takamoto, Mathias Niepert, Carolin Lawrence

    Abstract: Open Information Extraction (OpenIE) is the task of extracting (subject, predicate, object) triples from natural language sentences. Current OpenIE systems extract all triple slots independently. In contrast, we explore the hypothesis that it may be beneficial to extract triple slots iteratively: first extract easy slots, followed by the difficult ones by conditioning on the easy slots, and theref… ▽ More

    Submitted 25 April, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  28. arXiv:2109.07464  [pdf, other

    cs.CL

    AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark

    Authors: Niklas Friedrich, Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Open Information Extraction (OIE) is the task of extracting facts from sentences in the form of relations and their corresponding arguments in schema-free manner. Intrinsic performance of OIE systems is difficult to measure due to the incompleteness of existing OIE benchmarks: the ground truth extractions do not group all acceptable surface realizations of the same fact that can be extracted from… ▽ More

    Submitted 13 April, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

  29. arXiv:2109.06850  [pdf, other

    cs.CL cs.AI

    BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction Evaluation

    Authors: Kiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavaš

    Abstract: Intrinsic evaluations of OIE systems are carried out either manually -- with human evaluators judging the correctness of extractions -- or automatically, on standardized benchmarks. The latter, while much more cost-effective, is less reliable, primarily because of the incompleteness of the existing OIE benchmarks: the ground truth extractions do not include all acceptable variants of the same fact… ▽ More

    Submitted 13 April, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

  30. arXiv:2106.13642  [pdf, other

    cs.LG stat.ML

    VEGN: Variant Effect Prediction with Graph Neural Networks

    Authors: Jun Cheng, Carolin Lawrence, Mathias Niepert

    Abstract: Genetic mutations can cause disease by disrupting normal gene function. Identifying the disease-causing mutations from millions of genetic variants within an individual patient is a challenging problem. Computational methods which can prioritize disease-causing mutations have, therefore, enormous applications. It is well-known that genes function through a complex regulatory network. However, exis… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: Accepted at Workshop on Computational Biology, co-located with the 38th International Conference on Machine Learning

  31. arXiv:2106.01798  [pdf, other

    cs.LG cs.AI

    Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

    Authors: Mathias Niepert, Pasquale Minervini, Luca Franceschi

    Abstract: Combining discrete probability distributions and combinatorial optimization problems with neural network components has numerous applications but poses several challenges. We propose Implicit Maximum Likelihood Estimation (I-MLE), a framework for end-to-end learning of models combining discrete exponential family distributions and differentiable neural components. I-MLE is widely applicable as it… ▽ More

    Submitted 27 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready; repo: https://github.com/nec-research/tf-imle

  32. arXiv:2011.12010  [pdf, other

    cs.LG

    Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs

    Authors: Cheng Wang, Carolin Lawrence, Mathias Niepert

    Abstract: Uncertainty quantification is crucial for building reliable and trustable machine learning systems. We propose to estimate uncertainty in recurrent neural networks (RNNs) via stochastic discrete state transitions over recurrent timesteps. The uncertainty of the model can be quantified by running a prediction several times, each time sampling from the recurrent state transition distribution, leadin… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  33. arXiv:2010.05516  [pdf, other

    cs.LG cs.AI stat.ML

    Explaining Neural Matrix Factorization with Gradient Rollback

    Authors: Carolin Lawrence, Timo Sztyler, Mathias Niepert

    Abstract: Explaining the predictions of neural black-box models is an important problem, especially when such models are used in applications where user trust is crucial. Estimating the influence of training examples on a learned neural model's behavior allows us to identify training examples most responsible for a given prediction and, therefore, to faithfully explain the output of a black-box model. The m… ▽ More

    Submitted 15 December, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence, 2021. Includes Appendix

  34. arXiv:2004.02596  [pdf, other

    cs.AI cs.LG

    Answering Complex Queries in Knowledge Graphs with Bidirectional Sequence Encoders

    Authors: Bhushan Kotnis, Carolin Lawrence, Mathias Niepert

    Abstract: Representation learning for knowledge graphs (KGs) has focused on the problem of answering simple link prediction queries. In this work we address the more ambitious challenge of predicting the answers of conjunctive queries with multiple missing entities. We propose Bi-Directional Query Embedding (BIQE), a method that embeds conjunctive queries with models based on bi-directional attention mechan… ▽ More

    Submitted 4 February, 2021; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 8 pages, 2 figures

  35. arXiv:1908.05915  [pdf, other

    stat.ML cs.CL cs.LG

    Attending to Future Tokens For Bidirectional Sequence Generation

    Authors: Carolin Lawrence, Bhushan Kotnis, Mathias Niepert

    Abstract: Neural sequence generation is typically performed token-by-token and left-to-right. Whenever a token is generated only previously produced tokens are taken into consideration. In contrast, for problems such as sequence classification, bidirectional attention, which takes both past and future tokens into consideration, has been shown to perform much better. We propose to make the sequence generatio… ▽ More

    Submitted 17 September, 2019; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019, Hong Kong, China

  36. arXiv:1903.11960  [pdf, other

    cs.LG stat.ML

    Learning Discrete Structures for Graph Neural Networks

    Authors: Luca Franceschi, Mathias Niepert, Massimiliano Pontil, Xiao He

    Abstract: Graph neural networks (GNNs) are a popular class of machine learning models whose major advantage is their ability to incorporate a sparse and discrete dependency structure between data points. Unfortunately, GNNs can only be used when such a graph-structure is available. In practice, however, real-world graphs are often noisy and incomplete or might not be available at all. With this work, we pro… ▽ More

    Submitted 19 June, 2020; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: ICML 2019, code at https://github.com/lucfra/LDS - Revision of Sec. 3

  37. arXiv:1903.10794  [pdf, other

    cs.IR cs.LG

    RecSys-DAN: Discriminative Adversarial Networks for Cross-Domain Recommender Systems

    Authors: Cheng Wang, Mathias Niepert, Hui Li

    Abstract: Data sparsity and data imbalance are practical and challenging issues in cross-domain recommender systems. This paper addresses those problems by leveraging the concepts which derive from representation learning, adversarial learning and transfer learning (particularly, domain adaptation). Although various transfer learning methods have shown promising performance in this context, our proposed nov… ▽ More

    Submitted 10 April, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: 10 pages, IEEE-TNNLS

  38. arXiv:1903.05485  [pdf, other

    cs.AI cs.CL

    MMKG: Multi-Modal Knowledge Graphs

    Authors: Ye Liu, Hui Li, Alberto Garcia-Duran, Mathias Niepert, Daniel Onoro-Rubio, David S. Rosenblum

    Abstract: We present MMKG, a collection of three knowledge graphs that contain both numerical features and (links to) images for all entities as well as entity alignments between pairs of KGs. Therefore, multi-relational link prediction and entity matching communities can benefit from this resource. We believe this data set has the potential to facilitate the development of novel multi-modal learning approa… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: ESWC 2019

  39. arXiv:1901.08817  [pdf, other

    cs.LG stat.ML

    State-Regularized Recurrent Neural Networks

    Authors: Cheng Wang, Mathias Niepert

    Abstract: Recurrent neural networks are a widely used class of neural architectures. They have, however, two shortcomings. First, it is difficult to understand what exactly they learn. Second, they tend to work poorly on sequences requiring long-term memorization, despite having this capacity in principle. We aim to address both shortcomings with a class of recurrent networks that use a stochastic state tra… ▽ More

    Submitted 7 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: to appear at ICML2019, 20 pages

  40. arXiv:1811.04752  [pdf, other

    cs.LG stat.ML

    Learning Representations of Missing Data for Predicting Patient Outcomes

    Authors: Brandon Malone, Alberto Garcia-Duran, Mathias Niepert

    Abstract: Extracting actionable insight from Electronic Health Records (EHRs) poses several challenges for traditional machine learning approaches. Patients are often missing data relative to each other; the data comes in a variety of modalities, such as multivariate time series, free text, and categorical demographic information; important relationships among patients can be difficult to detect; and many o… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

  41. arXiv:1810.09227  [pdf, other

    cs.DB cs.LG stat.ML

    Knowledge Graph Completion to Predict Polypharmacy Side Effects

    Authors: Brandon Malone, Alberto García-Durán, Mathias Niepert

    Abstract: The polypharmacy side effect prediction problem considers cases in which two drugs taken individually do not result in a particular side effect; however, when the two drugs are taken in combination, the side effect manifests. In this work, we demonstrate that multi-relational knowledge graph completion achieves state-of-the-art results on the polypharmacy side effect prediction problem. Empirical… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 13th International Conference on Data Integration in the Life Sciences (DILS2018)

  42. arXiv:1809.03202  [pdf, other

    cs.AI cs.CL

    Learning Sequence Encoders for Temporal Knowledge Graph Completion

    Authors: Alberto García-Durán, Sebastijan Dumančić, Mathias Niepert

    Abstract: Research on link prediction in knowledge graphs has mainly focused on static multi-relational data. In this work we consider temporal knowledge graphs where relations between entities may only hold for a time interval or a specific point in time. In line with previous work on static knowledge graphs, we propose to address this problem by learning latent entity and relation type representations. To… ▽ More

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: EMNLP'18

  43. arXiv:1808.06791  [pdf, other

    cs.IR cs.LG stat.ML

    LRMM: Learning to Recommend with Missing Modalities

    Authors: Cheng Wang, Mathias Niepert, Hui Li

    Abstract: Multimodal learning has shown promising performance in content-based recommendation due to the auxiliary user and item information of multiple modalities such as text and images. However, the problem of incomplete and missing modality is rarely explored and most existing methods fail in learning a recommendation model with missing or corrupted modalities. In this paper, we propose LRMM, a novel fr… ▽ More

    Submitted 30 August, 2018; v1 submitted 21 August, 2018; originally announced August 2018.

    Comments: 11 pages, EMNLP 2018

  44. arXiv:1806.11391  [pdf, other

    cs.AI cs.LG stat.ML

    A Comparative Study of Distributional and Symbolic Paradigms for Relational Learning

    Authors: Sebastijan Dumancic, Alberto Garcia-Duran, Mathias Niepert

    Abstract: Many real-world domains can be expressed as graphs and, more generally, as multi-relational knowledge graphs. Though reasoning and learning with knowledge graphs has traditionally been addressed by symbolic approaches, recent methods in (deep) representation learning has shown promising results for specialized tasks such as knowledge base completion. These approaches abandon the traditional symbol… ▽ More

    Submitted 24 March, 2020; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: corrected version: incorrect evaluation fixed; IJCAI 2019

  45. arXiv:1806.04009  [pdf, other

    cs.CV

    Contextual Hourglass Networks for Segmentation and Density Estimation

    Authors: Daniel Oñoro-Rubio, Mathias Niepert

    Abstract: Hourglass networks such as the U-Net and V-Net are popular neural architectures for medical image segmentation and counting problems. Typical instances of hourglass networks contain shortcut connections between mirroring layers. These shortcut connections improve the performance and it is hypothesized that this is due to mitigating effects on the vanishing gradient problem and the ability of the m… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

  46. arXiv:1805.02919  [pdf, other

    cs.CV

    Learning Short-Cut Connections for Object Counting

    Authors: Daniel Oñoro-Rubio, Mathias Niepert, Roberto J. López-Sastre

    Abstract: Object counting is an important task in computer vision due to its growing demand in applications such as traffic monitoring or surveillance. In this paper, we consider object counting as a learning problem of a joint feature extraction and pixel-wise object density estimation with Convolutional-Deconvolutional networks. We introduce a novel counting model, named Gated U-Net (GU-Net). Specifically… ▽ More

    Submitted 15 November, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

  47. arXiv:1805.01837  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Towards a Spectrum of Graph Convolutional Networks

    Authors: Mathias Niepert, Alberto Garcia-Duran

    Abstract: We present our ongoing work on understanding the limitations of graph convolutional networks (GCNs) as well as our work on generalizations of graph convolutions for representing more complex node attribute dependencies. Based on an analysis of GCNs with the help of the corresponding computation graphs, we propose a generalization of existing GCNs where the aggregation operations are (a) determined… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  48. arXiv:1804.08378  [pdf, other

    cs.DC cs.AI cs.CV cs.NE cs.PF

    BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism

    Authors: Nicolas Weber, Florian Schmidt, Mathias Niepert, Felipe Huici

    Abstract: Neural network frameworks such as PyTorch and TensorFlow are the workhorses of numerous machine learning applications ranging from object recognition to machine translation. While these frameworks are versatile and straightforward to use, the training of and inference in deep neural networks is resource (energy, compute, and memory) intensive. In contrast to recent works focusing on algorithmic en… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: Technical Report, 13 pages

  49. arXiv:1802.00673  [pdf, other

    cs.DC cs.LG cs.OS

    Representation Learning for Resource Usage Prediction

    Authors: Florian Schmidt, Mathias Niepert, Felipe Huici

    Abstract: Creating a model of a computer system that can be used for tasks such as predicting future resource usage and detecting anomalies is a challenging problem. Most current systems rely on heuristics and overly simplistic assumptions about the workloads and system statistics. These heuristics are typically a one-size-fits-all solution so as to be applicable in a wide range of applications and systems… ▽ More

    Submitted 2 February, 2018; originally announced February 2018.

    Comments: 3 pages, 2 figures, SysML 2018

  50. arXiv:1801.10095  [pdf, other

    cs.IR cs.CL

    TransRev: Modeling Reviews as Translations from Users to Items

    Authors: Alberto Garcia-Duran, Roberto Gonzalez, Daniel Onoro-Rubio, Mathias Niepert, Hui Li

    Abstract: The text of a review expresses the sentiment a customer has towards a particular product. This is exploited in sentiment analysis where machine learning models are used to predict the review score from the text of the review. Furthermore, the products costumers have purchased in the past are indicative of the products they will purchase in the future. This is what recommender systems exploit by le… ▽ More

    Submitted 18 April, 2018; v1 submitted 30 January, 2018; originally announced January 2018.