Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–7 of 7 results for author: Damrich, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.03087  [pdf, other

    cs.LG math.AT

    Persistent Homology for High-dimensional Data Based on Spectral Methods

    Authors: Sebastian Damrich, Philipp Berens, Dmitry Kobak

    Abstract: Persistent homology is a popular computational tool for analyzing the topology of point clouds, such as the presence of loops or voids. However, many real-world datasets with low intrinsic dimensionality reside in an ambient space of much higher dimensionality. We show that in this case traditional persistent homology becomes very sensitive to noise and fails to detect the correct topology. The sa… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 48 pages, 39 figures

  2. arXiv:2306.17638  [pdf, other

    cs.LG

    Geometric Autoencoders -- What You See is What You Decode

    Authors: Philipp Nazari, Sebastian Damrich, Fred A. Hamprecht

    Abstract: Visualization is a crucial step in exploratory data analysis. One possible approach is to train an autoencoder with low-dimensional latent space. Large network depth and width can help unfolding the data. However, such expressive networks can achieve low reconstruction error even when the latent representation is distorted. To avoid such misleading visualizations, we propose first a differential g… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 24 pages, ICML 2023

  3. arXiv:2206.01816  [pdf, other

    cs.LG cs.HC

    From $t$-SNE to UMAP with contrastive learning

    Authors: Sebastian Damrich, Jan Niklas Böhm, Fred A. Hamprecht, Dmitry Kobak

    Abstract: Neighbor embedding methods $t$-SNE and UMAP are the de facto standard for visualizing high-dimensional datasets. Motivated from entirely different viewpoints, their loss functions appear to be unrelated. In practice, they yield strongly differing embeddings and can suggest conflicting interpretations of the same data. The fundamental reasons for this and, more generally, the exact relationship bet… ▽ More

    Submitted 28 February, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: ICLR 2023. 44 pages, 19 figures. Code at https://github.com/hci-unihd/cl-tsne-umap and https://github.com/berenslab/contrastive-ne

    Journal ref: ICLR 2023

  4. arXiv:2103.14608  [pdf, other

    cs.LG stat.ML

    On UMAP's true loss function

    Authors: Sebastian Damrich, Fred A. Hamprecht

    Abstract: UMAP has supplanted t-SNE as state-of-the-art for visualizing high-dimensional datasets in many disciplines, but the reason for its success is not well understood. In this work, we investigate UMAP's sampling based optimization scheme in detail. We derive UMAP's effective loss function in closed form and find that it differs from the published one. As a consequence, we show that UMAP does not aim… ▽ More

    Submitted 22 April, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 20 pages, 15 figures; minor changes, added run-times and error bars

  5. arXiv:2102.05892  [pdf, other

    q-bio.QM cs.NE

    Visualizing hierarchies in scRNA-seq data using a density tree-biased autoencoder

    Authors: Quentin Garrido, Sebastian Damrich, Alexander Jäger, Dario Cerletti, Manfred Claassen, Laurent Najman, Fred Hamprecht

    Abstract: Motivation: Single cell RNA sequencing (scRNA-seq) data makes studying the development of cells possible at unparalleled resolution. Given that many cellular differentiation processes are hierarchical, their scRNA-seq data is expected to be approximately tree-shaped in gene expression space. Inference and representation of this tree-structure in two dimensions is highly desirable for biological in… ▽ More

    Submitted 22 April, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Journal ref: Bioinformatics, Oxford University Press (OUP), In press

  6. arXiv:2011.13228  [pdf, other

    cs.CV eess.IV

    MultiStar: Instance Segmentation of Overlapping Objects with Star-Convex Polygons

    Authors: Florin C. Walter, Sebastian Damrich, Fred A. Hamprecht

    Abstract: Instance segmentation of overlapping objects in biomedical images remains a largely unsolved problem. We take up this challenge and present MultiStar, an extension to the popular instance segmentation method StarDist. The key novelty of our method is that we identify pixels at which objects overlap and use this information to improve proposal sampling and to avoid suppressing proposals of truly ov… ▽ More

    Submitted 14 January, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: Accepted for ISBI 2021

  7. arXiv:1911.02921  [pdf, other

    cs.DS cs.CV

    Probabilistic Watershed: Sampling all spanning forests for seeded segmentation and semi-supervised learning

    Authors: Enrique Fita Sanmartin, Sebastian Damrich, Fred A. Hamprecht

    Abstract: The seeded Watershed algorithm / minimax semi-supervised learning on a graph computes a minimum spanning forest which connects every pixel / unlabeled node to a seed / labeled node. We propose instead to consider all possible spanning forests and calculate, for every node, the probability of sampling a forest connecting a certain seed with that node. We dub this approach "Probabilistic Watershed".… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: To be published in NeurIPS2019