
Showing 1–5 of 5 results for author: Shvets, M

Searching in archive cs.
  1. arXiv:2311.00134  [pdf, other]

    cs.CV

    Joint Depth Prediction and Semantic Segmentation with Multi-View SAM

    Authors: Mykhailo Shvets, Dongxu Zhao, Marc Niethammer, Roni Sengupta, Alexander C. Berg

    Abstract: Multi-task approaches to joint depth and segmentation prediction are well-studied for monocular images. Yet, predictions from a single-view are inherently limited, while multiple views are available in many robotics applications. On the other end of the spectrum, video-based and full 3D methods require numerous frames to perform reconstruction and segmentation. With this work we propose a Multi-Vi…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: To appear in the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision

  2. arXiv:1912.09390  [pdf, other]

    cs.CV

    Tangent Images for Mitigating Spherical Distortion

    Authors: Marc Eder, Mykhailo Shvets, John Lim, Jan-Michael Frahm

    Abstract: In this work, we propose "tangent images," a spherical image representation that facilitates transferable and scalable $360^\circ$ computer vision. Inspired by techniques in cartography and computer graphics, we render a spherical image to a set of distortion-mitigated, locally-planar image grids tangent to a subdivided icosahedron. By varying the resolution of these grids independently of the sub…

    Submitted 22 May, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: Updated version of CVPR 2020 publication (9 pages, 13 pages supplementary). Code: https://github.com/meder411/Tangent-Images

  3. arXiv:1905.13372  [pdf, other]

    cs.LG cs.AI q-bio.MN q-bio.QM stat.ML

    MolecularRNN: Generating realistic molecular graphs with optimized properties

    Authors: Mariya Popova, Mykhailo Shvets, Junier Oliva, Olexandr Isayev

    Abstract: Designing new molecules with a set of predefined properties is a core problem in modern drug discovery and development. There is a growing need for de-novo design methods that would address this problem. We present MolecularRNN, the graph recurrent generative model for molecular structures. Our model generates diverse realistic molecular graphs after likelihood pretraining on a big database of mol…

    Submitted 30 May, 2019; originally announced May 2019.

  4. arXiv:1901.03353  [pdf, other]

    cs.CV

    RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free

    Authors: Cheng-Yang Fu, Mykhailo Shvets, Alexander C. Berg

    Abstract: Recently two-stage detectors have surged ahead of single-shot detectors in the accuracy-vs-speed trade-off. Nevertheless single-shot detectors are immensely popular in embedded vision applications. This paper brings single-shot detectors up to the same level as current two-stage techniques. We do this by improving training for the state-of-the-art single-shot detector, RetinaNet, in three ways: in…

    Submitted 10 January, 2019; originally announced January 2019.

  5. arXiv:1803.04610  [pdf, other]

    cs.CV

    Target Driven Instance Detection

    Authors: Phil Ammirato, Cheng-Yang Fu, Mykhailo Shvets, Jana Kosecka, Alexander C. Berg

    Abstract: While state-of-the-art general object detectors are getting better and better, there are not many systems specifically designed to take advantage of the instance detection problem. For many applications, such as household robotics, a system may need to recognize a few very specific instances at a time. Speed can be critical in these applications, as can the need to recognize previously unseen inst…

    Submitted 1 October, 2019; v1 submitted 12 March, 2018; originally announced March 2018.