Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–4 of 4 results for author: Hrúz, M

.
  1. arXiv:2301.03769  [pdf, other

    cs.CV

    Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries

    Authors: Matyáš Boháček, Marek Hrúz

    Abstract: Today's sign language recognition models require large training corpora of laboratory-like videos, whose collection involves an extensive workforce and financial resources. As a result, only a handful of such systems are publicly available, not to mention their limited localization capabilities for less-populated sign languages. Utilizing online text-to-video dictionaries, which inherently hold an… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: 6 pages, 2 figures, IEEE Face & Gestures 2023

    ACM Class: I.2.10; J.5

  2. arXiv:2210.00893  [pdf, other

    cs.CV

    Combining Efficient and Precise Sign Language Recognition: Good pose estimation library is all you need

    Authors: Matyáš Boháček, Zhuo Cao, Marek Hrúz

    Abstract: Sign language recognition could significantly improve the user experience for d/Deaf people with the general consumer technology, such as IoT devices or videoconferencing. However, current sign language recognition architectures are usually computationally heavy and require robust GPU-equipped hardware to run in real-time. Some models aim for lower-end devices (such as smartphones) by minimizing t… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: 5 pages, 2 figures, CVPR 2022 AVA workshop extended abstract

    ACM Class: I.4.8; I.4.9

  3. arXiv:2003.13764  [pdf, other

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou , et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole… ▽ More

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  4. UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge

    Authors: Zbyněk Zajíc, Marie Kunešová, Marek Hrúz, Jan Vaněk

    Abstract: In this paper, we present our system developed by the team from the New Technologies for the Information Society (NTIS) research center of the University of West Bohemia in Pilsen, for the Second DIHARD Speech Diarization Challenge. The base of our system follows the currently-standard approach of segmentation, i/x-vector extraction, clustering, and resegmentation. The hyperparameters for each of… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: Submitted to Interspeech 2019

    Journal ref: INTERSPEECH 2019