Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–27 of 27 results for author: Sarfraz, M

.
  1. arXiv:2407.01872  [pdf, other

    cs.CV cs.RO eess.IV

    Referring Atomic Video Action Recognition

    Authors: Kunyu Peng, Jia Fu, Kailun Yang, Di Wen, Yufan Chen, Ruiping Liu, Junwei Zheng, Jiaming Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

    Abstract: We introduce a new task called Referring Atomic Video Action Recognition (RAVAR), aimed at identifying atomic actions of a particular person based on a textual description and the video data of this person. This task differs from traditional action recognition and localization, where predictions are delivered for all present individuals. In contrast, we focus on recognizing the correct atomic acti… ▽ More

    Submitted 10 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024. The dataset and code will be made publicly available at https://github.com/KPeng9510/RAVAR

  2. arXiv:2406.11438  [pdf, other

    hep-ex

    Search for Majorana Neutrinos with the Complete KamLAND-Zen Dataset

    Authors: S. Abe, T. Araki, K. Chiba, T. Eda, M. Eizuka, Y. Funahashi, A. Furuto, A. Gando, Y. Gando, S. Goto, T. Hachiya, K. Hata, K. Ichimura, S. Ieki, H. Ikeda, K. Inoue, K. Ishidoshiro, Y. Kamei, N. Kawada, Y. Kishimoto, M. Koga, A. Marthe, Y. Matsumoto, T. Mitsui, H. Miyake , et al. (48 additional authors not shown)

    Abstract: We present a search for neutrinoless double-beta ($0νββ$) decay of $^{136}$Xe using the full KamLAND-Zen 800 dataset with 745 kg of enriched xenon, corresponding to an exposure of $2.097$ ton yr of $^{136}$Xe. This updated search benefits from a more than twofold increase in exposure, recovery of photo-sensor gain, and reduced background from muon-induced spallation of xenon. Combining with the se… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.02139

  3. arXiv:2405.14497  [pdf, other

    cs.CV

    Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

    Authors: Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir, M. Saquib Sarfraz, Mohsen Ali

    Abstract: In this work, we tackle the problem of domain generalization for object detection, specifically focusing on the scenario where only a single source domain is available. We propose an effective approach that involves two key steps: diversifying the source domain and aligning detections based on class prediction confidence and localization. Firstly, we demonstrate that by carefully selecting a set o… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.13580  [pdf, other

    cs.CV cs.HC

    AltChart: Enhancing VLM-based Chart Summarization Through Multi-Pretext Tasks

    Authors: Omar Moured, Jiaming Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: Chart summarization is a crucial task for blind and visually impaired individuals as it is their primary means of accessing and interpreting graphical data. Crafting high-quality descriptions is challenging because it requires precise communication of essential details within the chart without vision perception. Many chart analysis methods, however, produce brief, unstructured responses that may c… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted in ICDAR 2024. Project page is at: https://github.com/moured/AltChart

  5. arXiv:2405.02678  [pdf, other

    cs.LG cs.AI cs.CV

    Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?

    Authors: M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer, Kunyu Peng, Marios Koulakis

    Abstract: The current state of machine learning scholarship in Timeseries Anomaly Detection (TAD) is plagued by the persistent use of flawed evaluation metrics, inconsistent benchmarking practices, and a lack of proper justification for the choices made in novel deep learning-based model designs. Our paper presents a critical analysis of the status quo in TAD, revealing the misleading track of current resea… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  6. arXiv:2401.16923  [pdf, other

    cs.CV cs.RO eess.IV

    Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

    Authors: Ruiping Liu, Jiaming Zhang, Kunyu Peng, Yufan Chen, Ke Cao, Junwei Zheng, M. Saquib Sarfraz, Kailun Yang, Rainer Stiefelhagen

    Abstract: Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompleteness in multi-modal segmentation remains under-explored. In this work, we establish a task called Modality-Incomplete Scene Segmentation (MISS), which encompasses both system-level… ▽ More

    Submitted 10 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to IEEE IV 2024. The source code is publicly available at https://github.com/RuipingL/MISS

  7. arXiv:2312.06330  [pdf, other

    cs.CV cs.AI cs.RO eess.IV

    Navigating Open Set Scenarios for Skeleton-based Action Recognition

    Authors: Kunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

    Abstract: In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones. However, using pure skeleton data in such open-set conditions poses challenges due to the lack of visual background cues and the distinct sparse structure of body pose sequences. In this paper, we tackle the unexplored Open-Se… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024. The benchmark, code, and models will be released at https://github.com/KPeng9510/OS-SAR

  8. arXiv:2311.04815  [pdf, other

    cs.CV

    Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: Deep learning based object detectors struggle generalizing to a new target domain bearing significant variations in object and background. Most current methods align domains by using image or instance-level adversarial feature alignment. This often suffers due to unwanted background and lacks class-specific alignment. A straightforward approach to promote class-level alignment is to use high confi… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 45, Issue: 12, December 2023); Extended version of our conference paper, arXiv link: arXiv:2110.00249

  9. arXiv:2305.08420  [pdf, other

    cs.CV cs.AI cs.RO eess.IV

    Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains

    Authors: Kunyu Peng, Di Wen, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

    Abstract: Domain adaptation is essential for activity recognition to ensure accurate and robust performance across diverse environments, sensor types, and data sources. Unsupervised domain adaptation methods have been extensively studied, yet, they require large-scale unlabeled data from the target domain. In this work, we focus on Few-Shot Domain Adaptation for Activity Recognition (FSDA-AR), which leverag… ▽ More

    Submitted 27 April, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: The benchmark and source code will be publicly available at https://github.com/KPeng9510/RelaMiX

  10. arXiv:2303.00952  [pdf, other

    cs.CV cs.RO eess.IV

    Towards Activated Muscle Group Estimation in the Wild

    Authors: Kunyu Peng, David Schneider, Alina Roitberg, Kailun Yang, Jiaming Zhang, Chen Deng, Kaiyu Zhang, M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: In this paper, we tackle the new task of video-based Activated Muscle Group Estimation (AMGE) aiming at identifying active muscle regions during physical activity in the wild. To this intent, we provide the MuscleMap dataset featuring >15K video clips with 135 different activities and 20 labeled muscle groups. This dataset opens the vistas to multiple video-based applications in sports and rehabil… ▽ More

    Submitted 5 August, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted to ACM MM 2024. The database and code can be found at https://github.com/KPeng9510/MuscleMap

  11. arXiv:2209.07601  [pdf, other

    cs.CV

    Towards Improving Calibration in Object Detection Under Domain Shift

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: With deep neural network based solution more readily being incorporated in real-world applications, it has been pressing requirement that predictions by such models, especially in safety-critical environments, be highly accurate and well-calibrated. Although some techniques addressing DNN calibration have been proposed, they are only limited to visual classification applications and in-domain pred… ▽ More

    Submitted 29 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: To appear in NeurIPS 2022

  12. Breaking with Fixed Set Pathology Recognition through Report-Guided Contrastive Training

    Authors: Constantin Seibold, Simon Reiß, M. Saquib Sarfraz, Rainer Stiefelhagen, Jens Kleesiek

    Abstract: When reading images, radiologists generate text reports describing the findings therein. Current state-of-the-art computer-aided diagnosis tools utilize a fixed set of predefined categories automatically extracted from these medical reports for training. This form of supervision limits the potential usage of models as they are unable to pick up on anomalies outside of their predefined set, thus, m… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Provisionally Accepted at MICCAI2022

  13. arXiv:2203.12997  [pdf, other

    cs.CV cs.AI cs.DS cs.GR

    Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction

    Authors: M. Saquib Sarfraz, Marios Koulakis, Constantin Seibold, Rainer Stiefelhagen

    Abstract: Dimensionality reduction is crucial both for visualization and preprocessing high dimensional data for machine learning. We introduce a novel method based on a hierarchy built on 1-nearest neighbor graphs in the original space which is used to preserve the grouping properties of the data distribution on multiple levels. The core of the proposal is an optimization-free projection that is competitiv… ▽ More

    Submitted 29 May, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  14. arXiv:2110.15741  [pdf, ps, other

    math.FA

    On new parameters concerning a generalization of the parallelogram law in Banach spaces

    Authors: Qi Liu, Zhijian Yang, Muhammad Sarfraz, Yongjin Li

    Abstract: We shall introduce a new geometric constant $L^{\prime}_{\mathrm{Y}}(λ,X)$ based on a generalization of the parallelogram law. We first investigate some basic properties of this new coefficient. Next, it is shown that, for a Banach space, $L^{\prime}_{\mathrm{Y}}(λ,X)$ becomes $1$ for some $λ_0\in (0,1)$ if and only if the norm is induced by an inner product. Moreover, some relations between other… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    MSC Class: 46B20

  15. arXiv:2110.00249  [pdf, other

    cs.CV

    Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: We study adapting trained object detectors to unseen domains manifesting significant variations of object appearance, viewpoints and backgrounds. Most current methods align domains by either using image or instance-level feature alignment in an adversarial fashion. This often suffers due to the presence of unwanted background and as such lacks class-specific alignment. A common remedy to promote c… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: To appear in NeurIPS2021

  16. arXiv:2103.11264  [pdf, other

    cs.CV cs.AI cs.LG

    Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation

    Authors: M. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen

    Abstract: Action segmentation refers to inferring boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks. For this and other video understanding tasks, supervised approaches have achieved encouraging performance but require a high volume of detailed frame-level annotations. We present a fully automatic and unsupervised approach for… ▽ More

    Submitted 27 March, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  17. arXiv:2101.03141  [pdf, other

    cs.LG cs.CR

    An Isolation Forest Learning Based Outlier Detection Approach for Effectively Classifying Cyber Anomalies

    Authors: Rony Chowdhury Ripan, Iqbal H. Sarker, Md Musfique Anwar, Md. Hasan Furhad, Fazle Rahat, Mohammed Moshiul Hoque, Muhammad Sarfraz

    Abstract: Cybersecurity has recently gained considerable interest in today's security issues because of the popularity of the Internet-of-Things (IoT), the considerable growth of mobile networks, and many related apps. Therefore, detecting numerous cyber-attacks in a network and creating an effective intrusion detection system plays a vital role in today's security. In this paper, we present an Isolation Fo… ▽ More

    Submitted 9 December, 2020; originally announced January 2021.

    Comments: 10 pages

  18. arXiv:2008.08418  [pdf

    cs.CV

    Anchor-free Small-scale Multispectral Pedestrian Detection

    Authors: Alexander Wolpert, Michael Teutsch, M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: Multispectral images consisting of aligned visual-optical (VIS) and thermal infrared (IR) image pairs are well-suited for practical applications like autonomous driving or visual surveillance. Such data can be used to increase the performance of pedestrian detection especially for weakly illuminated, small-scaled, or partially occluded instances. The current state-of-the-art is based on variants o… ▽ More

    Submitted 20 August, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: BMVC2020

  19. arXiv:2004.02195  [pdf, other

    cs.CV cs.LG

    Clustering based Contrastive Learning for Improving Face Representations

    Authors: Vivek Sharma, Makarand Tapaswi, M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: A good clustering algorithm can discover natural groupings in data. These groupings, if used wisely, provide a form of weak supervision for learning representations. In this work, we present Clustering-based Contrastive Learning (CCL), a new clustering-based representation learning approach that uses labels obtained from clustering along with video constraints to learn discriminative face features… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: To appear at IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2020

  20. arXiv:1908.00274  [pdf, other

    cs.CV cs.LG eess.IV

    Content and Colour Distillation for Learning Image Translations with the Spatial Profile Loss

    Authors: M. Saquib Sarfraz, Constantin Seibold, Haroon Khalid, Rainer Stiefelhagen

    Abstract: Generative adversarial networks has emerged as a defacto standard for image translation problems. To successfully drive such models, one has to rely on additional networks e.g., discriminators and/or perceptual networks. Training these networks with pixel based losses alone are generally not sufficient to learn the target distribution. In this paper, we propose a novel method of computing the loss… ▽ More

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: BMVC 2019

  21. arXiv:1903.01000  [pdf, other

    cs.CV cs.LG

    Self-Supervised Learning of Face Representations for Video Face Clustering

    Authors: Vivek Sharma, Makarand Tapaswi, M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: Analyzing the story behind TV series and movies often requires understanding who the characters are and what they are doing. With improving deep face models, this may seem like a solved problem. However, as face detectors get better, clustering/identification needs to be revisited to address increasing diversity in facial appearance. In this paper, we address video face clustering using unsupervis… ▽ More

    Submitted 3 March, 2019; originally announced March 2019.

    Comments: To appear at International Conference on Automatic Face and Gesture Recognition (2019) as an Oral. The datasets and code are available at https://github.com/vivoutlaw/SSIAM

  22. arXiv:1902.11266  [pdf, other

    cs.CV

    Efficient Parameter-free Clustering Using First Neighbor Relations

    Authors: M. Saquib Sarfraz, Vivek Sharma, Rainer Stiefelhagen

    Abstract: We present a new clustering method in the form of a single clustering equation that is able to directly discover groupings in the data. The main proposition is that the first neighbor of each sample is all one needs to discover large chains and finding the groups in the data. In contrast to most existing clustering algorithms our method does not require any hyper-parameters, distance thresholds an… ▽ More

    Submitted 28 February, 2019; originally announced February 2019.

    Comments: CVPR 2019

  23. A Multimodal Assistive System for Helping Visually Impaired in Social Interactions

    Authors: M. Saquib Sarfraz, Angela Constantinescu, Melanie Zuzej, Rainer Stiefelhagen

    Abstract: Access to non-verbal cues in social interactions is vital for people with visual impairment. It has been shown that non-verbal cues such as eye contact, number of people, their names and positions are helpful for individuals who are blind. While there is an increasing interest in developing systems to provide these cues less emphasis has been put in evaluating its impact on the visually impaired u… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Journal ref: Informatik Spectrum, Springer volume 40,No. 6. 2017

  24. arXiv:1711.10378  [pdf, other

    cs.CV

    A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking

    Authors: M. Saquib Sarfraz, Arne Schumann, Andreas Eberle, Rainer Stiefelhagen

    Abstract: Person re identification is a challenging retrieval task that requires matching a person's acquired image across non overlapping camera views. In this paper we propose an effective approach that incorporates both the fine and coarse pose information of the person to learn a discriminative embedding. In contrast to the recent direction of explicitly modeling body parts or correcting for misalignmen… ▽ More

    Submitted 2 April, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: CVPR 2018: v2 (fixes, added new results on PRW dataset)

  25. arXiv:1707.06089  [pdf, other

    cs.CV

    Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model

    Authors: M. Saquib Sarfraz, Arne Schumann, Yan Wang, Rainer Stiefelhagen

    Abstract: Pedestrian attribute inference is a demanding problem in visual surveillance that can facilitate person retrieval, search and indexing. To exploit semantic relations between attributes, recent research treats it as a multi-label image classification task. The visual cues hinting at attributes can be strongly localized and inference of person attributes such as hair, backpack, shorts, etc., are hig… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: accepted BMVC 2017

  26. Deep Perceptual Mapping for Cross-Modal Face Recognition

    Authors: M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: Cross modal face matching between the thermal and visible spectrum is a much desired capability for night-time surveillance and security applications. Due to a very large modality gap, thermal-to-visible face recognition is one of the most challenging face matching problem. In this paper, we present an approach to bridge this modality gap by a significant margin. Our approach captures the highly n… ▽ More

    Submitted 7 July, 2016; v1 submitted 20 January, 2016; originally announced January 2016.

    Comments: This is the extended version (invited IJCV submission) with new results of our previous submission (arXiv:1507.02879)

  27. arXiv:1507.02879  [pdf, other

    cs.CV

    Deep Perceptual Mapping for Thermal to Visible Face Recognition

    Authors: M. Saquib Sarfraz, Rainer Stiefelhagen

    Abstract: Cross modal face matching between the thermal and visible spectrum is a much de- sired capability for night-time surveillance and security applications. Due to a very large modality gap, thermal-to-visible face recognition is one of the most challenging face matching problem. In this paper, we present an approach to bridge this modality gap by a significant margin. Our approach captures the highly… ▽ More

    Submitted 10 July, 2015; originally announced July 2015.

    Comments: BMVC 2015 (oral)