Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–42 of 42 results for author: Javed, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18970  [pdf, other

    cs.CV

    Region Guided Attention Network for Retinal Vessel Segmentation

    Authors: Syed Javed, Tariq M. Khan, Abdul Qayyum, Arcot Sowmya, Imran Razzak

    Abstract: Retinal imaging has emerged as a promising method of addressing this challenge, taking advantage of the unique structure of the retina. The retina is an embryonic extension of the central nervous system, providing a direct in vivo window into neurological health. Recent studies have shown that specific structural changes in retinal vessels can not only serve as early indicators of various diseases… ▽ More

    Submitted 20 August, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

  2. arXiv:2407.15707  [pdf, other

    cs.CV cs.AI eess.IV

    Predicting the Best of N Visual Trackers

    Authors: Basit Alawode, Sajid Javed, Arif Mahmood, Jiri Matas

    Abstract: We observe that the performance of SOTA visual trackers surprisingly strongly varies across different video attributes and datasets. No single tracker remains the best performer across all tracking attributes and datasets. To bridge this gap, for a given video sequence, we predict the "Best of the N Trackers", called the BofN meta-tracker. At its core, a Tracking Performance Prediction Network (TP… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  3. arXiv:2406.05205  [pdf, other

    cs.CV cs.CL cs.LG cs.MM eess.IV

    CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

    Authors: Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun

    Abstract: This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2405.19387  [pdf, other

    cs.CV

    Video Anomaly Detection in 10 Years: A Survey and Outlook

    Authors: Moshira Abdalla, Sajid Javed, Muaz Al Radi, Anwaar Ulhaq, Naoufel Werghi

    Abstract: Video anomaly detection (VAD) holds immense importance across diverse domains such as surveillance, healthcare, and environmental monitoring. While numerous surveys focus on conventional VAD methods, they often lack depth in exploring specific approaches and emerging trends. This survey explores deep learning-based VAD, expanding beyond traditional supervised training paradigms to encompass emergi… ▽ More

    Submitted 30 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.17520  [pdf, other

    eess.IV cs.CV

    Advancing Medical Image Segmentation with Mini-Net: A Lightweight Solution Tailored for Efficient Segmentation of Medical Images

    Authors: Syed Javed, Tariq M. Khan, Abdul Qayyum, Arcot Sowmya, Imran Razzak

    Abstract: Accurate segmentation of anatomical structures and abnormalities in medical images is crucial for computer-aided diagnosis and analysis. While deep learning techniques excel at this task, their computational demands pose challenges. Additionally, some cutting-edge segmentation methods, though effective for general object segmentation, may not be optimised for medical images. To address these issue… ▽ More

    Submitted 12 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.07905  [pdf, other

    eess.IV cs.CV

    PLUTO: Pathology-Universal Transformer

    Authors: Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi , et al. (8 additional authors not shown)

    Abstract: Pathology is the study of microscopic inspection of tissue, and a pathology diagnosis is often the medical gold standard to diagnose disease. Pathology images provide a unique challenge for computer-vision-based analysis: a single pathology Whole Slide Image (WSI) is gigapixel-sized and often contains hundreds of thousands to millions of objects of interest across multiple resolutions. In this wor… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  7. arXiv:2404.10940  [pdf, other

    cs.CV

    Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network

    Authors: Yusra Alkendi, Rana Azzam, Sajid Javed, Lakmal Seneviratne, Yahya Zweiri

    Abstract: Moving object segmentation is critical to interpret scene dynamics for robotic navigation systems in challenging environments. Neuromorphic vision sensors are tailored for motion perception due to their asynchronous nature, high temporal resolution, and reduced power consumption. However, their unconventional output requires novel perception paradigms to leverage their spatially sparse and tempora… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  8. arXiv:2401.01180  [pdf, other

    cs.CV cs.AI eess.IV

    Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery

    Authors: Asim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javed

    Abstract: Deforestation, a major contributor to climate change, poses detrimental consequences such as agricultural sector disruption, global warming, flash floods, and landslides. Conventional approaches to urban street tree inventory suffer from inaccuracies and necessitate specialised equipment. To overcome these challenges, this paper proposes an innovative method that leverages deep learning techniques… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 8 Pages, 7 figures and 5 Tables

  9. arXiv:2311.18488  [pdf, other

    cs.IT

    Low-Complexity Linear Programming Based Decoding of Quantum LDPC codes

    Authors: Sana Javed, Francisco Garcia-Herrero, Bane Vasic, Mark F. Flanagan

    Abstract: This paper proposes two approaches for reducing the impact of the error floor phenomenon when decoding quantum low-density parity-check codes with belief propagation based algorithms. First, a low-complexity syndrome-based linear programming (SB-LP) decoding algorithm is proposed, and second, the proposed SB-LP is applied as a post-processing step after syndrome-based min-sum (SB-MS) decoding. For… ▽ More

    Submitted 19 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted for publication at the IEEE International Conference on Communications (ICC) 2024

  10. arXiv:2311.10651  [pdf

    cs.CV

    3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning

    Authors: Iyyakutti Iyappan Ganapathi, Fayaz Ali, Sajid Javed, Syed Sadaf Ali, Naoufel Werghi

    Abstract: Analysis of the 3D Texture is indispensable for various tasks, such as retrieval, segmentation, classification, and inspection of sculptures, knitted fabrics, and biological tissues. A 3D texture is a locally repeated surface variation independent of the surface's overall shape and can be determined using the local neighborhood and its characteristics. Existing techniques typically employ computer… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: This paper is accepted in 3DV-2024

  11. arXiv:2309.15576  [pdf, other

    cs.CV

    Learning Spatial-Temporal Regularized Tensor Sparse RPCA for Background Subtraction

    Authors: Basit Alawode, Sajid Javed

    Abstract: Video background subtraction is one of the fundamental problems in computer vision that aims to segment all moving objects. Robust principal component analysis has been identified as a promising unsupervised paradigm for background subtraction tasks in the last decade thanks to its competitive performance in a number of benchmark datasets. Tensor robust principal component analysis variations have… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Under review

  12. arXiv:2308.15816  [pdf, other

    cs.CV

    Improving Underwater Visual Tracking With a Large Scale Dataset and Image Enhancement

    Authors: Basit Alawode, Fayaz Ali Dharejo, Mehnaz Ummar, Yuhang Guo, Arif Mahmood, Naoufel Werghi, Fahad Shahbaz Khan, Jiri Matas, Sajid Javed

    Abstract: This paper presents a new dataset and general tracker enhancement method for Underwater Visual Object Tracking (UVOT). Despite its significance, underwater tracking has remained unexplored due to data inaccessibility. It poses distinct challenges; the underwater environment exhibits non-uniform lighting conditions, low visibility, lack of sharpness, low contrast, camouflage, and reflections from s… ▽ More

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  13. arXiv:2308.04168  [pdf, other

    cs.CV

    EFaR 2023: Efficient Face Recognition Competition

    Authors: Jan Niklas Kolf, Fadi Boutros, Jurek Elliesen, Markus Theuerkauf, Naser Damer, Mohamad Alansari, Oussama Abdul Hay, Sara Alansari, Sajid Javed, Naoufel Werghi, Klemen Grm, Vitomir Štruc, Fernando Alonso-Fernandez, Kevin Hernandez Diaz, Josef Bigun, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal, Sébastien Marcel, Iurii Medvedev, Bo Jin, Diogo Nunes, Ahmad Hassanpour, Pankaj Khatiwada , et al. (2 additional authors not shown)

    Abstract: This paper presents the summary of the Efficient Face Recognition Competition (EFaR) held at the 2023 International Joint Conference on Biometrics (IJCB 2023). The competition received 17 submissions from 6 different teams. To drive further development of efficient face recognition models, the submitted solutions are ranked based on a weighted score of the achieved verification accuracies on a div… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted at IJCB 2023

  14. arXiv:2305.02032  [pdf, other

    cs.CV cs.LG

    Unsupervised Mutual Transformer Learning for Multi-Gigapixel Whole Slide Image Classification

    Authors: Sajid Javed, Arif Mahmood, Talha Qaiser, Naoufel Werghi, Nasir Rajpoot

    Abstract: Classification of gigapixel Whole Slide Images (WSIs) is an important prediction task in the emerging area of computational pathology. There has been a surge of research in deep learning models for WSI classification with clinical applications such as cancer detection or prediction of molecular mutations from WSIs. Most methods require expensive and labor-intensive manual annotations by expert pat… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  15. arXiv:2303.13405  [pdf, other

    cs.CV cs.LG

    SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology

    Authors: Dinkar Juyal, Siddhant Shingi, Syed Ashar Javed, Harshith Padigela, Chintan Shah, Anand Sampat, Archit Khosla, John Abel, Amaro Taylor-Weiner

    Abstract: Multiple Instance learning (MIL) models have been extensively used in pathology to predict biomarkers and risk-stratify patients from gigapixel-sized images. Machine learning problems in medical imaging often deal with rare diseases, making it important for these models to work in a label-imbalanced setting. In pathology images, there is another level of imbalance, where given a positively labeled… ▽ More

    Submitted 9 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  16. arXiv:2303.06753  [pdf, other

    cs.CV cs.LG cs.RO

    Modular Quantization-Aware Training: Increasing Accuracy by Decreasing Precision in 6D Object Pose Estimation

    Authors: Saqib Javed, Chengkun Li, Andrew Price, Yinlin Hu, Mathieu Salzmann

    Abstract: Edge applications, such as collaborative robotics and spacecraft rendezvous, demand efficient 6D object pose estimation on resource-constrained embedded platforms. Existing 6D pose estimation networks are often too large for such deployments, necessitating compression while maintaining reliable performance. To address this challenge, we introduce Modular Quantization-Aware Training (MQAT), an adap… ▽ More

    Submitted 28 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

  17. arXiv:2302.14807  [pdf, other

    cs.CV cs.RO

    DFR-FastMOT: Detection Failure Resistant Tracker for Fast Multi-Object Tracking Based on Sensor Fusion

    Authors: Mohamed Nagy, Majid Khonji, Jorge Dias, Sajid Javed

    Abstract: Persistent multi-object tracking (MOT) allows autonomous vehicles to navigate safely in highly dynamic environments. One of the well-known challenges in MOT is object occlusion when an object becomes unobservant for subsequent frames. The current MOT methods store objects information, like objects' trajectory, in internal memory to recover the objects after occlusions. However, they retain short-t… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: \c{opyright} 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  18. arXiv:2302.10505  [pdf, other

    cs.LG eess.SP

    Higher-order Sparse Convolutions in Graph Neural Networks

    Authors: Jhony H. Giraldo, Sajid Javed, Arif Mahmood, Fragkiskos D. Malliaros, Thierry Bouwmans

    Abstract: Graph Neural Networks (GNNs) have been applied to many problems in computer sciences. Capturing higher-order relationships between nodes is crucial to increase the expressive power of GNNs. However, existing methods to capture these relationships could be infeasible for large-scale graphs. In this work, we introduce a new higher-order sparse convolution based on the Sobolev norm of graph signals.… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  19. arXiv:2209.01274  [pdf

    cs.CV

    Person Monitoring by Full Body Tracking in Uniform Crowd Environment

    Authors: Zhibo Zhang, Omar Alremeithi, Maryam Almheiri, Marwa Albeshr, Xiaoxiong Zhang, Sajid Javed, Naoufel Werghi

    Abstract: Full body trackers are utilized for surveillance and security purposes, such as person-tracking robots. In the Middle East, uniform crowd environments are the norm which challenges state-of-the-art trackers. Despite tremendous improvements in tracker technology documented in the past literature, these trackers have not been trained using a dataset that captures these environments. In this work, we… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted by the conference International Conference on Advances in Data-driven Computing and Intelligent Systems (ADCIS 2022), published in Scopus indexed Springer Book Series, 'Lecture Notes in Networks and Systems'

  20. arXiv:2208.10238  [pdf, other

    cs.CV

    Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen an increased interest in establishing association between faces and voices of celebrities leveraging audio-visual information from YouTube. Prior works adopt metric learning methods to learn an embedding space that is amenable for associated matching and verification tasks. Albeit showing some progress, such formulations are, however, restrictive due to dependency on distanc… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Submitted: IEEE Transactions on Multimedia. arXiv admin note: substantial text overlap with arXiv:2112.10483

  21. Graph CNN for Moving Object Detection in Complex Environments from Unseen Videos

    Authors: Jhony H. Giraldo, Sajid Javed, Naoufel Werghi, Thierry Bouwmans

    Abstract: Moving Object Detection (MOD) is a fundamental step for many computer vision applications. MOD becomes very challenging when a video sequence captured from a static or moving camera suffers from the challenges: camouflage, shadow, dynamic backgrounds, and lighting variations, to name a few. Deep learning methods have been successfully applied to address MOD with competitive performance. However, i… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021, pp. 225-233

  22. arXiv:2206.01794  [pdf, other

    cs.CV cs.LG

    Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology

    Authors: Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya Prakash

    Abstract: Multiple Instance Learning (MIL) has been widely applied in pathology towards solving critical problems such as automating cancer diagnosis and grading, predicting patient prognosis, and therapy response. Deploying these models in a clinical setting requires careful inspection of these black boxes during development and deployment to identify failures and maintain physician trust. In this work, we… ▽ More

    Submitted 16 October, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  23. arXiv:2205.10553  [pdf, other

    cs.CV cs.RO

    Robot Person Following in Uniform Crowd Environment

    Authors: Adarsh Ghimire, Xiaoxiong Zhang, Sajid Javed, Jorge Dias, Naoufel Werghi

    Abstract: Person-tracking robots have many applications, such as in security, elderly care, and socializing robots. Such a task is particularly challenging when the person is moving in a Uniform crowd. Also, despite significant progress of trackers reported in the literature, state-of-the-art trackers have hardly addressed person following in such scenarios. In this work, we focus on improving the perceptiv… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Journal ref: ICRA Workshop 2022: ROBOTIC PERCEPTION AND MAPPING: EMERGING TECHNIQUES

  24. arXiv:2205.04213  [pdf, other

    cs.RO

    Deep learning framework for robot for person detection and tracking

    Authors: Adarsh Ghimire, Xiaoxiong Zhang, Naoufel Werghi, Sajid Javed, Jorge Dias

    Abstract: Robustly tracking a person of interest in the crowd with a robotic platform is one of the cornerstones of human-robot interaction. The robot platform which is limited by the computational power, rapid movements, and occlusions of the target requires an efficient and robust framework to perform tracking. This paper proposes a deep learning framework for tracking a person using a mobile robot with a… ▽ More

    Submitted 19 April, 2022; originally announced May 2022.

    Comments: Presented Conference Paper

    Journal ref: Graduate Students Research Conference 2021

  25. arXiv:2204.08978  [pdf, other

    cs.CV

    Real-Time Face Recognition System

    Authors: Adarsh Ghimire, Naoufel Werghi, Sajid Javed, Jorge Dias

    Abstract: Over the past few decades, interest in algorithms for face recognition has been growing rapidly and has even surpassed human-level performance. Despite their accomplishments, their practical integration with a real-time performance-hungry system is not feasible due to high computational costs. So in this paper, we explore the recent, fast, and accurate face recognition system that can be easily in… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Poster

    Journal ref: Graduate Students Research Conference 2022

  26. arXiv:2204.05205  [pdf, other

    eess.IV cs.CV cs.LG

    Rethinking Machine Learning Model Evaluation in Pathology

    Authors: Syed Ashar Javed, Dinkar Juyal, Zahil Shanis, Shreya Chakraborty, Harsha Pokkalla, Aaditya Prakash

    Abstract: Machine Learning has been applied to pathology images in research and clinical practice with promising outcomes. However, standard ML models often lack the rigorous evaluation required for clinical decisions. Machine learning techniques for natural images are ill-equipped to deal with pathology images that are significantly large and noisy, require expensive labeling, are hard to interpret, and ar… ▽ More

    Submitted 18 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: ICLR 2022 ML Evaluation Workshop

  27. arXiv:2204.04199  [pdf, other

    eess.IV cs.CV

    Underwater Image Enhancement Using Pre-trained Transformer

    Authors: Abderrahmene Boudiaf, Yuhang Guo, Adarsh Ghimire, Naoufel Werghi, Giulia De Masi, Sajid Javed, Jorge Dias

    Abstract: The goal of this work is to apply a denoising image transformer to remove the distortion from underwater images and compare it with other similar approaches. Automatic restoration of underwater images plays an important role since it allows to increase the quality of the images, without the need for more expensive equipment. This is a critical example of the important role of the machine learning… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  28. Neuromorphic Camera Denoising using Graph Neural Network-driven Transformers

    Authors: Yusra Alkendi, Rana Azzam, Abdulla Ayyad, Sajid Javed, Lakmal Seneviratne, Yahya Zweiri

    Abstract: Neuromorphic vision is a bio-inspired technology that has triggered a paradigm shift in the computer-vision community and is serving as a key-enabler for a multitude of applications. This technology has offered significant advantages including reduced power consumption, reduced processing needs, and communication speed-ups. However, neuromorphic cameras suffer from significant amounts of measureme… ▽ More

    Submitted 4 July, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  29. arXiv:2112.02838  [pdf, other

    cs.CV

    Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

    Authors: Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas

    Abstract: Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems. It entails estimating the trajectory of the target in an image sequence, given only its initial location, and segmentation, or its rough approximation in the form of a bounding box. Discriminative Correlation Filters (DCFs) and deep Siamese Networks (SNs) have emerged as dominating t… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Tracking Survey

  30. arXiv:2111.13656  [pdf, other

    cs.CV

    Towards Low-Cost and Efficient Malaria Detection

    Authors: Waqas Sultani, Wajahat Nawaz, Syed Javed, Muhammad Sohail Danish, Asma Saadia, Mohsen Ali

    Abstract: Malaria, a fatal but curable disease claims hundreds of thousands of lives every year. Early and correct diagnosis is vital to avoid health complexities, however, it depends upon the availability of costly microscopes and trained experts to analyze blood-smear slides. Deep learning-based methods have the potential to not only decrease the burden of experts but also improve diagnostic accuracy on l… ▽ More

    Submitted 16 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  31. arXiv:1912.05636  [pdf, ps, other

    cs.CV cs.LG cs.MM

    CineFilter: Unsupervised Filtering for Real Time Autonomous Camera Systems

    Authors: Sudheer Achary, K L Bhanu Moorthy, Syed Ashar Javed, Nikita Shravan, Vineet Gandhi, Anoop Namboodiri

    Abstract: Autonomous camera systems are often subjected to an optimization/filtering operation to smoothen and stabilize the rough trajectory estimates. Most common filtering techniques do reduce the irregularities in data; however, they fail to mimic the behavior of a human cameraman. Global filtering methods modeling human camera operators have been successful; however, they are limited to offline setting… ▽ More

    Submitted 27 May, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  32. arXiv:1910.01210  [pdf, other

    cs.CV cs.LG cs.RO

    Embodied Language Grounding with 3D Visual Feature Representations

    Authors: Mihir Prabhudesai, Hsiao-Yu Fish Tung, Syed Ashar Javed, Maximilian Sieb, Adam W. Harley, Katerina Fragkiadaki

    Abstract: We propose associating language utterances to 3D visual abstractions of the scene they describe. The 3D visual abstractions are encoded as 3-dimensional visual feature maps. We infer these 3D visual scene feature maps from RGB images of the scene via view prediction: when the generated 3D scene feature map is neurally projected from a camera viewpoint, it should match the corresponding RGB image.… ▽ More

    Submitted 17 June, 2021; v1 submitted 2 October, 2019; originally announced October 2019.

    Journal ref: Conference on Computer Vision and Pattern Recognition. 2020, pp. 2220-2229

  33. arXiv:1812.07368  [pdf, other

    cs.CV

    Handcrafted and Deep Trackers: Recent Visual Object Tracking Approaches and Trends

    Authors: Mustansar Fiaz, Arif Mahmood, Sajid Javed, Soon Ki Jung

    Abstract: In recent years visual object tracking has become a very active research area. An increasing number of tracking algorithms are being proposed each year. It is because tracking has wide applications in various real world problems such as human-computer interaction, autonomous vehicles, robotics, surveillance and security just to name a few. In the current study, we review latest trends and advances… ▽ More

    Submitted 11 February, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: 27pages, 26 figures. arXiv admin note: substantial text overlap with arXiv:1802.03098

  34. arXiv:1811.05255  [pdf, ps, other

    cs.CV

    Deep Neural Network Concepts for Background Subtraction: A Systematic Review and Comparative Evaluation

    Authors: Thierry Bouwmans, Sajid Javed, Maryam Sultana, Soon Ki Jung

    Abstract: Conventional neural networks show a powerful framework for background subtraction in video acquired by static cameras. Indeed, the well-known SOBS method and its variants based on neural networks were the leader methods on the largescale CDnet 2012 dataset during a long time. Recently, convolutional neural networks which belong to deep learning methods were employed with success for background ini… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: 46 pages, 4 figures, submitted to neural networks

  35. arXiv:1811.01526  [pdf, other

    cs.CV

    Unsupervised RGBD Video Object Segmentation Using GANs

    Authors: Maryam Sultana, Arif Mahmood, Sajid Javed, Soon Ki Jung

    Abstract: Video object segmentation is a fundamental step in many advanced vision applications. Most existing algorithms are based on handcrafted features such as HOG, super-pixel segmentation or texture-based techniques, while recently deep features have been found to be more efficient. Existing algorithms observe performance degradation in the presence of challenges such as illumination variations, shadow… ▽ More

    Submitted 5 November, 2018; originally announced November 2018.

    Comments: 15 pages, 3 figures, ACCV workshop on RGB-D-sensing and understanding via combined colour and depth

  36. arXiv:1805.07903  [pdf, other

    cs.CV

    Unsupervised Deep Context Prediction for Background Foreground Separation

    Authors: Maryam Sultana, Arif Mahmood, Sajid Javed, Soon Ki Jung

    Abstract: In many advanced video based applications background modeling is a pre-processing step to eliminate redundant data, for instance in tracking or video surveillance applications. Over the past years background subtraction is usually based on low level or hand-crafted features such as raw color components, gradients, or local binary patterns. The background subtraction algorithms performance suffer i… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: 17 pages

    Journal ref: Machine Vision and Applications 2018

  37. arXiv:1803.06508  [pdf, other

    cs.CV cs.RO

    MergeNet: A Deep Net Architecture for Small Obstacle Discovery

    Authors: Krishnam Gupta, Syed Ashar Javed, Vineet Gandhi, K. Madhava Krishna

    Abstract: We present here, a novel network architecture called MergeNet for discovering small obstacles for on-road scenes in the context of autonomous driving. The basis of the architecture rests on the central consideration of training with less amount of data since the physical setup and the annotation process for small obstacles is hard to scale. For making effective use of the limited data, we propose… ▽ More

    Submitted 17 March, 2018; originally announced March 2018.

  38. arXiv:1803.06506  [pdf, other

    cs.CV

    Learning Unsupervised Visual Grounding Through Semantic Self-Supervision

    Authors: Syed Ashar Javed, Shreyas Saxena, Vineet Gandhi

    Abstract: Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, lack of supervisory signals exacerbate this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The simple intuit… ▽ More

    Submitted 16 November, 2018; v1 submitted 17 March, 2018; originally announced March 2018.

    Comments: NIPS Workshop 2018

  39. arXiv:1801.09360  [pdf

    cs.CV

    Comparative Study of ECO and CFNet Trackers in Noisy Environment

    Authors: Mustansar Fiaz, Sajid Javed, Arif Mahmood, Soon Ki Jung

    Abstract: Object tracking is one of the most challenging task and has secured significant attention of computer vision researchers in the past two decades. Recent deep learning based trackers have shown good performance on various tracking challenges. A tracking method should track objects in sequential frames accurately in challenges such as deformation, low resolution, occlusion, scale and light variation… ▽ More

    Submitted 28 January, 2018; originally announced January 2018.

    Comments: 4 pages, 5 figures

  40. arXiv:1711.09492  [pdf, other

    cs.IT cs.CV stat.ME stat.ML

    Robust Subspace Learning: Robust PCA, Robust Subspace Tracking, and Robust Subspace Recovery

    Authors: Namrata Vaswani, Thierry Bouwmans, Sajid Javed, Praneeth Narayanamurthy

    Abstract: PCA is one of the most widely used dimension reduction techniques. A related easier problem is "subspace learning" or "subspace estimation". Given relatively clean data, both are easily solved via singular value decomposition (SVD). The problem of subspace learning or PCA in the presence of outliers is called robust subspace learning or robust PCA (RPCA). For long data sequences, if one tries to u… ▽ More

    Submitted 5 July, 2018; v1 submitted 26 November, 2017; originally announced November 2017.

    Comments: To appear, IEEE Signal Processing Magazine, July 2018

    Journal ref: IEEE Signal Processing Magazine (Volume: 35, Issue: 4, July 2018)

  41. arXiv:1705.04358  [pdf, other

    cs.CV

    Object-Level Context Modeling For Scene Classification with Context-CNN

    Authors: Syed Ashar Javed, Anil Kumar Nelakanti

    Abstract: Convolutional Neural Networks (CNNs) have been used extensively for computer vision tasks and produce rich feature representation for objects or parts of an image. But reasoning about scenes requires integration between the low-level feature representations and the high-level semantic information. We propose a deep network architecture which models the semantic context of scenes by capturing objec… ▽ More

    Submitted 2 June, 2017; v1 submitted 11 May, 2017; originally announced May 2017.

    Comments: Scene Understanding workshop (SUNw), CVPR 2017

  42. Decomposition into Low-rank plus Additive Matrices for Background/Foreground Separation: A Review for a Comparative Evaluation with a Large-Scale Dataset

    Authors: Thierry Bouwmans, Andrews Sobral, Sajid Javed, Soon Ki Jung, El-Hadi Zahzah

    Abstract: Recent research on problem formulations based on decomposition into low-rank plus sparse matrices shows a suitable framework to separate moving objects from the background. The most representative problem formulation is the Robust Principal Component Analysis (RPCA) solved via Principal Component Pursuit (PCP) which decomposes a data matrix in a low-rank matrix and a sparse matrix. However, simila… ▽ More

    Submitted 28 November, 2016; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: 121 pages, 5 figures, submitted to Computer Science Review. arXiv admin note: text overlap with arXiv:1312.7167, arXiv:1109.6297, arXiv:1207.3438, arXiv:1105.2126, arXiv:1404.7592, arXiv:1210.0805, arXiv:1403.8067 by other authors, Computer Science Review, November 2016