Showing 1–50 of 374 results for author: Sridharan, S

  1. arXiv:2405.13264  [pdf, other]

    cs.LG cs.AI cs.CV

    Part-based Quantitative Analysis for Heatmaps

    Authors: Osman Tursun, Sinan Kalkan, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Heatmaps have been instrumental in helping understand deep network decisions, and are a common approach for Explainable AI (XAI). While significant progress has been made in enhancing the informativeness and accessibility of heatmaps, heatmap analysis is typically very subjective and limited to domain experts. As such, developing automatic, scalable, and numerical analysis methods to make heatmap-…

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2403.15717  [pdf, other]

    cs.LG cs.CV cs.DC cs.RO

    Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms

    Authors: Shrihari Sridharan, Surya Selvam, Kaushik Roy, Anand Raghunathan

    Abstract: Event cameras have emerged as a promising sensing modality for autonomous navigation systems, owing to their high temporal resolution, high dynamic range and negligible motion blur. To process the asynchronous temporal event streams from such sensors, recent research has shown that a mix of Artificial Neural Networks (ANNs), Spiking Neural Networks (SNNs) as well as hybrid SNN-ANN algorithms are n…

    Submitted 23 March, 2024; originally announced March 2024.

  3. arXiv:2401.02634  [pdf, other]

    cs.CV

    AG-ReID.v2: Bridging Aerial and Ground Views for Person Re-identification

    Authors: Huy Nguyen, Kien Nguyen, Sridha Sridharan, Clinton Fookes

    Abstract: Aerial-ground person re-identification (Re-ID) presents unique challenges in computer vision, stemming from the distinct differences in viewpoints, poses, and resolutions between high-altitude aerial and ground-based cameras. Existing research predominantly focuses on ground-to-ground matching, with aerial matching less explored due to a dearth of comprehensive datasets. To address this, we introd…

    Submitted 7 April, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: 13 pages, Accepted by TIFS 2023

  4. arXiv:2312.15364  [pdf, other]

    cs.RO cs.CV

    WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments

    Authors: Kavisha Vidanapathirana, Joshua Knights, Stephen Hausler, Mark Cox, Milad Ramezani, Jason Jooste, Ethan Griffiths, Shaheer Mohamed, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

    Abstract: Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and lidar) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural…

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: Under review. The first 3 authors contributed equally

  5. arXiv:2309.12631  [pdf, other]

    quant-ph

    Learning the eigenstructure of quantum dynamics using classical shadows

    Authors: Atithi Acharya, Siddhartha Saha, Shagesh Sridharan, Yanis Bahroun, Anirvan M. Sengupta

    Abstract: Learning dynamics from repeated observation of the time evolution of an open quantum system, namely, the problem of quantum process tomography, is an important task. This task is difficult in general but, with some additional constraints, could be tractable. This motivates us to look at the problem of Lindblad operator discovery from observations. We point out that for moderate-size Hilbert spaces,…

    Submitted 22 September, 2023; originally announced September 2023.

  6. arXiv:2309.09431  [pdf, other]

    cs.CV cs.AI

    FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pretraining

    Authors: Shaheer Mohamed, Maryam Haghighat, Tharindu Fernando, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

    Abstract: Hyperspectral images (HSIs) contain rich spectral and spatial information. Motivated by the success of transformers in the field of natural language processing and computer vision where they have shown the ability to learn long range dependencies within input data, recent research has focused on using transformers for HSIs. However, current state-of-the-art hyperspectral transformers only tokenize…

    Submitted 3 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE Transactions on Geoscience and Remote Sensing in December 2023

  7. arXiv:2308.08731  [pdf, other]

    cs.CV

    Learning Through Guidance: Knowledge Distillation for Endoscopic Image Classification

    Authors: Harshala Gammulle, Yubo Chen, Sridha Sridharan, Travis Klein, Clinton Fookes

    Abstract: Endoscopy plays a major role in identifying any underlying abnormalities within the gastrointestinal (GI) tract. There are multiple GI tract diseases that are life-threatening, such as precancerous lesions and other intestinal cancers. In the usual process, a diagnosis is made by a medical expert which can be prone to human errors and the accuracy of the test is also entirely dependent on the expe…

    Submitted 16 August, 2023; originally announced August 2023.

  8. arXiv:2308.04638  [pdf, other]

    cs.CV

    GeoAdapt: Self-Supervised Test-Time Adaptation in LiDAR Place Recognition Using Geometric Priors

    Authors: Joshua Knights, Stephen Hausler, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

    Abstract: LiDAR place recognition approaches based on deep learning suffer from significant performance degradation when there is a shift between the distribution of training and test datasets, often requiring re-training the networks to achieve peak performance. However, obtaining accurate ground truth data for new training data can be prohibitively expensive, especially in complex or GPS-deprived environm…

    Submitted 28 November, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted to IEEE Robotics and Automation Letters (RA-L) November 2023

  9. arXiv:2308.02427  [pdf, other]

    cs.NE cs.AI cs.LG q-bio.NC

    Unlocking the Potential of Similarity Matching: Scalability, Supervision and Pre-training

    Authors: Yanis Bahroun, Shagesh Sridharan, Atithi Acharya, Dmitri B. Chklovskii, Anirvan M. Sengupta

    Abstract: While effective, the backpropagation (BP) algorithm exhibits limitations in terms of biological plausibility, computational cost, and suitability for online learning. As a result, there has been a growing interest in developing alternative biologically plausible learning approaches that rely on local learning rules. This study focuses on the primarily unsupervised similarity matching (SM) framewor…

    Submitted 2 August, 2023; originally announced August 2023.

  10. arXiv:2307.03388  [pdf, other]

    cs.CV

    General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

    Authors: Nhi Kieu, Kien Nguyen, Sridha Sridharan, Clinton Fookes

    Abstract: The advent of high-resolution multispectral/hyperspectral sensors, LiDAR DSM (Digital Surface Model) information and many others has provided us with an unprecedented wealth of data for Earth Observation. Multimodal AI seeks to exploit those complementary data sources, particularly for complex tasks like semantic segmentation. While specialized architectures have been developed, they are highly co…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted to CVPR Workshop on Multimodal Learning for Earth and Environment 2023

  11. arXiv:2305.14516  [pdf, other]

    cs.LG cs.DC

    Chakra: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces

    Authors: Srinivas Sridharan, Taekyung Heo, Louis Feng, Zhaodong Wang, Matt Bergeron, Wenyin Fu, Shengbao Zheng, Brian Coutinho, Saeed Rashidi, Changhai Man, Tushar Krishna

    Abstract: Benchmarking and co-design are essential for driving optimizations and innovation around ML models, ML software, and next-generation hardware. Full workload benchmarks, e.g. MLPerf, play an essential role in enabling fair comparison across different software and hardware stacks especially once systems are fully designed and deployed. However, the pace of AI innovation demands a more agile methodol…

    Submitted 26 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  12. arXiv:2305.11394  [pdf, other]

    cs.CV

    Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction

    Authors: Tharindu Fernando, Harshala Gammulle, Sridha Sridharan, Simon Denman, Clinton Fookes

    Abstract: Humans exhibit complex motions that vary depending on the task that they are performing, the interactions they engage in, as well as subject-specific preferences. Therefore, forecasting future poses based on the history of the previous motions is a challenging task. This paper presents an innovative auxiliary-memory-powered deep neural network framework for the improved modelling of historical kno…

    Submitted 18 May, 2023; originally announced May 2023.

  13. Physical Adversarial Attacks for Surveillance: A Survey

    Authors: Kien Nguyen, Tharindu Fernando, Clinton Fookes, Sridha Sridharan

    Abstract: Modern automated surveillance techniques are heavily reliant on deep learning methods. Despite the superior performance, these learning systems are inherently vulnerable to adversarial attacks - maliciously crafted inputs that are designed to mislead, or trick, models into making incorrect predictions. An adversary can physically change their appearance by wearing adversarial t-shirts, glasses, or…

    Submitted 14 October, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted for publication in T-NNLS

  14. arXiv:2304.02202  [pdf, other]

    cs.CV cs.HC cs.LG

    Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Heatmaps are widely used to interpret deep neural networks, particularly for computer vision tasks, and the heatmap-based explainable AI (XAI) techniques are a well-researched topic. However, most studies concentrate on enhancing the quality of the generated heatmap or discovering alternate heatmap generation techniques, and little effort has been devoted to making heatmap-based XAI automatic, int…

    Submitted 4 April, 2023; originally announced April 2023.

  15. arXiv:2303.14006  [pdf, other]

    cs.DC cs.LG

    ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

    Authors: William Won, Taekyung Heo, Saeed Rashidi, Srinivas Sridharan, Sudarshan Srinivasan, Tushar Krishna

    Abstract: As deep learning models and input data are scaling at an unprecedented rate, it is inevitable to move towards distributed training platforms to fit the model and increase training throughput. State-of-the-art approaches and techniques, such as wafer-scale nodes, multi-dimensional network topologies, disaggregated memory systems, and parallelization strategies, have been actively adopted by emergin…

    Submitted 24 March, 2023; originally announced March 2023.

  16. arXiv:2303.13894  [pdf, ps, other]

    math.DS math.CV

    Polynomial correspondences expressible as maps of $d$-tuples

    Authors: Shrihari Sridharan, Subith G., Atma Ram Tiwari

    Abstract: In this paper, we consider polynomial correspondences $f (x, y)$ in $\mathbb{C}[x, y]$ of degree $d \ge 2$ in both the variables and obtain necessary and sufficient conditions in order that the equation $f (x, y) = 0$ can be expressed as $φ(x) = ψ(y)$, where $φ$ and $ψ$ are fractional degree $d$ rational maps in the Riemann sphere. In the absence of involutions that played a vital role towards cha…

    Submitted 24 March, 2023; originally announced March 2023.

  17. arXiv:2303.08597  [pdf, other]

    cs.CV

    Aerial-Ground Person Re-ID

    Authors: Huy Nguyen, Kien Nguyen, Sridha Sridharan, Clinton Fookes

    Abstract: Person re-ID matches persons across multiple non-overlapping cameras. Despite the increasing deployment of airborne platforms in surveillance, existing person re-ID benchmarks focus on ground-ground matching, with very limited effort on aerial-aerial matching. We propose a new benchmark dataset - AG-ReID, which performs person re-ID matching in a new setting: across aerial and ground ca…

    Submitted 14 August, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Published at the IEEE International Conference on Multimedia and Expo 2023 (ICME2023)

  18. arXiv:2303.07470  [pdf, other]

    cs.LG cs.AR

    X-Former: In-Memory Acceleration of Transformers

    Authors: Shrihari Sridharan, Jacob R. Stevens, Kaushik Roy, Anand Raghunathan

    Abstract: Transformers have achieved great success in a wide variety of natural language processing (NLP) tasks due to the attention mechanism, which assigns an importance score for every word relative to other words in a sequence. However, these models are very large, often reaching hundreds of billions of parameters, and therefore require a large number of DRAM accesses. Hence, traditional deep neural net…

    Submitted 13 March, 2023; originally announced March 2023.

  19. arXiv:2301.04122  [pdf, other]

    cs.DC cs.AI

    Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks

    Authors: Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou

    Abstract: Building large AI fleets to support the rapidly growing DL workloads is an active research topic for modern cloud providers. Generating accurate benchmarks plays an essential role in designing the fast-paced software and hardware solutions in this space. Two fundamental challenges to make this scalable are (i) workload representativeness and (ii) the ability to quickly incorporate changes to the f…

    Submitted 11 April, 2023; v1 submitted 16 December, 2022; originally announced January 2023.

    Comments: Accepted to ISCA 2023

  20. arXiv:2211.12732  [pdf, other]

    cs.RO

    Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments

    Authors: Joshua Knights, Kavisha Vidanapathirana, Milad Ramezani, Sridha Sridharan, Clinton Fookes, Peyman Moghadam

    Abstract: Many existing datasets for lidar place recognition are solely representative of structured urban environments, and have recently been saturated in performance by deep learning based approaches. Natural and unstructured environments present many additional challenges for the tasks of long-term localisation but these environments are not represented in currently available datasets. To address this w…

    Submitted 2 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Equal contribution from first two authors. Accepted to ICRA 2023. Website link: https://csiro-robotics.github.io/Wild-Places/

  21. arXiv:2211.08565  [pdf, other]

    cs.CV

    Using Auxiliary Information for Person Re-Identification -- A Tutorial Overview

    Authors: Tharindu Fernando, Clinton Fookes, Sridha Sridharan, Dana Michalski

    Abstract: Person re-identification (re-id) is a pivotal task within an intelligent surveillance pipeline and there exist numerous re-id frameworks that achieve satisfactory performance in challenging benchmarks. However, these systems struggle to generate acceptable results when there are significant differences between the camera views, illumination conditions, or occlusions. This result can be attributed…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Preprint Submitted to Pattern Recognition

  22. Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization

    Authors: Kavisha Vidanapathirana, Peyman Moghadam, Sridha Sridharan, Clinton Fookes

    Abstract: In large-scale metric localization, an incorrect result during retrieval will lead to an incorrect pose estimate or loop closure. Re-ranking methods propose to take into account all the top retrieval candidates and re-order them to increase the likelihood of the top candidate being correct. However, state-of-the-art re-ranking methods are inefficient when re-ranking many potential candidates due t…

    Submitted 6 March, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in IEEE RA-L (2023)

  23. arXiv:2207.10898  [pdf, other]

    cs.NI cs.AI

    Impact of RoCE Congestion Control Policies on Distributed Training of DNNs

    Authors: Tarannum Khan, Saeed Rashidi, Srinivas Sridharan, Pallavi Shurpali, Aditya Akella, Tushar Krishna

    Abstract: RDMA over Converged Ethernet (RoCE) has gained significant traction for datacenter networks due to its compatibility with conventional Ethernet-based fabric. However, the RDMA protocol is efficient only on (nearly) lossless networks, emphasizing the vital role of congestion control on RoCE networks. Unfortunately, the native RoCE congestion control scheme, based on Priority Flow Control (PFC), s…

    Submitted 22 July, 2022; originally announced July 2022.

  24. arXiv:2207.01769  [pdf, other]

    cs.CV

    SESS: Saliency Enhancing with Scaling and Sliding

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: High-quality saliency maps are essential in several machine learning application areas including explainable AI and weakly supervised object detection and segmentation. Many techniques have been developed to generate better saliency using neural networks. However, they are often limited to specific saliency visualisation methods or saliency issues. We propose a novel saliency enhancing approach ca…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: This paper will be presented at ECCV2022

  25. Towards On-Board Panoptic Segmentation of Multispectral Satellite Images

    Authors: Tharindu Fernando, Clinton Fookes, Harshala Gammulle, Simon Denman, Sridha Sridharan

    Abstract: With tremendous advancements in low-power embedded computing devices and remote sensing instruments, the traditional satellite image processing pipeline which includes an expensive data transfer step prior to processing data on the ground is being replaced by on-board processing of captured data. This paradigm shift enables critical and time-sensitive analytic intelligence to be acquired in a time…

    Submitted 4 April, 2022; originally announced April 2022.

  26. arXiv:2203.00807  [pdf, other]

    cs.CV cs.AI

    InCloud: Incremental Learning for Point Cloud Place Recognition

    Authors: Joshua Knights, Peyman Moghadam, Milad Ramezani, Sridha Sridharan, Clinton Fookes

    Abstract: Place recognition is a fundamental component of robotics, and has seen tremendous improvements through the use of deep learning models in recent years. Networks can experience significant drops in performance when deployed in unseen or highly dynamic environments, and require additional training on the collected data. However, naively fine-tuning on new training distributions can cause severe degra…

    Submitted 29 November, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  27. arXiv:2201.03080  [pdf, other]

    cs.CV cs.AI cs.CR cs.LG

    The State of Aerial Surveillance: A Survey

    Authors: Kien Nguyen, Clinton Fookes, Sridha Sridharan, Yingli Tian, Feng Liu, Xiaoming Liu, Arun Ross

    Abstract: The rapid emergence of airborne platforms and imaging sensors is enabling new forms of aerial surveillance due to their unprecedented advantages in scale, mobility, deployment and covert observation capabilities. This paper provides a comprehensive overview of human-centric aerial surveillance tasks from a computer vision and pattern recognition perspective. It aims to provide readers with an in-…

    Submitted 12 January, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

  28. arXiv:2112.00289  [pdf, other]

    cs.CV cs.AI

    Point Cloud Segmentation Using Sparse Temporal Local Attention

    Authors: Joshua Knights, Peyman Moghadam, Clinton Fookes, Sridha Sridharan

    Abstract: Point clouds are a key modality used for perception in autonomous vehicles, providing the means for a robust geometric understanding of the surrounding environment. However, despite the sensor outputs from autonomous vehicles being naturally temporal, there is still limited exploration of exploiting point cloud sequences for 3D semantic segmentation. In this paper we propose a novel Spar…

    Submitted 2 December, 2021; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 8 pages, 3 figures. Published at the Australasian Conference on Robotics and Automation (ACRA) 2021

  29. arXiv:2110.04478  [pdf, other]

    cs.DC cs.AR cs.LG cs.NI

    Themis: A Network Bandwidth-Aware Collective Scheduling Policy for Distributed Training of DL Models

    Authors: Saeed Rashidi, William Won, Sudarshan Srinivasan, Srinivas Sridharan, Tushar Krishna

    Abstract: Distributed training is a solution to reduce DNN training time by splitting the task across multiple NPUs (e.g., GPU/TPU). However, distributed training adds communication overhead between the NPUs in order to synchronize the gradients and/or activation, depending on the parallelization strategy. In next-generation platforms for training at scale, NPUs will be connected through multi-dimensional n…

    Submitted 7 July, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

  30. arXiv:2109.13495  [pdf, ps, other]

    math.DS

    Dynamics of Products of Matrices in Max Algebra

    Authors: S. Jayaraman, Y. K. Prajapaty, S. Sridharan

    Abstract: The aim of this manuscript is to understand the dynamics of matrix products in a max algebra. A consequence of the Perron-Frobenius theorem on periodic points of a nonnegative matrix is generalized to a max algebra setting. The same is then studied for a finite product associated to a $p$-lettered word on $N$ letters arising from a finite collection of nonnegative matrices, with each member having…

    Submitted 28 September, 2021; originally announced September 2021.

    MSC Class: 15A80; 15B34; 37H12

  31. LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition

    Authors: Kavisha Vidanapathirana, Milad Ramezani, Peyman Moghadam, Sridha Sridharan, Clinton Fookes

    Abstract: Retrieval-based place recognition is an efficient and effective solution for re-localization within a pre-built map, or global data association for Simultaneous Localization and Mapping (SLAM). The accuracy of such an approach is heavily dependent on the quality of the extracted scene-level representation. While end-to-end solutions - which learn a global descriptor from input point clouds - have…

    Submitted 16 February, 2022; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted - ICRA 2022

  32. arXiv:2108.08995  [pdf, other]

    cs.CV cs.AI cs.LG

    Discriminative Domain-Invariant Adversarial Network for Deep Domain Generalization

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Sridha Sridharan

    Abstract: Domain generalization approaches aim to learn a domain invariant prediction model for unknown target domains from multiple training source domains with different distributions. Significant efforts have recently been committed to broad domain generalization, which is a challenging and topical problem in machine learning and computer vision communities. Most previous domain generalization approaches…

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: This manuscript is submitted to Computer Vision and Image Understanding (CVIU)

  33. arXiv:2108.03786  [pdf, other]

    eess.IV cs.CV cs.LG

    Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis

    Authors: Harshala Gammulle, Tharindu Fernando, Sridha Sridharan, Simon Denman, Clinton Fookes

    Abstract: This paper presents a novel lightweight COVID-19 diagnosis framework using CT scans. Our system utilises a novel two-stage approach to generate robust and efficient diagnoses across heterogeneous patient level inputs. We use a powerful backbone network as a feature extractor to capture discriminative slice-level features. These features are aggregated by a lightweight network to obtain a patient l…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: IEEE International Conference on Autonomous Systems 2021

  34. arXiv:2106.15835  [pdf, other]

    cs.SD cs.LG eess.AS

    Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings

    Authors: Tharindu Fernando, Sridha Sridharan, Simon Denman, Houman Ghaemmaghami, Clinton Fookes

    Abstract: This paper proposes a novel framework for lung sound event detection, segmenting continuous lung sound recordings into discrete events and performing recognition on each event. Exploiting the lightweight nature of Temporal Convolution Networks (TCNs) and their superior results compared to their recurrent counterparts, we propose a lightweight, yet robust, and completely interpretable framework for…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: preprint submitted to JBHI

  35. arXiv:2106.05599  [pdf, other]

    quant-ph physics.ins-det

    Gated InGaAs Detector Characterization with Sub-Picosecond Weak Coherent Pulses

    Authors: Gautam Kumar Shaw, Shyam Sridharan, Anil Prabhakar

    Abstract: We propose and demonstrate a method to characterize a gated InGaAs single-photon detector (SPD). Ultrashort weak coherent pulses, from a mode-locked sub-picosecond pulsed laser, were used to measure photon counts, at varying arrival times relative to the start of the SPD gate voltage. The uneven detection probabilities within the gate window were used to estimate the afterpulse probability with re…

    Submitted 11 July, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 15 pages, 10 figures

  36. arXiv:2104.13780  [pdf, other]

    cs.CV cs.AI cs.LG

    Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification

    Authors: Amena Khatun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: In a real world environment, person re-identification (Re-ID) is a challenging task due to variations in lighting conditions, viewing angles, pose and occlusions. Despite recent performance gains, current person Re-ID algorithms still suffer heavily when encountering these variations. To address this problem, we propose a semantic consistency and identity mapping multi-component generative adversa…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted in WACV 2020

    Journal ref: WACV, 2020

  37. arXiv:2104.13773  [pdf, other]

    cs.CV cs.AI cs.LG

    Pose-driven Attention-guided Image Generation for Person Re-Identification

    Authors: Amena Khatun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Person re-identification (re-ID) concerns the matching of subject images across different camera views in a multi camera surveillance system. One of the major challenges in person re-ID is pose variations across the camera network, which significantly affects the appearance of a person. Existing development data lack adequate pose variations to carry out effective training of person re-ID systems.…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Submitted to Pattern Recognition

  38. arXiv:2104.13725  [pdf, other]

    cs.CV cs.AI cs.LG

    Preserving Semantic Consistency in Unsupervised Domain Adaptation Using Generative Adversarial Networks

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Sridha Sridharan

    Abstract: Unsupervised domain adaptation seeks to mitigate the distribution discrepancy between source and target domains, given labeled samples of the source domain and unlabeled samples of the target domain. Generative adversarial networks (GANs) have demonstrated significant improvement in domain adaptation by producing images which are domain specific for training. However, most of the existing GAN base…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Submitted to Pattern Recognition Letters

  39. arXiv:2104.13581  [pdf, other]

    cs.CV cs.AI cs.LG

    Deep Domain Generalization with Feature-norm Network

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Sridha Sridharan

    Abstract: In this paper, we tackle the problem of training with multiple source domains with the aim to generalize to new domains at test time without an adaptation step. This is known as domain generalization (DG). Previous works on DG assume identical categories or label space across the source domains. In the case of category shift among the source domains, previous methods on DG are vulnerable to negati…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Submitted to Pattern Recognition

  40. Learning Regional Attention over Multi-resolution Deep Convolutional Features for Trademark Retrieval

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Large-scale trademark retrieval is an important content-based image retrieval task. A recent study shows that off-the-shelf deep features aggregated with Regional-Maximum Activation of Convolutions (R-MAC) achieve state-of-the-art results. However, R-MAC suffers in the presence of background clutter/trivial regions and scale variance, and discards important spatial information. We introduce three…

    Submitted 15 April, 2021; originally announced April 2021.

  41. arXiv:2104.05158  [pdf, other]

    cs.DC cs.AI cs.LG cs.PF

    Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

    Authors: Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, et al. (28 additional authors not shown)

    Abstract: Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pa…

    Submitted 26 February, 2023; v1 submitted 11 April, 2021; originally announced April 2021.

  42. arXiv:2102.04016  [pdf, other]

    cs.CV

    An Efficient Framework for Zero-Shot Sketch-Based Image Retrieval

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Ethan Goan, Clinton Fookes

    Abstract: Recently, Zero-shot Sketch-based Image Retrieval (ZS-SBIR) has attracted the attention of the computer vision community due to its real-world applications, and the more realistic and challenging setting than found in SBIR. ZS-SBIR inherits the main challenges of multiple computer vision problems including content-based Image Retrieval (CBIR), zero-shot learning and domain adaptation. The majority…

    Submitted 8 February, 2021; originally announced February 2021.

  43. arXiv:2101.11239  [pdf, other]

    cs.CV

    Im2Mesh GAN: Accurate 3D Hand Mesh Recovery from a Single RGB Image

    Authors: Akila Pemasiri, Kien Nguyen Thanh, Sridha Sridharan, Clinton Fookes

    Abstract: This work addresses hand mesh recovery from a single RGB image. In contrast to most of the existing approaches where the parametric hand models are employed as the prior, we show that the hand mesh can be learned directly from the input image. We propose a new type of GAN called Im2Mesh GAN to learn the mesh through end-to-end adversarial training. By interpreting the mesh as a graph, our model is…

    Submitted 27 January, 2021; originally announced January 2021.

  44. arXiv:2012.02364  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Deep Learning for Medical Anomaly Detection -- A Survey

    Authors: Tharindu Fernando, Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Machine learning-based medical anomaly detection is an important problem that has been extensively studied. Numerous approaches have been proposed across various medical application domains and we observe several similarities across these distinct applications. Despite this comparability, we observe a lack of structured organisation of these diverse research applications such that their advantages…

    Submitted 13 April, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Preprint submitted to ACM Computing Surveys

  45. Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling

    Authors: Kavisha Vidanapathirana, Peyman Moghadam, Ben Harwood, Muming Zhao, Sridha Sridharan, Clinton Fookes

    Abstract: Place Recognition enables the estimation of a globally consistent map and trajectory by providing non-local constraints in Simultaneous Localisation and Mapping (SLAM). This paper presents Locus, a novel place recognition method using 3D LiDAR point clouds in large-scale environments. We propose a method for extracting and encoding topological and temporal information related to components in a sc…

    Submitted 7 April, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: ICRA 2021. Implementation available at: https://github.com/csiro-robotics/locus

  46. Machine Learning (ML) In a 5G Standalone (SA) Self Organizing Network (SON)

    Authors: Srinivasan Sridharan

    Abstract: Machine learning (ML) is included in Self-organizing Networks (SONs) that are key drivers for enhancing the Operations, Administration, and Maintenance (OAM) activities. It is included in the 5G Standalone (SA) system is one of the 5G communication tracks that transforms 4G networking to next-generation technology that is based on mobile applications. The research's main aim is to an overview of m…

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 5G, Machine learning (ML), Self-organizing Networks (SONs), 5G Standalone, Artificial Intelligence (AI)

  47. arXiv:2011.11198  [pdf, other]

    cs.CV

    Complex-valued Iris Recognition Network

    Authors: Kien Nguyen, Clinton Fookes, Sridha Sridharan, Arun Ross

    Abstract: In this work, we design a fully complex-valued neural network for the task of iris recognition. Unlike the problem of general object recognition, where real-valued neural networks can be used to extract pertinent features, iris recognition depends on the extraction of both phase and magnitude information from the input iris texture in order to better represent its biometric content. This necessita…

    Submitted 15 February, 2022; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: This paper has been accepted for publication in T-PAMI

  48. arXiv:2011.09581  [pdf, other]

    cs.CV

    Patient-independent Epileptic Seizure Prediction using Deep Learning Models

    Authors: Theekshana Dissanayake, Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Objective: Epilepsy is one of the most prevalent neurological diseases among humans and can lead to severe brain injuries, strokes, and brain tumors. Early detection of seizures can help to mitigate injuries, and can be used to aid the treatment of patients with epilepsy. The purpose of a seizure prediction system is to successfully identify the pre-ictal brain stage, which occurs before a seizure…

    Submitted 18 November, 2020; originally announced November 2020.

  49. arXiv:2011.06207  [pdf, other]

    cs.CV

    Domain Generalization in Biosignal Classification

    Authors: Theekshana Dissanayake, Tharindu Fernando, Simon Denman, Houman Ghaemmaghami, Sridha Sridharan, Clinton Fookes

    Abstract: Objective: When training machine learning models, we often assume that the training data and evaluation data are sampled from the same distribution. However, this assumption is violated when the model is evaluated on another unseen but similar database, even if that database contains the same classes. This problem is caused by domain-shift and can be solved using two approaches: domain adaptation…

    Submitted 12 November, 2020; originally announced November 2020.

  50. arXiv:2011.05438  [pdf, other]

    cs.CV

    Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers

    Authors: Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Neural Memory Networks (NMNs) have received increased attention in recent years compared to deep architectures that use a constrained memory. Despite their new appeal, the success of NMNs hinges on the ability of the gradient-based optimiser to perform incremental training of the NMN controllers, determining how to leverage their high capacity for knowledge retrieval. This means that while excelle…

    Submitted 10 November, 2020; originally announced November 2020.