Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–30 of 30 results for author: Fieguth, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12723  [pdf, other

    cs.LG

    BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity

    Authors: Zahra Gharaee, Scott C. Lowe, ZeMing Gong, Pablo Millan Arias, Nicholas Pellegrino, Austin T. Wang, Joakim Bruslund Haurum, Iuliia Zarubiieva, Lila Kari, Dirk Steinke, Graham W. Taylor, Paul Fieguth, Angel X. Chang

    Abstract: As part of an ongoing worldwide effort to comprehend and monitor insect biodiversity, this paper presents the BIOSCAN-5M Insect dataset to the machine learning community and establish several benchmark tasks. BIOSCAN-5M is a comprehensive dataset containing multi-modal information for over 5 million insect specimens, and it significantly expands existing image-based biological datasets by includin… ▽ More

    Submitted 24 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  2. Video Relationship Detection Using Mixture of Experts

    Authors: Ala Shaabana, Zahra Gharaee, Paul Fieguth

    Abstract: Machine comprehension of visual information from images and videos by neural networks faces two primary challenges. Firstly, there exists a computational and inference gap in connecting vision and language, making it difficult to accurately determine which object a given agent acts on and represent it through language. Secondly, classifiers trained by a single, monolithic neural network often lack… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2312.13396  [pdf, other

    cs.CV

    EPNet: An Efficient Pyramid Network for Enhanced Single-Image Super-Resolution with Reduced Computational Requirements

    Authors: Xin Xu, Jinman Park, Paul Fieguth

    Abstract: Single-image super-resolution (SISR) has seen significant advancements through the integration of deep learning. However, the substantial computational and memory requirements of existing methods often limit their practical application. This paper introduces a new Efficient Pyramid Network (EPNet) that harmoniously merges an Edge Split Pyramid Module (ESPM) with a Panoramic Feature Extraction Modu… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  4. arXiv:2311.09338  [pdf, other

    cs.LG stat.AP

    Challenges for Predictive Modeling with Neural Network Techniques using Error-Prone Dietary Intake Data

    Authors: Dylan Spicker, Amir Nazemi, Joy Hutchinson, Paul Fieguth, Sharon I. Kirkpatrick, Michael Wallace, Kevin W. Dodd

    Abstract: Dietary intake data are routinely drawn upon to explore diet-health relationships. However, these data are often subject to measurement error, distorting the true relationships. Beyond measurement error, there are likely complex synergistic and sometimes antagonistic interactions between different dietary components, complicating the relationships between diet and health outcomes. Flexible models… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  5. arXiv:2309.15274  [pdf, other

    cs.CV cs.AI

    Memory-Efficient Continual Learning Object Segmentation for Long Video

    Authors: Amir Nazemi, Mohammad Javad Shafiee, Zahra Gharaee, Paul Fieguth

    Abstract: Recent state-of-the-art semi-supervised Video Object Segmentation (VOS) methods have shown significant improvements in target object segmentation accuracy when information from preceding frames is used in segmenting the current frame. In particular, such memory-based approaches can help a model to more effectively handle appearance changes (representation drift) or occlusions. Ideally, for maximum… ▽ More

    Submitted 14 February, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  6. arXiv:2307.10455  [pdf, other

    cs.CV cs.AI cs.LG

    A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset

    Authors: Zahra Gharaee, ZeMing Gong, Nicholas Pellegrino, Iuliia Zarubiieva, Joakim Bruslund Haurum, Scott C. Lowe, Jaclyn T. A. McKeown, Chris C. Y. Ho, Joschka McLeod, Yi-Yun C Wei, Jireh Agda, Sujeevan Ratnasingham, Dirk Steinke, Angel X. Chang, Graham W. Taylor, Paul Fieguth

    Abstract: In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled insect images, the BIOSCAN-Insect Dataset. Each record is taxonomically classified by an expert, and also has associated genetic information including raw nucleotide barcode sequences and assigned barcode index numbers, which are genetically-based proxies for species classification. This paper presents a c… ▽ More

    Submitted 13 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  7. arXiv:2306.01706  [pdf, other

    cs.CV cs.AI cs.LG

    Is Generative Modeling-based Stylization Necessary for Domain Adaptation in Regression Tasks?

    Authors: Jinman Park, Francois Barnard, Saad Hossain, Sirisha Rambhatla, Paul Fieguth

    Abstract: Unsupervised domain adaptation (UDA) aims to bridge the gap between source and target domains in the absence of target domain labels using two main techniques: input-level alignment (such as generative modeling and stylization) and feature-level alignment (which matches the distribution of the feature maps, e.g. gradient reversal layers). Motivated from the success of generative modeling for image… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

  8. arXiv:2304.04259  [pdf, other

    cs.CV cs.LG

    CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning

    Authors: Amir Nazemi, Zeyad Moustafa, Paul Fieguth

    Abstract: Continual learning in real-world scenarios is a major challenge. A general continual learning model should have a constant memory size and no predefined task boundaries, as is the case in semi-supervised Video Object Segmentation (VOS), where continual learning challenges particularly present themselves in working on long video sequences. In this article, we first formulate the problem of semi-sup… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  9. arXiv:2211.02537  [pdf, other

    cs.CV q-bio.PE

    Machine Learning Challenges of Biological Factors in Insect Image Data

    Authors: Nicholas Pellegrino, Zahra Gharaee, Paul Fieguth

    Abstract: The BIOSCAN project, led by the International Barcode of Life Consortium, seeks to study changes in biodiversity on a global scale. One component of the project is focused on studying the species interaction and dynamics of all insects. In addition to genetically barcoding insects, over 1.5 million images per year will be collected, each needing taxonomic classification. With the immense volume of… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 4 pages, 3 figures. Submitted to the Journal of Computational Vision and Imaging Systems

    ACM Class: I.4.0; E.0; J.3

  10. arXiv:2206.04785  [pdf, other

    cs.CV cs.AI cs.LG

    Building Spatio-temporal Transformers for Egocentric 3D Pose Estimation

    Authors: Jinman Park, Kimathi Kaai, Saad Hossain, Norikatsu Sumi, Sirisha Rambhatla, Paul Fieguth

    Abstract: Egocentric 3D human pose estimation (HPE) from images is challenging due to severe self-occlusions and strong distortion introduced by the fish-eye view from the head mounted camera. Although existing works use intermediate heatmap-based representations to counter distortion with some success, addressing self-occlusion remains an open problem. In this work, we leverage information from past frames… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 4 pages, Extended abstract, Joint International Workshop on Egocentric Perception, Interaction and Computing (EPIC) and Ego4D, IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

  11. arXiv:2202.07754  [pdf, other

    cs.CV eess.SP

    K-Means for Noise-Insensitive Multi-Dimensional Feature Learning

    Authors: Nicholas Pellegrino, Paul Fieguth, Parsin Haji Reza

    Abstract: Many measurement modalities which perform imaging by probing an object pixel-by-pixel, such as via Photoacoustic Microscopy, produce a multi-dimensional feature (typically a time-domain signal) at each pixel. In principle, the many degrees of freedom in the time-domain signal would admit the possibility of significant multi-modal information being implicitly present, much more than a single scalar… ▽ More

    Submitted 8 August, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Under consideration at Pattern Recognition Letters. 6 pages (excluding references), 5 figures

    MSC Class: 68T10 ACM Class: I.5.3

  12. arXiv:2111.04731  [pdf, other

    cs.CV cs.AI cs.LG

    Survey of Deep Learning Methods for Inverse Problems

    Authors: Shima Kamyab, Zohreh Azimifar, Rasool Sabzi, Paul Fieguth

    Abstract: In this paper we investigate a variety of deep learning strategies for solving inverse problems. We classify existing deep learning solutions for inverse problems into three categories of Direct Mapping, Data Consistency Optimizer, and Deep Regularizer. We choose a sample of each inverse problem type, so as to compare the robustness of the three categories, and report a statistical analysis of the… ▽ More

    Submitted 13 November, 2021; v1 submitted 7 November, 2021; originally announced November 2021.

  13. arXiv:2103.11357  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Deep ROC Analysis and AUC as Balanced Average Accuracy to Improve Model Selection, Understanding and Interpretation

    Authors: André M. Carrington, Douglas G. Manuel, Paul W. Fieguth, Tim Ramsay, Venet Osmani, Bernhard Wernly, Carol Bennett, Steven Hawken, Matthew McInnes, Olivia Magwood, Yusuf Sheikh, Andreas Holzinger

    Abstract: Optimal performance is critical for decision-making tasks from medicine to autonomous driving, however common performance measures may be too general or too specific. For binary classifiers, diagnostic tests or prognosis at a timepoint, measures such as the area under the receiver operating characteristic curve, or the area under the precision recall curve, are too general because they include unr… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

    Comments: 14 pages, 6 Figures, submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), currently under review

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2022

  14. arXiv:2101.11282  [pdf, other

    cs.CV

    Deep Learning for Instance Retrieval: A Survey

    Authors: Wei Chen, Yu Liu, Weiping Wang, Erwin Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, Michael S. Lew

    Abstract: In recent years a vast amount of visual content has been generated and shared from many fields, such as social media platforms, medical imaging, and robotics. This abundance of content creation and sharing has introduced new challenges, particularly that of searching databases for similar content-Content Based Image Retrieval (CBIR)-a long-established research area in which improved efficiency and… ▽ More

    Submitted 30 October, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  15. A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges

    Authors: Moloud Abdar, Farhad Pourpanah, Sadiq Hussain, Dana Rezazadegan, Li Liu, Mohammad Ghavamzadeh, Paul Fieguth, Xiaochun Cao, Abbas Khosravi, U Rajendra Acharya, Vladimir Makarenkov, Saeid Nahavandi

    Abstract: Uncertainty quantification (UQ) plays a pivotal role in reduction of uncertainties during both optimization and decision making processes. It can be applied to solve a variety of real-world applications in science and engineering. Bayesian approximation and ensemble learning techniques are two most widely-used UQ methods in the literature. In this regard, researchers have proposed different UQ met… ▽ More

    Submitted 5 January, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Report number: INFFUS_1411]

    Journal ref: 2021

  16. arXiv:2006.04305  [pdf, other

    cs.CV cs.LG

    Text Detection and Recognition in the Wild: A Review

    Authors: Zobeir Raisi, Mohamed A. Naiel, Paul Fieguth, Steven Wardell, John Zelek

    Abstract: Detection and recognition of text in natural images are two main problems in the field of computer vision that have a wide variety of applications in analysis of sports videos, autonomous driving, industrial automation, to name a few. They face common challenging problems that are factors in how text is represented and affected by several environmental conditions. The current state-of-the-art scen… ▽ More

    Submitted 30 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

  17. arXiv:2003.08756  [pdf, other

    cs.CV cs.LG cs.RO stat.ML

    Deep Neural Network Perception Models and Robust Autonomous Driving Systems

    Authors: Mohammad Javad Shafiee, Ahmadreza Jeddi, Amir Nazemi, Paul Fieguth, Alexander Wong

    Abstract: This paper analyzes the robustness of deep learning models in autonomous driving applications and discusses the practical solutions to address that.

    Submitted 4 March, 2020; originally announced March 2020.

  18. arXiv:1912.06409  [pdf, other

    cs.LG stat.ML

    Potential adversarial samples for white-box attacks

    Authors: Amir Nazemi, Paul Fieguth

    Abstract: Deep convolutional neural networks can be highly vulnerable to small perturbations of their inputs, potentially a major issue or limitation on system robustness when using deep networks as classifiers. In this paper we propose a low-cost method to explore marginal sample data near trained classifier decision boundaries, thus identifying potential adversarial samples. By finding such adversarial sa… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  19. arXiv:1904.09879  [pdf, other

    cs.CV cs.NE

    Assessing Architectural Similarity in Populations of Deep Neural Networks

    Authors: Audrey Chung, Paul Fieguth, Alexander Wong

    Abstract: Evolutionary deep intelligence has recently shown great promise for producing small, powerful deep neural network models via the synthesis of increasingly efficient architectures over successive generations. Despite recent research showing the efficacy of multi-parent evolutionary synthesis, little has been done to directly assess architectural similarity between networks during the synthesis proc… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: 3 pages. arXiv admin note: text overlap with arXiv:1811.07966

  20. arXiv:1811.07966  [pdf, other

    cs.CV cs.AI cs.NE

    Mitigating Architectural Mismatch During the Evolutionary Synthesis of Deep Neural Networks

    Authors: Audrey Chung, Paul Fieguth, Alexander Wong

    Abstract: Evolutionary deep intelligence has recently shown great promise for producing small, powerful deep neural network models via the organic synthesis of increasingly efficient architectures over successive generations. Existing evolutionary synthesis processes, however, have allowed the mating of parent networks independent of architectural alignment, resulting in a mismatch of network structures. We… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: 5 pages

  21. arXiv:1811.05817  [pdf, other

    cs.CV cs.AI cs.NE

    ProstateGAN: Mitigating Data Bias via Prostate Diffusion Imaging Synthesis with Generative Adversarial Networks

    Authors: Xiaodan Hu, Audrey G. Chung, Paul Fieguth, Farzad Khalvati, Masoom A. Haider, Alexander Wong

    Abstract: Generative Adversarial Networks (GANs) have shown considerable promise for mitigating the challenge of data scarcity when building machine learning-driven analysis algorithms. Specifically, a number of studies have shown that GAN-based image synthesis for data augmentation can aid in improving classification accuracy in a number of medical image analysis tasks, such as brain and liver image analys… ▽ More

    Submitted 20 November, 2018; v1 submitted 14 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  22. arXiv:1809.02165  [pdf, other

    cs.CV

    Deep Learning for Generic Object Detection: A Survey

    Authors: Li Liu, Wanli Ouyang, Xiaogang Wang, Paul Fieguth, Jie Chen, Xinwang Liu, Matti Pietikäinen

    Abstract: Object detection, one of the most fundamental and challenging problems in computer vision, seeks to locate object instances from a large number of predefined categories in natural images. Deep learning techniques have emerged as a powerful strategy for learning feature representations directly from data and have led to remarkable breakthroughs in the field of generic object detection. Given this p… ▽ More

    Submitted 22 August, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: IJCV Minor

  23. Texture Classification in Extreme Scale Variations using GANet

    Authors: Li Liu, Jie Chen, Guoying Zhao, Paul Fieguth, Xilin Chen, Matti Pietikäinen

    Abstract: Research in texture recognition often concentrates on recognizing textures with intraclass variations such as illumination, rotation, viewpoint and small scale changes. In contrast, in real-world applications a change in scale can have a dramatic impact on texture appearance, to the point of changing completely from one texture category to another. As a result, texture variations due to changes in… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: submitted to IEEE Transactions on Image Processing

  24. arXiv:1802.03318  [pdf, other

    cs.NE cs.AI cs.CV

    Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence

    Authors: Audrey G. Chung, Paul Fieguth, Alexander Wong

    Abstract: Evolutionary deep intelligence synthesizes highly efficient deep neural networks architectures over successive generations. Inspired by the nature versus nurture debate, we propose a study to examine the role of external factors on the network synthesis process by varying the availability of simulated environmental resources. Experimental results were obtained for networks synthesized via asexual… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

  25. arXiv:1801.10324  [pdf, other

    cs.CV cs.LG

    From BoW to CNN: Two Decades of Texture Representation for Texture Classification

    Authors: Li Liu, Jie Chen, Paul Fieguth, Guoying Zhao, Rama Chellappa, Matti Pietikainen

    Abstract: Texture is a fundamental characteristic of many types of images, and texture representation is one of the essential and challenging problems in computer vision and pattern recognition which has attracted extensive research attention. Since 2000, texture representations based on Bag of Words (BoW) and on Convolutional Neural Networks (CNNs) have been extensively studied with impressive performance.… ▽ More

    Submitted 3 October, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

    Comments: Accepted by IJCV

    MSC Class: 68T10

  26. arXiv:1709.02043  [pdf, other

    cs.NE cs.CV

    The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis

    Authors: Audrey Chung, Mohammad Javad Shafiee, Paul Fieguth, Alexander Wong

    Abstract: Evolutionary deep intelligence was recently proposed as a method for achieving highly efficient deep neural network architectures over successive generations. Drawing inspiration from nature, we propose the incorporation of sexual evolutionary synthesis. Rather than the current asexual synthesis of networks, we aim to produce more compact feature representations by synthesizing more diverse and ge… ▽ More

    Submitted 6 September, 2017; originally announced September 2017.

    Comments: 8 pages

  27. arXiv:1602.01728  [pdf, other

    cs.CV

    NeRD: a Neural Response Divergence Approach to Visual Salience Detection

    Authors: M. J. Shafiee, P. Siva, C. Scharfenberger, P. Fieguth, A. Wong

    Abstract: In this paper, a novel approach to visual salience detection via Neural Response Divergence (NeRD) is proposed, where synaptic portions of deep neural networks, previously trained for complex object recognition, are leveraged to compute low level cues that can be used to compute image region distinctiveness. Based on this concept , an efficient visual salience detection framework is proposed using… ▽ More

    Submitted 4 February, 2016; originally announced February 2016.

    Comments: 5 pages

  28. Domain Adaptation and Transfer Learning in StochasticNets

    Authors: Mohammad Javad Shafiee, Parthipan Siva, Paul Fieguth, Alexander Wong

    Abstract: Transfer learning is a recent field of machine learning research that aims to resolve the challenge of dealing with insufficient training data in the domain of interest. This is a particular issue with traditional deep neural networks where a large amount of training data is needed. Recently, StochasticNets was proposed to take advantage of sparse connectivity in order to decrease the number of pa… ▽ More

    Submitted 17 December, 2015; originally announced December 2015.

    Journal ref: Vision Letters, Vol. 1, No. 1, pp. VL115, 2015

  29. arXiv:1512.03844  [pdf, ps, other

    cs.LG stat.ML

    Efficient Deep Feature Learning and Extraction via StochasticNets

    Authors: Mohammad Javad Shafiee, Parthipan Siva, Paul Fieguth, Alexander Wong

    Abstract: Deep neural networks are a powerful tool for feature learning and extraction given their ability to model high-level abstractions in highly complex data. One area worth exploring in feature learning and extraction using deep neural networks is efficient neural connectivity formation for faster feature learning and extraction. Motivated by findings of stochastic synaptic connectivity formation in t… ▽ More

    Submitted 11 December, 2015; originally announced December 2015.

    Comments: 10 pages. arXiv admin note: substantial text overlap with arXiv:1508.05463

  30. arXiv:1506.09110  [pdf, other

    cs.CV

    Forming A Random Field via Stochastic Cliques: From Random Graphs to Fully Connected Random Fields

    Authors: Mohammad Javad Shafiee, Alexander Wong, Paul Fieguth

    Abstract: Random fields have remained a topic of great interest over past decades for the purpose of structured inference, especially for problems such as image segmentation. The local nodal interactions commonly used in such models often suffer the short-boundary bias problem, which are tackled primarily through the incorporation of long-range nodal interactions. However, the issue of computational tractab… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.

    Comments: 8 pages