Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 148 results for author: Gabbouj, M

.
  1. arXiv:2404.09010  [pdf, other

    cs.CV cs.LG

    MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wild

    Authors: Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Dynamic Facial Expression Recognition (DFER) has received significant interest in the recent years dictated by its pivotal role in enabling empathic and human-compatible technologies. Achieving robustness towards in-the-wild data in DFER is particularly important for real-world applications. One of the directions aimed at improving such models is multimodal emotion recognition based on audio and v… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: accepted to CVPR 2024 ABAW Workshop

  2. arXiv:2403.10936  [pdf

    eess.IV cs.CV cs.MM

    Channel-wise Feature Decorrelation for Enhanced Learned Image Compression

    Authors: Farhad Pakdaman, Moncef Gabbouj

    Abstract: The emerging Learned Compression (LC) replaces the traditional codec modules with Deep Neural Networks (DNN), which are trained end-to-end for rate-distortion performance. This approach is considered as the future of image/video compression, and major efforts have been dedicated to improving its compression efficiency. However, most proposed works target compression efficiency by employing more co… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  3. arXiv:2402.06530  [pdf, other

    cs.LG cs.AI eess.SP

    Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification

    Authors: Muhammad Uzair Zahid, Aysen Degerli, Fahad Sohrab, Serkan Kiranyaz, Tahir Hamid, Rashid Mazhar, Moncef Gabbouj

    Abstract: Early detection of myocardial infarction (MI), a critical condition arising from coronary artery disease (CAD), is vital to prevent further myocardial damage. This study introduces a novel method for early MI detection using a one-class classification (OCC) algorithm in echocardiography. Our study overcomes the challenge of limited echocardiography data availability by adopting a novel approach ba… ▽ More

    Submitted 27 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  4. arXiv:2402.05582  [pdf

    eess.IV cs.CV cs.MM

    Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs

    Authors: Yuxin Xie, Li Yu, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Noisy images are a challenge to image compression algorithms due to the inherent difficulty of compressing noise. As noise cannot easily be discerned from image details, such as high-frequency signals, its presence leads to extra bits needed for compression. Since the emerging learned image compression paradigm enables end-to-end optimization of codecs, recent efforts were made to integrate denois… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  5. arXiv:2402.02936  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Panoramic Image Inpainting With Gated Convolution And Contextual Reconstruction Loss

    Authors: Li Yu, Yanjun Gao, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Deep learning-based methods have demonstrated encouraging results in tackling the task of panoramic image inpainting. However, it is challenging for existing methods to distinguish valid pixels from invalid pixels and find suitable references for corrupted areas, thus leading to artifacts in the inpainted results. In response to these challenges, we propose a panoramic image inpainting framework t… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - to appear in IEEE ICASSP 2024

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

  6. arXiv:2402.02922  [pdf, other

    cs.CV eess.IV

    Pixel-Wise Color Constancy via Smoothness Techniques in Multi-Illuminant Scenes

    Authors: Umut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Most scenes are illuminated by several light sources, where the traditional assumption of uniform illumination is invalid. This issue is ignored in most color constancy methods, primarily due to the complex spatial impact of multiple light sources on the image. Moreover, most existing multi-illuminant methods fail to preserve the smooth change of illumination, which stems from spatial dependencies… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  7. arXiv:2402.02836  [pdf

    eess.IV cs.CV cs.MM

    Perceptual Learned Image Compression via End-to-End JND-Based Optimization

    Authors: Farhad Pakdaman, Sanaz Nami, Moncef Gabbouj

    Abstract: Emerging Learned image Compression (LC) achieves significant improvements in coding efficiency by end-to-end training of neural networks for compression. An important benefit of this approach over traditional codecs is that any optimization criteria can be directly applied to the encoder-decoder networks during training. Perceptual optimization of LC to comply with the Human Visual System (HVS) is… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  8. arXiv:2402.02245  [pdf, other

    cs.CV cs.LG eess.IV

    Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets

    Authors: Lei Xu, Moncef Gabbouj

    Abstract: Anomalous crack region detection is a typical binary semantic segmentation task, which aims to detect pixels representing cracks on pavement surface images automatically by algorithms. Although existing deep learning-based methods have achieved outcoming results on specific public pavement datasets, the performance would deteriorate dramatically on imbalanced datasets. The input datasets used in s… ▽ More

    Submitted 7 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  9. arXiv:2402.02066  [pdf, other

    cs.SI cs.AI

    Trustworthiness of $\mathbb{X}$ Users: A One-Class Classification Approach

    Authors: Tanveer Khan, Fahad Sohrab, Antonis Michalas, Moncef Gabbouj

    Abstract: $\mathbb{X}$ (formerly Twitter) is a prominent online social media platform that plays an important role in sharing information making the content generated on this platform a valuable source of information. Ensuring trust on $\mathbb{X}$ is essential to determine the user credibility and prevents issues across various domains. While assigning credibility to $\mathbb{X}… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  10. arXiv:2401.16522  [pdf, other

    cs.CV

    Dropout Concrete Autoencoder for Band Selection on HSI Scenes

    Authors: Lei Xu, Mete Ahishali, Moncef Gabbouj

    Abstract: Deep learning-based informative band selection methods on hyperspectral images (HSI) recently have gained intense attention to eliminate spectral correlation and redundancies. However, the existing deep learning-based methods either need additional post-processing strategies to select the descriptive bands or optimize the model indirectly, due to the parameterization inability of discrete variable… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  11. Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features

    Authors: Ali Falahati, Mohammad Karim Safavi, Ardavan Elahi, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Providing high-quality video with efficient bitrate is a main challenge in video industry. The traditional one-size-fits-all scheme for bitrate ladders is inefficient and reaching the best content-aware decision computationally impractical due to extensive encodings required. To mitigate this, we propose a bitrate and complexity efficient bitrate ladder prediction method using transfer learning an… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 7 pages, 9 figures, 7 tables, Copyright 2024 IEEE - Presented in IEEE MVIP 2024

    ACM Class: I.4.2

    Journal ref: Proc. 2024 13th Iranian/3rd Int. Conf. Mach. Vis. Image Process. (MVIP) (2024) 1-7

  12. arXiv:2401.02904  [pdf, other

    cs.LG stat.ML

    Class-wise Generalization Error: an Information-Theoretic Analysis

    Authors: Firas Laakom, Yuheng Bu, Moncef Gabbouj

    Abstract: Existing generalization theories of supervised learning typically take a holistic approach and provide bounds for the expected generalization over the whole data distribution, which implicitly assumes that the model generalizes similarly for all the classes. In practice, however, there are significant variations in generalization performance among different classes, which cannot be captured by the… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 26 pages

  13. arXiv:2401.01172  [pdf, other

    cs.LG cs.AI eess.SY

    Quadratic Time-Frequency Analysis of Vibration Signals for Diagnosing Bearing Faults

    Authors: Mohammad Al-Sa'd, Tuomas Jalonen, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: Diagnosis of bearing faults is paramount to reducing maintenance costs and operational breakdowns. Bearing faults are primary contributors to machine vibrations, and analyzing their signal morphology offers insights into their health status. Unfortunately, existing approaches are optimized for controlled environments, neglecting realistic conditions such as time-varying rotational speeds and the v… ▽ More

    Submitted 8 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  14. arXiv:2312.10742  [pdf

    cs.SD cs.AI eess.AS

    Exploring Sound vs Vibration for Robust Fault Detection on Rotating Machinery

    Authors: Serkan Kiranyaz, Ozer Can Devecioglu, Amir Alhams, Sadok Sassi, Turker Ince, Onur Avci, Moncef Gabbouj

    Abstract: Robust and real-time detection of faults on rotating machinery has become an ultimate objective for predictive maintenance in various industries. Vibration-based Deep Learning (DL) methodologies have become the de facto standard for bearing fault detection as they can produce state-of-the-art detection performances under certain conditions. Despite such particular focus on the vibration signal, th… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 8 pages

  15. arXiv:2311.18547  [pdf, other

    cs.LG cs.AI eess.SY

    Real-Time Vibration-Based Bearing Fault Diagnosis Under Time-Varying Speed Conditions

    Authors: Tuomas Jalonen, Mohammad Al-Sa'd, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: Detection of rolling-element bearing faults is crucial for implementing proactive maintenance strategies and for minimizing the economic and operational consequences of unexpected failures. However, many existing techniques are developed and tested under strictly controlled conditions, limiting their adaptability to the diverse and dynamic settings encountered in practical applications. This paper… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  16. arXiv:2311.10170  [pdf, other

    cs.LG

    Improving Unimodal Inference with Multimodal Transformers

    Authors: Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: This paper proposes an approach for improving performance of unimodal models with multimodal training. Our approach involves a multi-branch architecture that incorporates unimodal models with a multimodal transformer-based branch. By co-training these branches, the stronger multimodal branch can transfer its knowledge to the weaker unimodal branches through a multi-task objective, thereby improvin… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  17. arXiv:2310.01148  [pdf, other

    cs.LG

    Cryptocurrency Portfolio Optimization by Neural Networks

    Authors: Quoc Minh Nguyen, Dat Thanh Tran, Juho Kanniainen, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Many cryptocurrency brokers nowadays offer a variety of derivative assets that allow traders to perform hedging or speculation. This paper proposes an effective algorithm based on neural networks to take advantage of these investment products. The proposed algorithm constructs a portfolio that contains a pair of negatively correlated assets. A deep neural network, which outputs the allocation weig… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures, accepted at SSCI 2023

  18. arXiv:2309.15520  [pdf, other

    cs.LG cs.CV eess.IV

    SAF-Net: Self-Attention Fusion Network for Myocardial Infarction Detection using Multi-View Echocardiography

    Authors: Ilke Adalioglu, Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: Myocardial infarction (MI) is a severe case of coronary artery disease (CAD) and ultimately, its detection is substantial to prevent progressive damage to the myocardium. In this study, we propose a novel view-fusion model named self-attention fusion network (SAF-Net) to detect MI from multi-view echocardiography recordings. The proposed framework utilizes apical 2-chamber (A2C) and apical 4-chamb… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 4 pages, 3 figures, Computing in Cardiology (CinC) 2023

  19. arXiv:2309.14880  [pdf, other

    cs.LG

    Credit Card Fraud Detection with Subspace Learning-based One-Class Classification

    Authors: Zaffar Zaffar, Fahad Sohrab, Juho Kanniainen, Moncef Gabbouj

    Abstract: In an increasingly digitalized commerce landscape, the proliferation of credit card fraud and the evolution of sophisticated fraudulent techniques have led to substantial financial losses. Automating credit card fraud detection is a viable way to accelerate detection, reducing response times and minimizing potential financial losses. However, addressing this challenge is complicated by the highly… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 6 pages, 1 figure, 2 tables. Accepted at IEEE Symposium Series on Computational Intelligence 2023

  20. arXiv:2309.14134  [pdf, other

    cs.LG cs.CR

    One-Class Classification for Intrusion Detection on Vehicular Networks

    Authors: Jake Guidry, Fahad Sohrab, Raju Gottumukkala, Satya Katragadda, Moncef Gabbouj

    Abstract: Controller Area Network bus systems within vehicular networks are not equipped with the tools necessary to ward off and protect themselves from modern cyber-security threats. Work has been done on using machine learning methods to detect and report these attacks, but common methods are not robust towards unknown attacks. These methods usually rely on there being a sufficient representation of atta… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 7 pages, 2 figures, 4 tables. Accepted at IEEE Symposium Series on Computational Intelligence 2023

  21. arXiv:2309.14090  [pdf, other

    cs.LG cs.CV

    Convolutional autoencoder-based multimodal one-class classification

    Authors: Firas Laakom, Fahad Sohrab, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: One-class classification refers to approaches of learning using data from a single class only. In this paper, we propose a deep learning one-class classification method suitable for multimodal data, which relies on two convolutional autoencoders jointly trained to reconstruct the positive input data while obtaining the data representations in the latent space as compact as possible. During inferen… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 5 pages, 1 figure, 4 tables

  22. arXiv:2309.13960  [pdf, other

    cs.LG

    Newton Method-based Subspace Support Vector Data Description

    Authors: Fahad Sohrab, Firas Laakom, Moncef Gabbouj

    Abstract: In this paper, we present an adaptation of Newton's method for the optimization of Subspace Support Vector Data Description (S-SVDD). The objective of S-SVDD is to map the original data to a subspace optimized for one-class classification, and the iterative optimization process of data mapping and description in S-SVDD relies on gradient descent. However, gradient descent only utilizes first-order… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 8 pages, 2 figures, 2 tables, 1 Algorithm. Accepted at IEEE Symposium Series on Computational Intelligence 2023

  23. arXiv:2307.06065  [pdf, other

    cs.CV

    Operational Support Estimator Networks

    Authors: Mete Ahishali, Mehmet Yamac, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: In this work, we propose a novel approach called Operational Support Estimator Networks (OSENs) for the support estimation task. Support Estimation (SE) is defined as finding the locations of non-zero elements in sparse signals. By its very nature, the mapping between the measurement and sparse signal is a non-linear operation. Traditional support estimators rely on computationally expensive itera… ▽ More

    Submitted 2 May, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

  24. arXiv:2306.01489  [pdf, other

    cs.LG cs.IT

    On Feature Diversity in Energy-based Models

    Authors: Firas Laakom, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Energy-based learning is a powerful learning paradigm that encapsulates various discriminative and generative approaches. An energy-based model (EBM) is typically formed of inner-model(s) that learn a combination of the different features to generate an energy mapping for each input configuration. In this paper, we focus on the diversity of the produced feature set. We extend the probably approxim… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 18 pages, 3 figures

  25. arXiv:2305.07960  [pdf

    cs.SD cs.HC eess.AS

    Sound-to-Vibration Transformation for Sensorless Motor Health Monitoring

    Authors: Ozer Can Devecioglu, Serkan Kiranyaz, Amer Elhmes, Sadok Sassi, Turker Ince, Onur Avci, Mohammad Hesam Soleimani-Babakamali, Ertugrul Taciroglu, Moncef Gabbouj

    Abstract: Automatic sensor-based detection of motor failures such as bearing faults is crucial for predictive maintenance in various industries. Numerous methodologies have been developed over the years to detect bearing faults. Despite the appearance of numerous different approaches for diagnosing faults in motors have been proposed, vibration-based methods have become the de facto standard and the most co… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

  26. arXiv:2304.09840  [pdf, other

    cs.LG cs.CE q-fin.TR

    Optimum Output Long Short-Term Memory Cell for High-Frequency Trading Forecasting

    Authors: Adamantios Ntakaris, Moncef Gabbouj, Juho Kanniainen

    Abstract: High-frequency trading requires fast data processing without information lags for precise stock price forecasting. This high-paced stock price forecasting is usually based on vectors that need to be treated as sequential and time-independent signals due to the time irregularities that are inherent in high-frequency trading. A well-documented and tested method that considers these time-irregulariti… ▽ More

    Submitted 15 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  27. Hyperspectral Image Analysis with Subspace Learning-based One-Class Classification

    Authors: Sertac Kilickaya, Mete Ahishali, Fahad Sohrab, Turker Ince, Moncef Gabbouj

    Abstract: Hyperspectral image (HSI) classification is an important task in many applications, such as environmental monitoring, medical imaging, and land use/land cover (LULC) classification. Due to the significant amount of spectral information from recent HSI sensors, analyzing the acquired images is challenging using traditional Machine Learning (ML) methods. As the number of frequency bands increases, t… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Journal ref: 2023 Photonics & Electromagnetics Research Symposium (PIERS)

  28. Improved Active Fire Detection using Operational U-Nets

    Authors: Ozer Can Devecioglu, Mete Ahishali, Fahad Sohrab, Turker Ince, Moncef Gabbouj

    Abstract: As a consequence of global warming and climate change, the risk and extent of wildfires have been increasing in many areas worldwide. Warmer temperatures and drier conditions can cause quickly spreading fires and make them harder to control; therefore, early detection and accurate locating of active fires are crucial in environmental monitoring. Using satellite imagery to monitor and detect active… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Journal ref: 2023 Photonics & Electromagnetics Research Symposium (PIERS)

  29. arXiv:2303.16636  [pdf, other

    eess.IV

    Operational Neural Networks for Parameter-Efficient Hyperspectral Single-Image Super-Resolution

    Authors: Alexander Ulrichsen, Paul Murray, Stephen Marshall, Moncef Gabbouj, Serkan Kiranyaz, Mehmet Yamac, Nour Aburaed

    Abstract: Hyperspectral Imaging is a crucial tool in remote sensing which captures far more spectral information than standard color images. However, the increase in spectral information comes at the cost of spatial resolution. Super-resolution is a popular technique where the goal is to generate a high-resolution version of a given low-resolution input. The majority of modern super-resolution approaches us… ▽ More

    Submitted 25 October, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 17 pages, 12 figures

  30. arXiv:2303.10559  [pdf, other

    cs.CV

    Deep Learning for Camera Calibration and Beyond: A Survey

    Authors: Kang Liao, Lang Nie, Shujuan Huang, Chunyu Lin, Jing Zhang, Yao Zhao, Moncef Gabbouj, Dacheng Tao

    Abstract: Camera calibration involves estimating camera parameters to infer geometric features from captured sequences, which is crucial for computer vision and robotics. However, conventional calibration is laborious and requires dedicated collection. Recent efforts show that learning-based solutions have the potential to be used in place of the repeatability works of manual calibrations. Among these solut… ▽ More

    Submitted 4 June, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: Github repository: https://github.com/KangLiao929/Awesome-Deep-Camera-Calibration

  31. arXiv:2302.11947  [pdf, other

    cs.CV cs.LG

    Real-Time Damage Detection in Fiber Lifting Ropes Using Convolutional Neural Networks

    Authors: Tuomas Jalonen, Mohammad Al-Sa'd, Roope Mellanen, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: The health and safety hazards posed by worn crane lifting ropes mandate periodic inspection for damage. This task is time-consuming, prone to human error, halts operation, and may result in the premature disposal of ropes. Therefore, we propose using deep learning and computer vision methods to automate the process of detecting damaged ropes. Specifically, we present a novel vision-based system fo… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  32. arXiv:2301.01352  [pdf, other

    cs.LG cs.CV

    WLD-Reg: A Data-dependent Within-layer Diversity Regularizer

    Authors: Firas Laakom, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Neural networks are composed of multiple layers arranged in a hierarchical structure jointly trained with a gradient-based optimization, where the errors are back-propagated from the last layer back to the first one. At each optimization step, neurons at a given layer receive feedback from neurons belonging to higher layers of the hierarchy. In this paper, we propose to complement this traditional… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: accepted at AAAI 2023. arXiv admin note: substantial text overlap with arXiv:2106.06012

  33. arXiv:2212.14618  [pdf

    cs.SD cs.LG eess.AS

    Blind Restoration of Real-World Audio by 1D Operational GANs

    Authors: Turker Ince, Serkan Kiranyaz, Ozer Can Devecioglu, Muhammad Salman Khan, Muhammad Chowdhury, Moncef Gabbouj

    Abstract: Objective: Despite numerous studies proposed for audio restoration in the literature, most of them focus on an isolated restoration problem such as denoising or dereverberation, ignoring other artifacts. Moreover, assuming a noisy or reverberant environment with limited number of fixed signal-to-distortion ratio (SDR) levels is a common practice. However, real-world audio is often corrupted by a b… ▽ More

    Submitted 20 January, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

  34. arXiv:2212.06154  [pdf

    cs.LG cs.AI

    Zero-Shot Motor Health Monitoring by Blind Domain Transition

    Authors: Serkan Kiranyaz, Ozer Can Devecioglu, Amir Alhams, Sadok Sassi, Turker Ince, Osama Abdeljaber, Onur Avci, Moncef Gabbouj

    Abstract: Continuous long-term monitoring of motor health is crucial for the early detection of abnormalities such as bearing faults (up to 51% of motor failures are attributed to bearing faults). Despite numerous methodologies proposed for bearing fault detection, most of them require normal (healthy) and abnormal (faulty) data for training. Even with the recent deep learning (DL) methodologies trained on… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 13 pages, 9 figures, Journal

  35. Comprehensive Complexity Assessment of Emerging Learned Image Compression on CPU and GPU

    Authors: Farhad Pakdaman, Moncef Gabbouj

    Abstract: Learned Compression (LC) is the emerging technology for compressing image and video content, using deep neural networks. Despite being new, LC methods have already gained a compression efficiency comparable to state-of-the-art image compression, such as HEVC or even VVC. However, the existing solutions often require a huge computational complexity, which discourages their adoption in international… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023

  36. arXiv:2210.13827  [pdf, other

    cs.MM cs.CV

    End-to-end Transformer for Compressed Video Quality Enhancement

    Authors: Li Yu, Wenshuai Chang, Shiyu Wu, Moncef Gabbouj

    Abstract: Convolutional neural networks have achieved excellent results in compressed video quality enhancement task in recent years. State-of-the-art methods explore the spatiotemporal information of adjacent frames mainly by deformable convolution. However, offset fields in deformable convolution are difficult to train, and its instability in training often leads to offset overflow, which reduce the effic… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  37. arXiv:2209.14770  [pdf, other

    eess.IV cs.CV cs.LG

    R2C-GAN: Restore-to-Classify GANs for Blind X-Ray Restoration and COVID-19 Classification

    Authors: Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Tahir Hamid, Rashid Mazhar, Moncef Gabbouj

    Abstract: Restoration of poor quality images with a blended set of artifacts plays a vital role for a reliable diagnosis. Existing studies have focused on specific restoration problems such as image deblurring, denoising, and exposure correction where there is usually a strong assumption on the artifact type and severity. As a pioneer study in blind X-ray restoration, we propose a joint model for generic im… ▽ More

    Submitted 15 August, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  38. arXiv:2209.13542  [pdf, other

    cs.MM eess.SP

    EmpathicSchool: A multimodal dataset for real-time facial expressions and physiological data analysis under different stress conditions

    Authors: Majid Hosseini, Fahad Sohrab, Raju Gottumukkala, Ravi Teja Bhupatiraju, Satya Katragadda, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Affective computing has garnered researchers' attention and interest in recent years as there is a need for AI systems to better understand and react to human emotions. However, analyzing human emotions, such as mood or stress, is quite complex. While various stress studies use facial expressions and wearables, most existing datasets rely on processing data from a single modality. This paper prese… ▽ More

    Submitted 29 August, 2022; originally announced September 2022.

  39. Efficient CNN with uncorrelated Bag of Features pooling

    Authors: Firas Laakom, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Despite the superior performance of CNN, deploying them on low computational power devices is still limited as they are typically computationally expensive. One key cause of the high complexity is the connection between the convolution layers and the fully connected layers, which typically requires a high number of parameters. To alleviate this issue, Bag of Features (BoF) pooling has been recentl… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: 6 pages, 2 Figures

    Journal ref: 2022 IEEE Symposium Series on Computational Intelligence (SSCI)

  40. arXiv:2209.05761  [pdf, other

    cs.MM cs.NI

    A Survey on Mobile Edge Computing for Video Streaming: Opportunities and Challenges

    Authors: Muhammad Asif Khan, Emna Baccour, Zina Chkirbene, Aiman Erbad, Ridha Hamila, Mounir Hamdi, Moncef Gabbouj

    Abstract: 5G communication brings substantial improvements in the quality of service provided to various applications by achieving higher throughput and lower latency. However, interactive multimedia applications (e.g., ultra high definition video conferencing, 3D and multiview video streaming, crowd-sourced video streaming, cloud gaming, virtual and augmented reality) are becoming more ambitious with high… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 36 pages

  41. arXiv:2207.07089  [pdf, other

    cs.LG cs.CV

    A Personalized Zero-Shot ECG Arrhythmia Monitoring System: From Sparse Representation Based Domain Adaption to Energy Efficient Abnormal Beat Detection for Practical ECG Surveillance

    Authors: Mehmet Yamaç, Mert Duman, İlke Adalıoğlu, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: This paper proposes a low-cost and highly accurate ECG-monitoring system intended for personalized early arrhythmia detection for wearable mobile sensors. Earlier supervised approaches for personalized ECG monitoring require both abnormal and normal heartbeats for the training of the dedicated classifier. However, in a real-world scenario where the personalized algorithm is embedded in a wearable… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Software implementation: https://github.com/MertDuman/Zero-Shot-ECG

  42. Early Myocardial Infarction Detection with One-Class Classification over Multi-view Echocardiography

    Authors: Aysen Degerli, Fahad Sohrab, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: Myocardial infarction (MI) is the leading cause of mortality and morbidity in the world. Early therapeutics of MI can ensure the prevention of further myocardial necrosis. Echocardiography is the fundamental imaging technique that can reveal the earliest sign of MI. However, the scarcity of echocardiographic datasets for the MI detection is the major issue for training data-driven classification a… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  43. arXiv:2204.03768  [pdf

    cs.LG

    Global ECG Classification by Self-Operational Neural Networks with Feature Injection

    Authors: Muhammad Uzair Zahid, Serkan Kiranyaz, Moncef Gabbouj

    Abstract: Objective: Global (inter-patient) ECG classification for arrhythmia detection over Electrocardiogram (ECG) signal is a challenging task for both humans and machines. The main reason is the significant variations of both normal and arrhythmic ECG patterns among patients. Automating this process with utmost accuracy is, therefore, highly desirable due to the advent of wearable ECG sensors. However,… ▽ More

    Submitted 29 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  44. arXiv:2203.00403  [pdf, other

    cs.RO cs.AI

    OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics

    Authors: N. Passalis, S. Pedrazzi, R. Babuska, W. Burgard, D. Dias, F. Ferro, M. Gabbouj, O. Green, A. Iosifidis, E. Kayacan, J. Kober, O. Michel, N. Nikolaidis, P. Nousi, R. Pieters, M. Tzelepi, A. Valada, A. Tefas

    Abstract: Existing Deep Learning (DL) frameworks typically do not provide ready-to-use solutions for robotics, where very specific learning, reasoning, and embodiment problems exist. Their relatively steep learning curve and the different methodologies employed by DL compared to traditional approaches, along with the high complexity of DL models, which often leads to the need of employing specialized hardwa… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  45. arXiv:2202.10185  [pdf, other

    eess.IV cs.CV cs.LG

    OSegNet: Operational Segmentation Network for COVID-19 Detection using Chest X-ray Images

    Authors: Aysen Degerli, Serkan Kiranyaz, Muhammad E. H. Chowdhury, Moncef Gabbouj

    Abstract: Coronavirus disease 2019 (COVID-19) has been diagnosed automatically using Machine Learning algorithms over chest X-ray (CXR) images. However, most of the earlier studies used Deep Learning models over scarce datasets bearing the risk of overfitting. Additionally, previous studies have revealed the fact that deep networks are not reliable for classification since their decisions may originate from… ▽ More

    Submitted 30 May, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

  46. arXiv:2202.09918  [pdf, other

    cs.CV cs.LG

    SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection

    Authors: Mete Ahishali, Serkan Kiranyaz, Iftikhar Ahmad, Moncef Gabbouj

    Abstract: The band selection in the hyperspectral image (HSI) data processing is an important task considering its effect on the computational complexity and accuracy. In this work, we propose a novel framework for the band selection problem: Self-Representation Learning (SRL) with Sparse 1D-Operational Autoencoder (SOA). The proposed SLR-SOA approach introduces a novel autoencoder model, SOA, that is desig… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  47. arXiv:2202.04678  [pdf, other

    cs.LG cs.AI math.SP

    Non-Linear Spectral Dimensionality Reduction Under Uncertainty

    Authors: Firas Laakom, Jenni Raitoharju, Nikolaos Passalis, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: In this paper, we consider the problem of non-linear dimensionality reduction under uncertainty, both from a theoretical and algorithmic perspectives. Since real-world data usually contain measurements with uncertainties and artifacts, the input space in the proposed framework consists of probability distributions to model the uncertainties associated with each sample. We propose a new dimensional… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: 10 pages, 3 figures

  48. arXiv:2202.04629  [pdf, other

    cs.CV eess.IV

    Reducing Redundancy in the Bottleneck Representation of the Autoencoders

    Authors: Firas Laakom, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: Autoencoders are a type of unsupervised neural networks, which can be used to solve various tasks, e.g., dimensionality reduction, image compression, and image denoising. An AE has two goals: (i) compress the original input to a low-dimensional space at the bottleneck of the network topology using an encoder, (ii) reconstruct the input from the representation at the bottleneck using a decoder. Bot… ▽ More

    Submitted 23 November, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 6 pages,4 figures. The paper is under consideration at Pattern Recognition Letters

  49. arXiv:2202.00589  [pdf

    eess.SP cs.AI cs.LG

    Blind ECG Restoration by Operational Cycle-GANs

    Authors: Serkan Kiranyaz, Ozer Can Devecioglu, Turker Ince, Junaid Malik, Muhammad Chowdhury, Tahir Hamid, Rashid Mazhar, Amith Khandakar, Anas Tahir, Tawsifur Rahman, Moncef Gabbouj

    Abstract: Continuous long-term monitoring of electrocardiography (ECG) signals is crucial for the early detection of cardiac abnormalities such as arrhythmia. Non-clinical ECG recordings acquired by Holter and wearable ECG sensors often suffer from severe artifacts such as baseline wander, signal cuts, motion artifacts, variations on QRS amplitude, noise, and other interferences. Usually, a set of such arti… ▽ More

    Submitted 29 January, 2022; originally announced February 2022.

    Comments: 16 pages, 10 figures, journal article submission

  50. arXiv:2201.11095  [pdf, other

    cs.CV cs.SD eess.AS

    Self-attention fusion for audiovisual emotion recognition with incomplete data

    Authors: Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj

    Abstract: In this paper, we consider the problem of multimodal data analysis with a use case of audiovisual emotion recognition. We propose an architecture capable of learning from raw data and describe three variants of it with distinct modality fusion mechanisms. While most of the previous works consider the ideal scenario of presence of both modalities at all times during inference, we evaluate the robus… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.