
Showing 1–14 of 14 results for author: Marin-Jimenez, M J

Searching in archive cs.
  1. arXiv:2404.04120  [pdf, other]

    cs.CV

    Cross-Modality Gait Recognition: Bridging LiDAR and Camera Modalities for Human Identification

    Authors: Rui Wang, Chuanfu Shen, Manuel J. Marin-Jimenez, George Q. Huang, Shiqi Yu

    Abstract: Current gait recognition research mainly focuses on identifying pedestrians captured by the same type of sensor, neglecting the fact that individuals may be captured by different sensors in order to adapt to various environments. A more practical approach should involve cross-modality matching across different sensors. Hence, this paper focuses on investigating the problem of cross-modality gait r…

    Submitted 4 April, 2024; originally announced April 2024.

  2. arXiv:2211.16289  [pdf, other]

    cs.CV

    Lightweight Structure-Aware Attention for Visual Understanding

    Authors: Heeseung Kwon, Francisco M. Castro, Manuel J. Marin-Jimenez, Nicolas Guil, Karteek Alahari

    Abstract: Vision Transformers (ViTs) have become a dominant paradigm for visual representation learning with self-attention operators. Although these operators provide flexibility to the model with their adjustable attention kernels, they suffer from inherent limitations: (1) the attention kernel is not discriminative enough, resulting in high redundancy of the ViT layers, and (2) the complexity in computat…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 8 pages, 5 figures

  3. LAEO-Net++: revisiting people Looking At Each Other in videos

    Authors: Manuel J. Marin-Jimenez, Vicky Kalogeiton, Pablo Medina-Suarez, Andrew Zisserman

    Abstract: Capturing the 'mutual gaze' of people is essential for understanding and interpreting the social interactions between them. To this end, this paper addresses the problem of detecting people Looking At Each Other (LAEO) in video sequences. For this purpose, we propose LAEO-Net++, a new deep CNN for determining LAEO in videos. In contrast to previous works, LAEO-Net++ takes spatio-temporal tracks as…

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 16 pages, 16 Figures. arXiv admin note: substantial text overlap with arXiv:1906.05261

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

  4. arXiv:2011.01890  [pdf, other]

    cs.CV cs.AI

    RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild

    Authors: Rafael Berral-Soler, Francisco J. Madrid-Cuevas, Rafael Muñoz-Salinas, Manuel J. Marín-Jiménez

    Abstract: Human head pose estimation in images has applications in many fields such as human-computer interaction or video surveillance tasks. In this work, we address this problem, defined here as the estimation of both vertical (tilt/pitch) and horizontal (pan/yaw) angles, through the use of a single Convolutional Neural Network (ConvNet) model, trying to balance precision and inference speed in order to…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted for publication at Neural Computing and Applications

  5. arXiv:2008.13507  [pdf, other]

    cs.CV cs.AI

    iLGaCo: Incremental Learning of Gait Covariate Factors

    Authors: Zihao Mu, Francisco M. Castro, Manuel J. Marin-Jimenez, Nicolas Guil, Yan-ran Li, Shiqi Yu

    Abstract: Gait is a popular biometric pattern used for identifying people based on their way of walking. Traditionally, gait recognition approaches based on deep learning are trained using the whole training dataset. In fact, if new data (classes, view-points, walking conditions, etc.) need to be included, it is necessary to re-train again the model with old and new data samples. In this paper, we propose…

    Submitted 31 August, 2020; originally announced August 2020.

    Comments: Accepted for presentation at IJCB'2020

  6. arXiv:1906.05261  [pdf, other]

    cs.CV

    LAEO-Net: revisiting people Looking At Each Other in videos

    Authors: Manuel J. Marin-Jimenez, Vicky Kalogeiton, Pablo Medina-Suarez, Andrew Zisserman

    Abstract: Capturing the 'mutual gaze' of people is essential for understanding and interpreting the social interactions between them. To this end, this paper addresses the problem of detecting people Looking At Each Other (LAEO) in video sequences. For this purpose, we propose LAEO-Net, a new deep CNN for determining LAEO in videos. In contrast to previous works, LAEO-Net takes spatio-temporal tracks as inp…

    Submitted 12 June, 2019; originally announced June 2019.

    Comments: CVPR 2019

  7. arXiv:1808.00286  [pdf, other]

    cs.CV

    Energy-based Tuning of Convolutional Neural Networks on Multi-GPUs

    Authors: Francisco M. Castro, Nicolás Guil, Manuel J. Marín-Jiménez, Jesús Pérez-Serrano, Manuel Ujaldón

    Abstract: Deep Learning (DL) applications are gaining momentum in the realm of Artificial Intelligence, particularly after GPUs have demonstrated remarkable skills for accelerating their challenging computational requirements. Within this context, Convolutional Neural Network (CNN) models constitute a representative example of success on a wide set of complex applications, particularly on datasets where the…

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: To appear in Concurrency and Computation: Practice and Experience

  8. arXiv:1807.09536  [pdf, other]

    cs.CV

    End-to-End Incremental Learning

    Authors: Francisco M. Castro, Manuel J. Marín-Jiménez, Nicolás Guil, Cordelia Schmid, Karteek Alahari

    Abstract: Although deep learning approaches have stood out in recent years due to their state-of-the-art results, they continue to suffer from catastrophic forgetting, a dramatic decrease in overall performance when training with new classes added incrementally. This is due to current neural network architectures requiring the entire dataset, consisting of all the samples from the old as well as the new cla…

    Submitted 3 September, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: To appear in ECCV 2018

  9. arXiv:1807.05389  [pdf, other]

    cs.CV cs.HC

    3D human pose estimation from depth maps using a deep combination of poses

    Authors: Manuel J. Marin-Jimenez, Francisco J. Romero-Ramirez, Rafael Muñoz-Salinas, Rafael Medina-Carnicer

    Abstract: Many real-world applications require the estimation of human body joints for higher-level tasks as, for example, human behaviour understanding. In recent years, depth sensors have become a popular approach to obtain three-dimensional information. The depth maps generated by these sensors provide information that can be employed to disambiguate the poses observed in two-dimensional images. This wor…

    Submitted 14 July, 2018; originally announced July 2018.

    Comments: Accepted for publication at "Journal of Visual Communication and Image Representation"

  10. arXiv:1806.07753  [pdf, other]

    cs.CV

    Multimodal feature fusion for CNN-based gait recognition: an empirical comparison

    Authors: Francisco Manuel Castro, Manuel Jesús Marín-Jiménez, Nicolás Guil, Nicolás Pérez de la Blanca

    Abstract: People identification in video based on the way they walk (i.e. gait) is a relevant task in computer vision using a non-invasive approach. Standard and current approaches typically derive gait signatures from sequences of binary energy maps of subjects extracted from images, but this process introduces a large amount of non-stationary noise, thus, conditioning their efficacy. In contrast, in this…

    Submitted 20 February, 2020; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: arXiv admin note: text overlap with arXiv:1603.01006

  11. arXiv:1606.00151  [pdf, other]

    cs.CV

    Mapping and Localization from Planar Markers

    Authors: Rafael Muñoz-Salinas, Manuel J. Marín-Jimenez, Enrique Yeguas-Bolivar, Rafael Medina-Carnicer

    Abstract: Squared planar markers are a popular tool for fast, accurate and robust camera localization, but its use is frequently limited to a single marker, or at most, to a small set of them for which their relative pose is known beforehand. Mapping and localization from a large set of planar markers is yet a scarcely treated problem in favour of keypoint-based approaches. However, while keypoint detectors…

    Submitted 25 January, 2017; v1 submitted 1 June, 2016; originally announced June 2016.

    Comments: Paper submitted to journal. Code available. See webpage http://www.uco.es/investiga/grupos/ava/node/57/

  12. arXiv:1603.01006  [pdf, other]

    cs.CV cs.AI

    Automatic learning of gait signatures for people identification

    Authors: F. M. Castro, M. J. Marin-Jimenez, N. Guil, N. Perez de la Blanca

    Abstract: This work targets people identification in video based on the way they walk (i.e. gait). While classical methods typically derive gait signatures from sequences of binary silhouettes, in this work we explore the use of convolutional neural networks (CNN) for learning high-level descriptors from low-level motion features (i.e. optical flow components). We carry out a thorough experimental evaluatio…

    Submitted 14 June, 2016; v1 submitted 3 March, 2016; originally announced March 2016.

    Comments: Proof of concept paper. Technical report on the use of ConvNets (CNN) for gait recognition. Data and code: http://www.uco.es/~in1majim/research/cnngaitof.html

    Report number: 2016-03

  13. arXiv:1601.06931  [pdf, other]

    cs.CV cs.AI

    Fisher Motion Descriptor for Multiview Gait Recognition

    Authors: F. M. Castro, M. J. Marín-Jiménez, N. Guil, R. Muñoz-Salinas

    Abstract: The goal of this paper is to identify individuals by analyzing their gait. Instead of using binary silhouettes as input data (as done in many previous works) we propose and evaluate the use of motion descriptors based on densely sampled short-term trajectories. We take advantage of state-of-the-art people detectors to define custom spatial configurations of the descriptors around the target person…

    Submitted 26 January, 2016; originally announced January 2016.

    Comments: This paper extends with new experiments the one published at ICPR'2014

  14. arXiv:1403.6950  [pdf, other]

    cs.CV

    Pyramidal Fisher Motion for Multiview Gait Recognition

    Authors: F. M. Castro, M. J. Marin-Jimenez, R. Medina-Carnicer

    Abstract: The goal of this paper is to identify individuals by analyzing their gait. Instead of using binary silhouettes as input data (as done in many previous works) we propose and evaluate the use of motion descriptors based on densely sampled short-term trajectories. We take advantage of state-of-the-art people detectors to define custom spatial configurations of the descriptors around the target person…

    Submitted 27 March, 2014; originally announced March 2014.

    Comments: Submitted to International Conference on Pattern Recognition, ICPR, 2014