
Showing 1–12 of 12 results for author: Feris, R S

Searching in archive cs.
  1. arXiv:2112.00054  [pdf, other]

    cs.CV cs.LG

    Task2Sim : Towards Effective Pre-training and Transfer from Synthetic Data

    Authors: Samarth Mishra, Rameswar Panda, Cheng Perng Phoo, Chun-Fu Chen, Leonid Karlinsky, Kate Saenko, Venkatesh Saligrama, Rogerio S. Feris

    Abstract: Pre-training models on Imagenet or other massive datasets of real images has led to major advances in computer vision, albeit accompanied with shortcomings related to curation cost, privacy, usage rights, and ethical issues. In this paper, for the first time, we study the transferability of pre-trained models based on synthetic data generated by graphics simulators to downstream tasks from very di…

    Submitted 28 March, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: Accepted to CVPR'22

  2. arXiv:1901.10436  [pdf, other]

    cs.CV

    Diversity in Faces

    Authors: Michele Merler, Nalini Ratha, Rogerio S. Feris, John R. Smith

    Abstract: Face recognition is a long standing challenge in the field of Artificial Intelligence (AI). The goal is to create systems that accurately detect, recognize, verify, and understand human faces. There are significant technical hurdles in making these systems accurate, particularly in unconstrained settings due to confounding factors related to pose, resolution, illumination, occlusion, and viewpoint…

    Submitted 8 April, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Updated statistics after slight modification to dataset due to inactive links and deletions

  3. arXiv:1811.08815  [pdf, other]

    cs.CV

    Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection

    Authors: Khoi-Nguyen C. Mac, Dhiraj Joshi, Raymond A. Yeh, Jinjun Xiong, Rogerio S. Feris, Minh N. Do

    Abstract: Fine-grained action detection is an important task with numerous applications in robotics and human-computer interaction. Existing methods typically utilize a two-stage approach including extraction of local spatio-temporal features followed by temporal modeling to capture long-term dependencies. While most recent papers have focused on the latter (long-temporal modeling), here, we focus on produc…

    Submitted 6 November, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: Accepted at ICCV 2019 as oral

  4. arXiv:1805.00145  [pdf, other]

    cs.CV cs.AI

    Dialog-based Interactive Image Retrieval

    Authors: Xiaoxiao Guo, Hui Wu, Yu Cheng, Steven Rennie, Gerald Tesauro, Rogerio Schmidt Feris

    Abstract: Existing methods for interactive image retrieval have demonstrated the merit of integrating user feedback, improving retrieval results. However, most current systems rely on restricted forms of user feedback, such as binary relevance responses, or feedback based on a fixed set of relative attributes, which limits their impact. In this paper, we introduce a new approach to interactive image search…

    Submitted 20 December, 2018; v1 submitted 30 April, 2018; originally announced May 2018.

    Comments: Accepted at NeurIPS 2018

  5. arXiv:1801.06066  [pdf, other]

    cs.CV

    RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

    Authors: Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

    Abstract: We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response…

    Submitted 16 January, 2018; originally announced January 2018.

    Comments: International Journal of Computer Vision. arXiv admin note: text overlap with arXiv:1608.05477

  6. arXiv:1707.07075  [pdf, other]

    cs.CV cs.MM

    Automatic Curation of Golf Highlights using Multimodal Excitement Features

    Authors: Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogerio S. Feris

    Abstract: The production of sports highlight packages summarizing a game's most exciting moments is an essential task for broadcast media. Yet, it requires labor-intensive video editing. We propose a novel approach for auto-curating sports highlights, and use it to create a real-world system for the editorial aid of golf highlight reels. Our method fuses information from the players' reactions (action recog…

    Submitted 21 July, 2017; originally announced July 2017.

  7. arXiv:1608.05477  [pdf, other]

    cs.CV

    A Recurrent Encoder-Decoder Network for Sequential Face Alignment

    Authors: Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

    Abstract: We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment. Our proposed model predicts 2D facial point maps regularized by a regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response map and the input, in order to enable…

    Submitted 22 August, 2016; v1 submitted 18 August, 2016; originally announced August 2016.

    Comments: European Conference on Computer Vision (ECCV), 2016

  8. arXiv:1607.07155  [pdf, ps, other]

    cs.CV

    A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

    Authors: Zhaowei Cai, Quanfu Fan, Rogerio S. Feris, Nuno Vasconcelos

    Abstract: A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produ…

    Submitted 25 July, 2016; originally announced July 2016.

  9. arXiv:1604.06433  [pdf, other]

    cs.CV

    Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data

    Authors: Jing Wang, Yu Cheng, Rogerio Schmidt Feris

    Abstract: The way people look in terms of facial attributes (ethnicity, hair color, facial hair, etc.) and the clothes or accessories they wear (sunglasses, hat, hoodies, etc.) is highly dependent on geo-location and weather condition, respectively. This work explores, for the first time, the use of this contextual information, as people with wearable cameras walk across different neighborhoods of a city, i…

    Submitted 22 June, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: Paper accepted by CVPR 2016

  10. arXiv:1505.07922  [pdf, other]

    cs.CV

    Cross-domain Image Retrieval with a Dual Attribute-aware Ranking Network

    Authors: Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan

    Abstract: We address the problem of cross-domain image retrieval, considering the following practical application: given a user photo depicting a clothing image, our goal is to retrieve the same or attribute-similar clothing items from online shopping stores. This is a challenging problem due to the large discrepancy between online shopping images, usually taken in ideal lighting/pose/background conditions,…

    Submitted 29 May, 2015; originally announced May 2015.

  11. arXiv:1502.03436  [pdf, other]

    cs.CV

    An exploration of parameter redundancy in deep networks with circulant projections

    Authors: Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shih-Fu Chang

    Abstract: We explore the redundancy of parameters in deep neural networks by replacing the conventional linear projection in fully-connected layers with the circulant projection. The circulant structure substantially reduces memory footprint and enables the use of the Fast Fourier Transform to speed up the computation. Considering a fully-connected neural network layer with d input nodes, and d output nodes…

    Submitted 27 October, 2015; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: International Conference on Computer Vision (ICCV) 2015

  12. arXiv:cs/0509083  [pdf, ps, other]

    cs.CV

    Face Verification in Polar Frequency Domain: a Biologically Motivated Approach

    Authors: Yossi Zana, Roberto M. Cesar-Jr, Rogerio S. Feris, Matthew Turk

    Abstract: We present a novel local-based face verification system whose components are analogous to those of biological systems. In the proposed system, after global registration and normalization, three eye regions are converted from the spatial to polar frequency domain by a Fourier-Bessel Transform. The resulting representations are embedded in a dissimilarity space, where each image is represented by…

    Submitted 27 September, 2005; originally announced September 2005.

    Comments: 2005, International Symposium on Visual Computing (ISVC)