Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleApril 2019
A fully trainable network with RNN-based pooling
Highlights- A trainable RNN based pooling method is proposed.
- CNNs are turned into fully ...
Pooling is an important component in convolutional neural networks (CNNs) for aggregating features and reducing computational burden. Compared with other components such as convolutional layers and fully connected layers which are ...
- surveyFebruary 2019
Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective
ACM Computing Surveys (CSUR), Volume 52, Issue 1Article No.: 7, Pages 1–38https://doi.org/10.1145/3291124This article takes a problem-oriented perspective and presents a comprehensive review of transfer-learning methods, both shallow and deep, for cross-dataset visual recognition. Specifically, it categorises the cross-dataset recognition into 17 problems ...
- research-articleOctober 2018
Action recognition based on joint trajectory maps with convolutional neural networks
Knowledge-Based Systems (KNBS), Volume 158, Issue CPages 43–53https://doi.org/10.1016/j.knosys.2018.05.029AbstractConvolutional Neural Networks (ConvNets) have recently shown promising performance in many computer vision tasks, especially image-based recognition. How to effectively apply ConvNets to sequence-based data is still an open problem. ...
- research-articleOctober 2018
Smartphone-sensors Based Activity Recognition Using IndRNN
UbiComp '18: Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable ComputersPages 1541–1547https://doi.org/10.1145/3267305.3267521Human activity recognition based on the smartphone sensors has the potential to impact a wide range of applications such as healthcare, smart home, and remote monitoring. For simple activities like "Sit" and "Walk", it can be distinguished relatively ...
- research-articleApril 2018
Robust unsupervised feature selection via dual self-representation and manifold regularization
Knowledge-Based Systems (KNBS), Volume 145, Issue CPages 109–120https://doi.org/10.1016/j.knosys.2018.01.009AbstractUnsupervised feature selection has become an important and challenging pre-processing step in machine learning and data mining since large amount of unlabelled high dimensional data are often required to be processed. In this paper, we ...
- research-articleFebruary 2018
Cooperative training of deep aggregation networks for RGB-D action recognition
AAAI'18/IAAI'18/EAAI'18: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial IntelligenceArticle No.: 907, Pages 7404–7411A novel deep neural network training paradigm that exploits the conjoint information in multiple heterogeneous sources is proposed. Specifically, in a RGB-D based action recognition task, it cooperatively trains a single convolutional neural network (...
- research-articleDecember 2017
Semantic action recognition by learning a pose lexicon
Pattern Recognition (PATT), Volume 72, Issue CPages 548–562https://doi.org/10.1016/j.patcog.2017.06.035A novel semantic representation, pose lexicon, is proposed for action recognition.An extended hidden Markov alignment model is developed to learn a pose lexicon.A semantic action recognition method that is capable of zero-shot recognition is developed ...
- research-articleSeptember 2017
Foreground detection in camouflaged scenes
2017 IEEE International Conference on Image Processing (ICIP)Pages 4247–4251https://doi.org/10.1109/ICIP.2017.8297083Foreground detection has been widely studied for decades due to its importance in many practical applications. Most of the existing methods assume foreground and background show visually distinct characteristics and thus the foreground can be detected ...
- research-articleJune 2017
Optimization of Camera Arrangement Using Correspondence Field to Improve Depth Estimation
IEEE Transactions on Image Processing (TIP), Volume 26, Issue 6Pages 3038–3050https://doi.org/10.1109/TIP.2017.2695102Stereo matching algorithms attempt to estimate depth from the images obtained by two cameras. In most cases, the arrangement of cameras (their locations and orientations with respect to the scene) is determined based on human experience. In this paper, ...
- research-articleApril 2017
An effective edge-preserving smoothing method for image manipulation
Digital Signal Processing (DISP), Volume 63, Issue CPages 10–24https://doi.org/10.1016/j.dsp.2016.10.009This paper presents a novel and effective edge-preserving image smoothing method for edge-aware image manipulation. The method formulates the smoothing as a problem of minimizing a convex object function with a constraint and an efficient solution to ...
- research-articleDecember 2016
RGB-D-based action recognition datasets
Pattern Recognition (PATT), Volume 60, Issue CPages 86–105https://doi.org/10.1016/j.patcog.2016.05.019Human action recognition from RGB-D (Red, Green, Blue and Depth) data has attracted increasing attention since the first work reported in 2010. Over this period, many benchmark datasets have been created to facilitate the development and evaluation of ...
- research-articleNovember 2016
Enhancing Project-Based Learning Through Student and Industry Engagement in a Video-Augmented 3-D Virtual Trade Fair
IEEE Transactions on Education (ITE), Volume 59, Issue 4Pages 290–298https://doi.org/10.1109/TE.2016.2546230Project-based learning is a widely used pedagogical strategy in engineering education shown to be effective in fostering problem-solving, design, and teamwork skills. There are distinct benefits to be gained from giving students autonomy in determining ...
- short-paperOctober 2016
Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks
MM '16: Proceedings of the 24th ACM international conference on MultimediaPages 102–106https://doi.org/10.1145/2964284.2967191Recently, Convolutional Neural Networks (ConvNets) have shown promising performances in many computer vision tasks, especially image-based recognition. How to effectively use ConvNets for video-based recognition is still an open problem. In this paper, ...
- articleMay 2016
Efficient 2D viewpoint combination for human action recognition
Pattern Analysis & Applications (PAAS), Volume 19, Issue 2Pages 563–577https://doi.org/10.1007/s10044-016-0537-zThe ability to recognize human actions using a single viewpoint is affected by phenomena such as self-occlusions or occlusions by other objects. Incorporating multiple cameras can help overcome these issues. However, the question remains how to ...
- research-articleMarch 2016
Learning structured dictionary based on inter-class similarity and representative margins
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Pages 2399–2403https://doi.org/10.1109/ICASSP.2016.7472107We consider the problem of learning a structured and discriminative dictionary based on sparse representation for classification task. The structure comprises class-shared and class-specific partitions which allows the separation of common and class-...
- research-articleMarch 2016
Human detection from images and videos
Pattern Recognition (PATT), Volume 51, Issue CPages 148–175https://doi.org/10.1016/j.patcog.2015.08.027The problem of human detection is to automatically locate people in an image or video sequence and has been actively researched in the past decade. This paper aims to provide a comprehensive survey on the recent development and challenges of human ...
- ArticleDecember 2015
Beyond Covariance: Feature Representation with Nonlinear Kernel Matrices
ICCV '15: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV)Pages 4570–4578https://doi.org/10.1109/ICCV.2015.519Covariance matrix has recently received increasing attention in computer vision by leveraging Riemannian geometry of symmetric positive-definite (SPD) matrices. Originally proposed as a region descriptor, it has now been used as a generic representation ...
- short-paperOctober 2015
ConvNets-Based Action Recognition from Depth Maps through Virtual Cameras and Pseudocoloring
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1119–1122https://doi.org/10.1145/2733373.2806296In this paper, we propose to adopt ConvNets to recognize human actions from depth maps on relatively small datasets based on Depth Motion Maps (DMMs). In particular, three strategies are developed to effectively leverage the capability of ConvNets in ...
- research-articleOctober 2015
Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video
IEEE Transactions on Multimedia (TOM), Volume 17, Issue 10Pages 1677–1693https://doi.org/10.1109/TMM.2015.2447274In a light field-based free viewpoint video (LF-based FVV) system, effective sampling density (ESD) is defined as the number of rays per unit area of the scene that has been acquired and is selected in the rendering process for reconstructing an unknown ...