Author: Li, Wanqing : Search

research-article

A fully trainable network with RNN-based pooling

Neurocomputing (NEUROC), Volume 338, Issue CPages 72–82https://doi.org/10.1016/j.neucom.2019.02.004

Highlights

A trainable RNN based pooling method is proposed.
CNNs are turned into fully ...

Abstract

Pooling is an important component in convolutional neural networks (CNNs) for aggregating features and reducing computational burden. Compared with other components such as convolutional layers and fully connected layers which are ...

survey

Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective

ACM Computing Surveys (CSUR), Volume 52, Issue 1Article No.: 7, Pages 1–38https://doi.org/10.1145/3291124

This article takes a problem-oriented perspective and presents a comprehensive review of transfer-learning methods, both shallow and deep, for cross-dataset visual recognition. Specifically, it categorises the cross-dataset recognition into 17 problems ...

research-article

Action recognition based on joint trajectory maps with convolutional neural networks

Knowledge-Based Systems (KNBS), Volume 158, Issue CPages 43–53https://doi.org/10.1016/j.knosys.2018.05.029

Abstract

Convolutional Neural Networks (ConvNets) have recently shown promising performance in many computer vision tasks, especially image-based recognition. How to effectively apply ConvNets to sequence-based data is still an open problem. ...

research-article

Smartphone-sensors Based Activity Recognition Using IndRNN

UbiComp '18: Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable ComputersPages 1541–1547https://doi.org/10.1145/3267305.3267521

Human activity recognition based on the smartphone sensors has the potential to impact a wide range of applications such as healthcare, smart home, and remote monitoring. For simple activities like "Sit" and "Walk", it can be distinguished relatively ...

research-article

Robust unsupervised feature selection via dual self-representation and manifold regularization

Knowledge-Based Systems (KNBS), Volume 145, Issue CPages 109–120https://doi.org/10.1016/j.knosys.2018.01.009

Abstract

Unsupervised feature selection has become an important and challenging pre-processing step in machine learning and data mining since large amount of unlabelled high dimensional data are often required to be processed. In this paper, we ...

research-article

Free

Cooperative training of deep aggregation networks for RGB-D action recognition

AAAI'18/IAAI'18/EAAI'18: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial IntelligenceArticle No.: 907, Pages 7404–7411

A novel deep neural network training paradigm that exploits the conjoint information in multiple heterogeneous sources is proposed. Specifically, in a RGB-D based action recognition task, it cooperatively trains a single convolutional neural network (...

research-article

Semantic action recognition by learning a pose lexicon

Pattern Recognition (PATT), Volume 72, Issue CPages 548–562https://doi.org/10.1016/j.patcog.2017.06.035

A novel semantic representation, pose lexicon, is proposed for action recognition.An extended hidden Markov alignment model is developed to learn a pose lexicon.A semantic action recognition method that is capable of zero-shot recognition is developed ...

research-article

Foreground detection in camouflaged scenes

2017 IEEE International Conference on Image Processing (ICIP)Pages 4247–4251https://doi.org/10.1109/ICIP.2017.8297083

Foreground detection has been widely studied for decades due to its importance in many practical applications. Most of the existing methods assume foreground and background show visually distinct characteristics and thus the foreground can be detected ...

research-article

Optimization of Camera Arrangement Using Correspondence Field to Improve Depth Estimation

IEEE Transactions on Image Processing (TIP), Volume 26, Issue 6Pages 3038–3050https://doi.org/10.1109/TIP.2017.2695102

Stereo matching algorithms attempt to estimate depth from the images obtained by two cameras. In most cases, the arrangement of cameras (their locations and orientations with respect to the scene) is determined based on human experience. In this paper, ...

research-article

An effective edge-preserving smoothing method for image manipulation

Digital Signal Processing (DISP), Volume 63, Issue CPages 10–24https://doi.org/10.1016/j.dsp.2016.10.009

This paper presents a novel and effective edge-preserving image smoothing method for edge-aware image manipulation. The method formulates the smoothing as a problem of minimizing a convex object function with a constraint and an efficient solution to ...

research-article

RGB-D-based action recognition datasets

Pattern Recognition (PATT), Volume 60, Issue CPages 86–105https://doi.org/10.1016/j.patcog.2016.05.019

Human action recognition from RGB-D (Red, Green, Blue and Depth) data has attracted increasing attention since the first work reported in 2010. Over this period, many benchmark datasets have been created to facilitate the development and evaluation of ...

research-article

Enhancing Project-Based Learning Through Student and Industry Engagement in a Video-Augmented 3-D Virtual Trade Fair

IEEE Transactions on Education (ITE), Volume 59, Issue 4Pages 290–298https://doi.org/10.1109/TE.2016.2546230

Project-based learning is a widely used pedagogical strategy in engineering education shown to be effective in fostering problem-solving, design, and teamwork skills. There are distinct benefits to be gained from giving students autonomy in determining ...

short-paper

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks

MM '16: Proceedings of the 24th ACM international conference on MultimediaPages 102–106https://doi.org/10.1145/2964284.2967191

Recently, Convolutional Neural Networks (ConvNets) have shown promising performances in many computer vision tasks, especially image-based recognition. How to effectively use ConvNets for video-based recognition is still an open problem. In this paper, ...

article

Guest Editorial: Human Activity Understanding from 2D and 3D Data

International Journal of Computer Vision (IJCV), Volume 118, Issue 2Pages 113–114https://doi.org/10.1007/s11263-016-0915-4

article

Efficient 2D viewpoint combination for human action recognition

Pattern Analysis & Applications (PAAS), Volume 19, Issue 2Pages 563–577https://doi.org/10.1007/s10044-016-0537-z

The ability to recognize human actions using a single viewpoint is affected by phenomena such as self-occlusions or occlusions by other objects. Incorporating multiple cameras can help overcome these issues. However, the question remains how to ...

research-article

Learning structured dictionary based on inter-class similarity and representative margins

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Pages 2399–2403https://doi.org/10.1109/ICASSP.2016.7472107

We consider the problem of learning a structured and discriminative dictionary based on sparse representation for classification task. The structure comprises class-shared and class-specific partitions which allows the separation of common and class-...

research-article

Human detection from images and videos

Pattern Recognition (PATT), Volume 51, Issue CPages 148–175https://doi.org/10.1016/j.patcog.2015.08.027

The problem of human detection is to automatically locate people in an image or video sequence and has been actively researched in the past decade. This paper aims to provide a comprehensive survey on the recent development and challenges of human ...

Article

Beyond Covariance: Feature Representation with Nonlinear Kernel Matrices

ICCV '15: Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV)Pages 4570–4578https://doi.org/10.1109/ICCV.2015.519

Covariance matrix has recently received increasing attention in computer vision by leveraging Riemannian geometry of symmetric positive-definite (SPD) matrices. Originally proposed as a region descriptor, it has now been used as a generic representation ...

short-paper

ConvNets-Based Action Recognition from Depth Maps through Virtual Cameras and Pseudocoloring

MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1119–1122https://doi.org/10.1145/2733373.2806296

In this paper, we propose to adopt ConvNets to recognize human actions from depth maps on relatively small datasets based on Depth Motion Maps (DMMs). In particular, three strategies are developed to effectively leverage the capability of ConvNets in ...

research-article

Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video

IEEE Transactions on Multimedia (TOM), Volume 17, Issue 10Pages 1677–1693https://doi.org/10.1109/TMM.2015.2447274

In a light field-based free viewpoint video (LF-based FVV) system, effective sampling density (ESD) is defined as the number of rays per unit area of the scene that has been acquired and is selected in the rendering process for reconstructing an unknown ...

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Results

A fully trainable network with RNN-based pooling

Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective

Action recognition based on joint trajectory maps with convolutional neural networks

Smartphone-sensors Based Activity Recognition Using IndRNN

Robust unsupervised feature selection via dual self-representation and manifold regularization

Cooperative training of deep aggregation networks for RGB-D action recognition

Semantic action recognition by learning a pose lexicon

Foreground detection in camouflaged scenes

Optimization of Camera Arrangement Using Correspondence Field to Improve Depth Estimation

An effective edge-preserving smoothing method for image manipulation

RGB-D-based action recognition datasets

Enhancing Project-Based Learning Through Student and Industry Engagement in a Video-Augmented 3-D Virtual Trade Fair

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks

Guest Editorial: Human Activity Understanding from 2D and 3D Data

Efficient 2D viewpoint combination for human action recognition

Learning structured dictionary based on inter-class similarity and representative margins

Human detection from images and videos

Beyond Covariance: Feature Representation with Nonlinear Kernel Matrices

ConvNets-Based Action Recognition from Depth Maps through Virtual Cameras and Pseudocoloring

Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder