: Search

research-article

Free

Exploring Matching Rates: From Keypoint Selection to Camera Relocalization

MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 506–514https://doi.org/10.1145/3664647.3681628

Camera relocalization is a challenging task to estimate camera pose within a known scene, with wide applications in the fields of Virtual Reality (VR), Augmented Reality (AR), robotics, and etc. Most existing learning-based methods invariably utilize all ...

research-article

Free

CSO: Constraint-Guided Space Optimization for Active Scene Mapping

MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 5015–5024https://doi.org/10.1145/3664647.3681066

Simultaneously mapping and exploring a complex unknown scene is an NP-hard problem, which is still challenging with the rapid development of deep learning techniques. We present CSO, a deep reinforcement learning-based framework for efficient active ...

research-article

Diffusion prediction of competitive information with time-varying attractiveness in social networks

Information Processing and Management: an International Journal (IPRM), Volume 61, Issue 4https://doi.org/10.1016/j.ipm.2024.103739

Highlights

Our study delved into the intricate dynamics of information propagation in social networks, unraveling the complexities of competing information from various sources.
Unlike traditional models assuming static attractiveness, we ...

Abstract

The ubiquity of social media has facilitated the simultaneous dissemination of large-scale information within online social networks. By assuming that information attractiveness is static, numerous studies have been devoted to the analysis of ...

research-article

Adaptive Fuzzy Practical Predefined-Time Bipartite Consensus Tracking Control for Heterogeneous Nonlinear MASs With Actuator Faults

IEEE Transactions on Fuzzy Systems (TOFS), Volume 32, Issue 5Pages 3071–3083https://doi.org/10.1109/TFUZZ.2024.3367305

This article focuses on the adaptive fuzzy practical predefined-time bipartite consensus tracking control (BCTC) problem for heterogeneous nonlinear multiagent systems (HNMASs) with actuator faults. First, the fuzzy logic systems are used to approximate ...

research-article

Attention-Bridged Modal Interaction for Text-to-Image Generation

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 7Pages 5400–5413https://doi.org/10.1109/TCSVT.2023.3347971

We propose a novel Text-to-Image Generation Network, Attention-bridged Modal Interaction Generative Adversarial Network (AMI-GAN), to better explore modal interaction and perception for high-quality image synthesis. The AMI-GAN contains two novel designs: ...

research-article

An intelligent Hybrid‐Q Learning clustering approach and resource management within heterogeneous cluster networks based on reinforcement learning

Transactions on Emerging Telecommunications Technologies (TETT), Volume 35, Issue 4https://doi.org/10.1002/ett.4852

Abstract

Recently, heterogeneous cluster networks (HCNs) have been the subject of significant research. The nature of the next‐generation HCN environment is decentralized and highly dynamic; optimization techniques cannot quite express the dynamic ...

We propose a Hybrid‐Q Learning (Hybrid QL)‐based clustering for IoT and WSN. Self‐learning solution to solve the problem of decentralized and dynamic self‐access for heterogeneous nodes. Our proposed model dynamic accessing system on node/agents ...

research-article

Distractor-Aware Event-Based Tracking

IEEE Transactions on Image Processing (TIP), Volume 32Pages 6129–6141https://doi.org/10.1109/TIP.2023.3326683

Event cameras, or dynamic vision sensors, have recently achieved success from fundamental vision tasks to high-level vision researches. Due to its ability to asynchronously capture light intensity changes, event camera has an inherent advantage to capture ...

research-article

ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis

IEEE Transactions on Multimedia (TOM), Volume 25Pages 8620–8631https://doi.org/10.1109/TMM.2023.3238554

We propose a novel Text-to-Image Generation Network, Adaptive Layout Refinement Generative Adversarial Network (ALR-GAN), to adaptively refine the layout of synthesized images without any auxiliary information. The ALR-GAN includes an Adaptive Layout ...

research-article

Discriminative matrix-variate restricted Boltzmann machine classification model

Wireless Networks (WIRE), Volume 27, Issue 5Pages 3621–3633https://doi.org/10.1007/s11276-019-02234-w

Abstract

Matrix-variate Restricted Boltzmann Machine (MVRBM), a variant of Restricted Boltzmann Machine, has demonstrated excellent capacity of modelling matrix variable. However, MVRBM is still an unsupervised generative model, and is usually used to ...

Article

Reweighted Non-convex Non-smooth Rank Minimization Based Spectral Clustering on Grassmann Manifold

Computer Vision – ACCV 2020Pages 562–577https://doi.org/10.1007/978-3-030-69541-5_34

Abstract

Low Rank Representation (LRR) based unsupervised clustering methods have achieved great success since these methods could explore low-dimensional subspace structure embedded in original data effectively. The conventional LRR methods generally ...

research-article

Matrix-variate variational auto-encoder with applications to image process

Journal of Visual Communication and Image Representation (JVCIR), Volume 67, Issue Chttps://doi.org/10.1016/j.jvcir.2019.102750

Abstract

Variational Auto-Encoder (VAE) is an important probabilistic technology to model 1D vectorial data. However, when applying VAE model to 2D image, vectorization is necessary. Vectorization process may lead to dimension curse and lose valuable ...

research-article

Surface Normal Data Guided Depth Recovery with Graph Laplacian Regularization

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in AsiaArticle No.: 24, Pages 1–6https://doi.org/10.1145/3338533.3366582

High-quality depth information has been increasingly used in many real-world multimedia applications in recent years. Due to the limitation of depth sensor and sensing technology, actually, the captured depth map usually has low resolution and black ...

research-article

Learning to Predict Bus Arrival Time From Heterogeneous Measurements via Recurrent Neural Network

IEEE Transactions on Intelligent Transportation Systems (TITS), Volume 20, Issue 9Pages 3283–3293https://doi.org/10.1109/TITS.2018.2873747

Bus arrival time prediction intends to improve the level of the services provided by transportation agencies. Intuitively, many stochastic factors affect the predictability of the arrival time, <italic>e.g.</italic>, weather and local events. Moreover, ...

research-article

Maximally Correlated Principal Component Analysis Based on Deep Parameterization Learning

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 13, Issue 4Article No.: 39, Pages 1–17https://doi.org/10.1145/3332183

Dimensionality reduction is widely used to deal with high-dimensional data. As a famous dimensionality reduction method, principal component analysis (PCA) aiming at finding the low dimension feature of original data has made great successes, and many ...

research-article

Tensorizing Restricted Boltzmann Machine

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 13, Issue 3Article No.: 30, Pages 1–16https://doi.org/10.1145/3321517

Restricted Boltzmann machine (RBM) is a famous model for feature extraction and can be used as an initializer for neural networks. When applying the classic RBM to multidimensional data such as 2D/3D tensors, one needs to vectorize such as high-order ...

article

Effective human action recognition using global and local offsets of skeleton joints

Multimedia Tools and Applications (MTAA), Volume 78, Issue 5Pages 6329–6353https://doi.org/10.1007/s11042-018-6370-1

Human action recognition based on 3D skeleton joints is an important yet challenging task. While many research work are devoted to 3D action recognition, they mainly suffer from two problems: complex model representation and low implementation ...

research-article

Unsupervised Learning of Human Pose Distance Metric via Sparsity Locality Preserving Projections

IEEE Transactions on Multimedia (TOM), Volume 21, Issue 2Pages 314–327https://doi.org/10.1109/TMM.2018.2859029

Human poses admit complicated articulations and multigranular similarity. Previous works on learning human pose metric utilize sparse models, which concentrate large weights on highly similar poses and fail to depict an overall structure of poses with ...

research-article

Tensor Completion From One-Bit Observations

IEEE Transactions on Image Processing (TIP), Volume 28, Issue 1Pages 170–180https://doi.org/10.1109/TIP.2018.2865837

The tensor completion issues have obtained a great deal of attention in the past few years. However, the data fidelity part minimizes a squared loss function, which may be inappropriate for the case of noisy one-bit observations. In this paper, we ...

research-article

Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization

IEEE Transactions on Intelligent Transportation Systems (ITS-TRANSACTIONS), Volume 19, Issue 10Pages 3208–3219https://doi.org/10.1109/TITS.2017.2771262

As increasing volumes of urban data are being available, new opportunities arise for data-driven analysis that can lead to improvements in the lives of citizens through evidence-based policies. In particular, taxi trip is an important urban sensor that ...

research-article

Localized LRR on Grassmann Manifold: An Extrinsic View

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 28, Issue 10Pages 2524–2536https://doi.org/10.1109/TCSVT.2017.2757063

Subspace data representation has recently become a common practice in many computer vision tasks. Low-rank representation (LRR) is one of the most successful models for clustering vectorial data according to their subspace structures. This paper explores ...

Applied Filters

People

Names

Institutions

Authors

Editors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Exploring Matching Rates: From Keypoint Selection to Camera Relocalization

CSO: Constraint-Guided Space Optimization for Active Scene Mapping

Diffusion prediction of competitive information with time-varying attractiveness in social networks

Adaptive Fuzzy Practical Predefined-Time Bipartite Consensus Tracking Control for Heterogeneous Nonlinear MASs With Actuator Faults

Attention-Bridged Modal Interaction for Text-to-Image Generation

An intelligent Hybrid‐Q Learning clustering approach and resource management within heterogeneous cluster networks based on reinforcement learning

Distractor-Aware Event-Based Tracking

ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis

Discriminative matrix-variate restricted Boltzmann machine classification model

Reweighted Non-convex Non-smooth Rank Minimization Based Spectral Clustering on Grassmann Manifold

Matrix-variate variational auto-encoder with applications to image process

Surface Normal Data Guided Depth Recovery with Graph Laplacian Regularization

Learning to Predict Bus Arrival Time From Heterogeneous Measurements via Recurrent Neural Network

Maximally Correlated Principal Component Analysis Based on Deep Parameterization Learning

Tensorizing Restricted Boltzmann Machine

Effective human action recognition using global and local offsets of skeleton joints

Unsupervised Learning of Human Pose Distance Metric via Sparsity Locality Preserving Projections

Tensor Completion From One-Bit Observations

Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization

Localized LRR on Grassmann Manifold: An Extrinsic View

Applied Filters

People

Names

Institutions

Authors

Editors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder