Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
Exploring Matching Rates: From Keypoint Selection to Camera Relocalization
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 506–514https://doi.org/10.1145/3664647.3681628Camera relocalization is a challenging task to estimate camera pose within a known scene, with wide applications in the fields of Virtual Reality (VR), Augmented Reality (AR), robotics, and etc. Most existing learning-based methods invariably utilize all ...
- research-articleOctober 2024
CSO: Constraint-Guided Space Optimization for Active Scene Mapping
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 5015–5024https://doi.org/10.1145/3664647.3681066Simultaneously mapping and exploring a complex unknown scene is an NP-hard problem, which is still challenging with the rapid development of deep learning techniques. We present CSO, a deep reinforcement learning-based framework for efficient active ...
- research-articleJuly 2024
Diffusion prediction of competitive information with time-varying attractiveness in social networks
Information Processing and Management: an International Journal (IPRM), Volume 61, Issue 4https://doi.org/10.1016/j.ipm.2024.103739Highlights- Our study delved into the intricate dynamics of information propagation in social networks, unraveling the complexities of competing information from various sources.
- Unlike traditional models assuming static attractiveness, we ...
The ubiquity of social media has facilitated the simultaneous dissemination of large-scale information within online social networks. By assuming that information attractiveness is static, numerous studies have been devoted to the analysis of ...
- research-articleFebruary 2024
Adaptive Fuzzy Practical Predefined-Time Bipartite Consensus Tracking Control for Heterogeneous Nonlinear MASs With Actuator Faults
IEEE Transactions on Fuzzy Systems (TOFS), Volume 32, Issue 5Pages 3071–3083https://doi.org/10.1109/TFUZZ.2024.3367305This article focuses on the adaptive fuzzy practical predefined-time bipartite consensus tracking control (BCTC) problem for heterogeneous nonlinear multiagent systems (HNMASs) with actuator faults. First, the fuzzy logic systems are used to approximate ...
- research-articleDecember 2023
Attention-Bridged Modal Interaction for Text-to-Image Generation
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 7Pages 5400–5413https://doi.org/10.1109/TCSVT.2023.3347971We propose a novel Text-to-Image Generation Network, Attention-bridged Modal Interaction Generative Adversarial Network (AMI-GAN), to better explore modal interaction and perception for high-quality image synthesis. The AMI-GAN contains two novel designs: ...
-
- research-articleSeptember 2023
An intelligent Hybrid‐Q Learning clustering approach and resource management within heterogeneous cluster networks based on reinforcement learning
- Fahad Razaque Mughal,
- Jingsha He,
- Nafei Zhu,
- Mutiq Almutiq,
- Fayaz Ali Dharejo,
- Deepak Kumar Jain,
- Saqib Hussain,
- Zulfiqar Ali Zardari
Transactions on Emerging Telecommunications Technologies (TETT), Volume 35, Issue 4https://doi.org/10.1002/ett.4852AbstractRecently, heterogeneous cluster networks (HCNs) have been the subject of significant research. The nature of the next‐generation HCN environment is decentralized and highly dynamic; optimization techniques cannot quite express the dynamic ...
We propose a Hybrid‐Q Learning (Hybrid QL)‐based clustering for IoT and WSN. Self‐learning solution to solve the problem of decentralized and dynamic self‐access for heterogeneous nodes. Our proposed model dynamic accessing system on node/agents ...
- research-articleOctober 2023
Distractor-Aware Event-Based Tracking
IEEE Transactions on Image Processing (TIP), Volume 32Pages 6129–6141https://doi.org/10.1109/TIP.2023.3326683Event cameras, or dynamic vision sensors, have recently achieved success from fundamental vision tasks to high-level vision researches. Due to its ability to asynchronously capture light intensity changes, event camera has an inherent advantage to capture ...
- research-articleJanuary 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
IEEE Transactions on Multimedia (TOM), Volume 25Pages 8620–8631https://doi.org/10.1109/TMM.2023.3238554We propose a novel Text-to-Image Generation Network, Adaptive Layout Refinement Generative Adversarial Network (ALR-GAN), to adaptively refine the layout of synthesized images without any auxiliary information. The ALR-GAN includes an Adaptive Layout ...
- research-articleJuly 2021
Discriminative matrix-variate restricted Boltzmann machine classification model
Wireless Networks (WIRE), Volume 27, Issue 5Pages 3621–3633https://doi.org/10.1007/s11276-019-02234-wAbstractMatrix-variate Restricted Boltzmann Machine (MVRBM), a variant of Restricted Boltzmann Machine, has demonstrated excellent capacity of modelling matrix variable. However, MVRBM is still an unsupervised generative model, and is usually used to ...
- ArticleNovember 2020
Reweighted Non-convex Non-smooth Rank Minimization Based Spectral Clustering on Grassmann Manifold
AbstractLow Rank Representation (LRR) based unsupervised clustering methods have achieved great success since these methods could explore low-dimensional subspace structure embedded in original data effectively. The conventional LRR methods generally ...
- research-articleFebruary 2020
Matrix-variate variational auto-encoder with applications to image process
Journal of Visual Communication and Image Representation (JVCIR), Volume 67, Issue Chttps://doi.org/10.1016/j.jvcir.2019.102750AbstractVariational Auto-Encoder (VAE) is an important probabilistic technology to model 1D vectorial data. However, when applying VAE model to 2D image, vectorization is necessary. Vectorization process may lead to dimension curse and lose valuable ...
- research-articleJanuary 2020
Surface Normal Data Guided Depth Recovery with Graph Laplacian Regularization
MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in AsiaArticle No.: 24, Pages 1–6https://doi.org/10.1145/3338533.3366582High-quality depth information has been increasingly used in many real-world multimedia applications in recent years. Due to the limitation of depth sensor and sensing technology, actually, the captured depth map usually has low resolution and black ...
- research-articleAugust 2019
Learning to Predict Bus Arrival Time From Heterogeneous Measurements via Recurrent Neural Network
IEEE Transactions on Intelligent Transportation Systems (TITS), Volume 20, Issue 9Pages 3283–3293https://doi.org/10.1109/TITS.2018.2873747Bus arrival time prediction intends to improve the level of the services provided by transportation agencies. Intuitively, many stochastic factors affect the predictability of the arrival time, <italic>e.g.</italic>, weather and local events. Moreover, ...
- research-articleJuly 2019
Maximally Correlated Principal Component Analysis Based on Deep Parameterization Learning
ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 13, Issue 4Article No.: 39, Pages 1–17https://doi.org/10.1145/3332183Dimensionality reduction is widely used to deal with high-dimensional data. As a famous dimensionality reduction method, principal component analysis (PCA) aiming at finding the low dimension feature of original data has made great successes, and many ...
- research-articleJune 2019
Tensorizing Restricted Boltzmann Machine
ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 13, Issue 3Article No.: 30, Pages 1–16https://doi.org/10.1145/3321517Restricted Boltzmann machine (RBM) is a famous model for feature extraction and can be used as an initializer for neural networks. When applying the classic RBM to multidimensional data such as 2D/3D tensors, one needs to vectorize such as high-order ...
- articleMarch 2019
Effective human action recognition using global and local offsets of skeleton joints
Multimedia Tools and Applications (MTAA), Volume 78, Issue 5Pages 6329–6353https://doi.org/10.1007/s11042-018-6370-1Human action recognition based on 3D skeleton joints is an important yet challenging task. While many research work are devoted to 3D action recognition, they mainly suffer from two problems: complex model representation and low implementation ...
- research-articleFebruary 2019
Unsupervised Learning of Human Pose Distance Metric via Sparsity Locality Preserving Projections
IEEE Transactions on Multimedia (TOM), Volume 21, Issue 2Pages 314–327https://doi.org/10.1109/TMM.2018.2859029Human poses admit complicated articulations and multigranular similarity. Previous works on learning human pose metric utilize sparse models, which concentrate large weights on highly similar poses and fail to depict an overall structure of poses with ...
- research-articleJanuary 2019
- research-articleOctober 2018
Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization
IEEE Transactions on Intelligent Transportation Systems (ITS-TRANSACTIONS), Volume 19, Issue 10Pages 3208–3219https://doi.org/10.1109/TITS.2017.2771262As increasing volumes of urban data are being available, new opportunities arise for data-driven analysis that can lead to improvements in the lives of citizens through evidence-based policies. In particular, taxi trip is an important urban sensor that ...
- research-articleOctober 2018
Localized LRR on Grassmann Manifold: An Extrinsic View
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 28, Issue 10Pages 2524–2536https://doi.org/10.1109/TCSVT.2017.2757063Subspace data representation has recently become a common practice in many computer vision tasks. Low-rank representation (LRR) is one of the most successful models for clustering vectorial data according to their subspace structures. This paper explores ...