Author: Jiang, Jianmin : Search

research-article

RobustFace: Adaptive Mining of Noise and Hard Samples for Robust Face Recognitions

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 5065–5073, https://doi.org/10.1145/3664647.3681231

While margin-based deep face recognition models, such as ArcFace and AdaFace, have achieved remarkable successes over recent years, they may suffer from degraded performances when encountering training sets corrupted with noises. This is often inevitable ...

research-article

Effective Optimization of Root Selection Towards Improved Explanation of Deep Classifiers

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 5365–5373, https://doi.org/10.1145/3664647.3680866

Explaining what part of the input images primarily contributed to the predicted classification results by deep models has been widely researched over the years and many effective methods have been reported in the literature, for which deep Taylor ...

Article

Managing Traceability for Software Life Cycle Processes

Theoretical Aspects of Software Engineering, Pages 428–445, https://doi.org/10.1007/978-3-031-64626-3_25

Abstract

Various types of software artifacts are produced in the software life cycle processes. Although traceability between different artifacts is beneficial for software development, the actual practice of modeling and maintaining traceability is not ...

research-article

Dual-Clustered Conditioning Toward GAN-Based Diverse Image Generation

IEEE Transactions on Consumer Electronics (ITOCE), Volume 70, Issue 1, Pages 2817–2825, https://doi.org/10.1109/TCE.2024.3367170

Generative Artificial Intelligence (AI) has revolutionized image generation in the realm of consumer electronics, which has illustrated its significant impact on product development and user experiences. In this paper, we propose a class conditioned GAN ...

research-article

Weakly supervised semantic segmentation via self-supervised destruction learning

Neurocomputing (NEUROC), Volume 561, Issue C, https://doi.org/10.1016/j.neucom.2023.126821

Abstract

Currently, weakly supervised semantic segmentation approaches adopt the Class Activation Map (CAM) to generate the initial attention maps from the standard classification backbone network, with only image-level class labels as training ...

Highlights

A novel “destruction learning” method trained in a self-supervised manner.
The MDC module has stronger sensitivity to the mid-level local parts.
The LD module explores the local feature details from the original images.

research-article

Distortion-Aware Self-Supervised Indoor 360° Depth Estimation via Hybrid Projection Fusion and Structural Regularities

IEEE Transactions on Multimedia (TOM), Volume 26, Pages 3998–4011, https://doi.org/10.1109/TMM.2023.3318470

Owing to the rapid development of emerging 360° panoramic imaging techniques, indoor 360° ...

research-article

Surface Geometry Processing: An Efficient Normal-Based Detail Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 11, Pages 13749–13765, https://doi.org/10.1109/TPAMI.2023.3296509

With the rapid development of high-resolution 3D vision applications, the traditional way of manipulating surface detail requires considerable memory and computing time. To address these problems, we introduce an efficient surface detail processing ...

research-article

Deep stereoscopic image saliency inspired stereoscopic image thumbnail generation

Multimedia Tools and Applications (MTAA), Volume 81, Issue 29, Pages 42749–42767, https://doi.org/10.1007/s11042-022-13487-7

Abstract

In this paper, we propose a stereoscopic image thumbnail generation method guided by the stereoscopic image saliency. Specifically, we utilize an uncertain-weighted fusion mechanism to combine the spatial saliency information with the saliency ...

research-article

Learning Across Tasks for Zero-Shot Domain Adaptation From a Single Source Domain

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 44, Issue 10, Part 1, Pages 6264–6279, https://doi.org/10.1109/TPAMI.2021.3088859

Domain adaptation techniques learn transferable knowledge from a source domain to a target domain and train models that generalize well in the target domain. Unfortunately, a majority of the existing techniques are only applicable to scenarios that the ...

research-article

Preserving similarity order for unsupervised clustering

Pattern Recognition (PATT), Volume 128, Issue C, https://doi.org/10.1016/j.patcog.2022.108670

Highlights

Our method takes the ordering of pairwise distance as the supervisory signal to learn the similarity score function.

Abstract

Unsupervised clustering categorizes a sample set into several groups, where the samples in the same group share high-level concepts. As the clustering performances are heavily determined by the metric to assess the similarity between ...

research-article

Scheduling in Real-Time Mobile Systems

ACM Transactions on Embedded Computing Systems (TECS), Volume 21, Issue 3, Article No.: 34, Pages 1–36, https://doi.org/10.1145/3517747

To guarantee the safety and security of a real-time mobile system such as an intelligent transportation system, it is necessary to model and analyze its behaviors prior to actual development. In particular, the mobile objects in such systems must be ...

rapid-communication

Generative synthesis of logos across DCT domain

Neurocomputing (NEUROC), Volume 467, Issue C, Pages 163–172, https://doi.org/10.1016/j.neucom.2021.09.068

Abstract

Generative learning in pixel domain has achieved great success in exploiting their correlations in processing images towards desired objectives, yet learning in frequency domain could provide added benefits in exploiting pixel correlations ...

research-article

PR-RL: Portrait Relighting Via Deep Reinforcement Learning

IEEE Transactions on Multimedia (TOM), Volume 24, Pages 3240–3255, https://doi.org/10.1109/TMM.2021.3096009

In this paper, we propose a portrait relighting method based on deep reinforcement learning (called PR-RL). Our PR-RL model could conduct portrait relighting by sequentially predicting local light editing strokes, and use strokes to conduct dodge and burn ...

research-article

A Multi-Task Collaborative Network for Light Field Salient Object Detection

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 31, Issue 5, Pages 1849–1861, https://doi.org/10.1109/TCSVT.2020.3013119

Being able to predict the salient object is of fundamental importance in image processing and computer vision. With numerous approaches proposed for automatic image and video salient object detection, much less work has been dedicated to detecting and ...

research-article

Emotion Attention-Aware Collaborative Deep Reinforcement Learning for Image Cropping

IEEE Transactions on Multimedia (TOM), Volume 23, Pages 2545–2560, https://doi.org/10.1109/TMM.2020.3013350

This paper proposes a collaborative deep reinforcement learning model for automatic image cropping (called CDRL-IC). By modeling image cropping as a decision-making process of reinforcement learning, our model could generate optimal cropping result in a ...

research-article

A Brain-Media Deep Framework Towards Seeing Imaginations Inside Brains

IEEE Transactions on Multimedia (TOM), Volume 23, Pages 1454–1465, https://doi.org/10.1109/TMM.2020.2999183

While current research on multimedia is essentially dealing with the information derived from our observations of the world, internal activities inside human brains, such as imaginations and memories of past events etc., could become a brand new concept ...

research-article

Brain-media: A Dual Conditioned and Lateralization Supported GAN (DCLS-GAN) towards Visualization of Image-evoked Brain Activities

MM '20: Proceedings of the 28th ACM International Conference on Multimedia, Pages 1764–1772, https://doi.org/10.1145/3394171.3413858

Essentially, the current concept of multimedia is limited to presenting what people see in their eyes. What people think inside brains, however, remains a rich source of multimedia, such as imaginations of paradise and memories of good old days etc. In ...

Article

Adversarial Learning for Zero-Shot Domain Adaptation

Computer Vision – ECCV 2020, Pages 329–344, https://doi.org/10.1007/978-3-030-58589-1_20

Abstract

Zero-shot domain adaptation (ZSDA) is a category of domain adaptation problems where neither data sample nor label is available for parameter learning in the target domain. With the hypothesis that the shift between a given pair of domains is ...

research-article

Event-based functional decomposition

Information and Computation (ICOM), Volume 271, Issue C, https://doi.org/10.1016/j.ic.2019.104484

Abstract

Functional decomposition is the process of resolving a functional relationship into its constituent parts in such a way that the original function can be recomposed from those parts by functional composition. Perfect decomposition requires the ...

research-article

SA-Net: A deep spectral analysis network for image clustering

Neurocomputing (NEUROC), Volume 383, Issue C, Pages 10–23, https://doi.org/10.1016/j.neucom.2019.11.078

Highlights

Based on spectral analysis, we propose a novel deep learning framework, SA-Net for deep image clustering.

Abstract

Although supervised deep representation learning has attracted enormous attentions across areas of pattern recognition and computer vision, little progress has been made towards unsupervised deep representation learning for image ...
