Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency
DOI:
https://doi.org/10.1609/aaai.v38i12.29251Keywords:
ML: Transfer, Domain Adaptation, Multi-Task Learning, CV: Low Level & Physics-based Vision, CV: Scene Analysis & Understanding, CV: Vision for Robotics & Autonomous Driving, ML: Deep Learning Algorithms, ML: Semi-Supervised Learning, ML: Transparent, Interpretable, Explainable ML, ML: Unsupervised & Self-Supervised LearningAbstract
Although recent methods in Unsupervised Domain Adaptation (UDA) have achieved success in segmenting rainy or snowy scenes by improving consistency, they face limitations when dealing with more challenging scenarios like foggy and night scenes. We argue that these prior methods excessively focus on weather-specific features in adverse scenes, which exacerbates the existing domain gaps. To address this issue, we propose a new metric to evaluate the severity of all adverse scenes and offer a novel perspective that enables task unification across all adverse scenarios. Our method focuses on Severity, allowing our model to learn more consistent features and facilitate domain distribution alignment, thereby alleviating domain gaps. Unlike the vague descriptions of consistency in previous methods, we introduce Cross-domain Consistency, which is quantified using the Structure Similarity Index Measure (SSIM) to measure the distance between the source and target domains. Specifically, our unified model consists of two key modules: the Merging Style Augmentation Module (MSA) and the Severity Perception Mask Module (SPM). The MSA module transforms all adverse scenes into augmented scenes, effectively eliminating weather-specific features and enhancing Cross-domain Consistency. The SPM module incorporates a Severity Perception mechanism, guiding a Mask operation that enables our model to learn highly consistent features from the augmented scenes. Our unified framework, named PASS (Parsing All adverSe Scenes), achieves significant performance improvements over state-of-the-art methods on widely-used benchmarks for all adverse scenes. Notably, the performance of PASS is superior to Semi-Unified models and even surpasses weather-specific models.Downloads
Published
2024-03-24
How to Cite
Li, F., Gong, Z., Deng, Y., Ma, X., Zhang, R., Ji, Z., Zhu, X., & Zhang, H. (2024). Parsing All Adverse Scenes: Severity-Aware Semantic Segmentation with Mask-Enhanced Cross-Domain Consistency. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13483-13491. https://doi.org/10.1609/aaai.v38i12.29251
Issue
Section
AAAI Technical Track on Machine Learning III