research-article

Fast and Accurate Lane Detection via Frequency Domain Learning

Authors:

Yulan Guo

MM '21: Proceedings of the 29th ACM International Conference on Multimedia

Pages 890 - 898

https://doi.org/10.1145/3474085.3475267

Published: 17 October 2021

Abstract

Lane detection should be both accurate and fast at runtime. State-of-the-art methods mainly address efficiency by directly compressing high-dimensional features. These methods usually suffer from information loss and cannot achieve satisfactory accuracy. To ensure the diversity of features and thereby preserve as much information as possible, we introduce multi-frequency analysis into lane detection. Specifically, we propose a multi-spectral feature compressor (MSFC) based on the two-dimensional (2D) discrete cosine transform (DCT) to compress features while preserving their diversity. We group features and associate each group with an individual frequency component, which incurs only 1/7 of the overhead of a one-dimensional convolution but preserves more information. Moreover, to further enhance the discriminability of features, we design a multi-spectral lane feature aggregator (MSFA) based on the one-dimensional (1D) DCT to aggregate the features of each lane according to their corresponding frequency components. The proposed method outperforms state-of-the-art methods (including LaneATT and UFLD) on the TuSimple, CULane, and LLAMAS benchmarks. For example, our method achieves 76.32% F1 at 237 FPS and 76.98% F1 at 164 FPS on CULane, which are 1.23% and 0.30% higher than LaneATT, respectively. Our code and models are available at https://github.com/harrylin-hyl/MSLD.
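The grouping idea described in the abstract can be illustrated with a short NumPy sketch. This is only an illustration under stated assumptions, not the authors' MSFC implementation (see the linked repository for that): the function names `dct2_basis` and `multi_spectral_compress`, the unnormalized DCT-II basis, the equal split of channels into groups, and the example frequencies are all hypothetical choices made here.

```python
import numpy as np


def dct2_basis(u, v, height, width):
    """Unnormalized 2D DCT-II basis of frequency (u, v) on a height x width grid."""
    ys = np.arange(height)
    xs = np.arange(width)
    row = np.cos(np.pi * u * (ys + 0.5) / height)  # vertical cosine, length H
    col = np.cos(np.pi * v * (xs + 0.5) / width)   # horizontal cosine, length W
    return np.outer(row, col)                      # shape (H, W)


def multi_spectral_compress(features, freqs):
    """Compress a (C, H, W) feature map into a length-C vector.

    Assumption for this sketch: channels are split into len(freqs) equal
    groups, and every channel in group g is projected onto the single DCT
    basis with frequency freqs[g].
    """
    channels, height, width = features.shape
    groups = len(freqs)
    if channels % groups != 0:
        raise ValueError("number of channels must be divisible by number of groups")
    per_group = channels // groups

    out = np.empty(channels, dtype=float)
    for g, (u, v) in enumerate(freqs):
        basis = dct2_basis(u, v, height, width)
        block = features[g * per_group:(g + 1) * per_group]
        # Weighted spatial sum: one DCT coefficient per channel.
        out[g * per_group:(g + 1) * per_group] = (block * basis).sum(axis=(1, 2))
    return out


# Usage: 4 channels in 2 groups, one frequency per group (frequencies chosen arbitrarily).
features = np.ones((4, 3, 5))
compressed = multi_spectral_compress(features, freqs=[(0, 0), (1, 0)])
```

With frequency (0, 0) the basis is constant, so that group reduces to a plain spatial sum, which is global average pooling up to a scale factor. Assigning other frequencies to the remaining groups is what lets the compressed vector keep information that a single pooled component would discard.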



Published In


MM '21: Proceedings of the 29th ACM International Conference on Multimedia

October 2021

5796 pages

ISBN:9781450386517

DOI:10.1145/3474085

General Chairs:
Heng Tao Shen, University of Electronic Science and Technology of China, China
Yueting Zhuang, Zhejiang University, China
John R. Smith, IBM, USA

Program Chairs:
Yang Yang, University of Electronic Science and Technology of China, China
Pablo Cesar, CWI & TU Delft, The Netherlands
Florian Metze, Facebook, Inc., USA
Balakrishnan Prabhakaran, University of Texas at Dallas, USA

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States


Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China
Science & Technology Major Project of Hubei Province
National Key Research and Development Program of China
Major Project for Technological Innovation of Hubei Province

Conference

MM '21

Sponsor:

SIGMM

MM '21: ACM Multimedia Conference

October 20 - 24, 2021

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%


Bibliometrics & Citations

Article Metrics

Total Citations: 6
Total Downloads: 237
Downloads (Last 12 months): 15
Downloads (Last 6 weeks): 1

Reflects downloads up to 03 Mar 2025

Cited By

Liang K, Meng L, Liu Y, Liu M, Wei W, Liu S, Tu W, Wang S, Zhou S, Liu X, Cai J, Kankanhalli M, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh V, Cesar P, Xie L, Xu D. (2024). Simple Yet Effective: Structure Guided Pre-trained Transformer for Multi-modal Knowledge Graph Reasoning. Proceedings of the 32nd ACM International Conference on Multimedia, 1554-1563. Online publication date: 28-Oct-2024. https://dl.acm.org/doi/10.1145/3664647.3681112

Liu Y, Zhu L, Wan L, Wang X. (2024). Masked frequency-color fusion network for video instance-level hazy lane detection. The Visual Computer. Online publication date: 14-Oct-2024. https://doi.org/10.1007/s00371-024-03671-1

Liao M, Tian S, Zhang Y, Hua G, Zou W, Li X, El Saddik A, Mei T, Cucchiara R, Bertini M, Tobon Vallejo D, Atrey P, Hossain M. (2023). Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation. Proceedings of the 31st ACM International Conference on Multimedia, 2199-2210. Online publication date: 26-Oct-2023. https://dl.acm.org/doi/10.1145/3581783.3611792

Yang H, Lin S, Jiang R, Lu Y, Wang H. (2023). DQFORMER: Dynamic Query Transformer for Lane Detection. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1-5. Online publication date: 4-Jun-2023. https://doi.org/10.1109/ICASSP49357.2023.10097047

Park J, Johnson J. (2023). RGB No More: Minimally-Decoded JPEG Vision Transformers. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 22334-22346. Online publication date: Jun-2023. https://doi.org/10.1109/CVPR52729.2023.02139

Wu M, Li C, Yao Z. (2022). Deep Active Learning for Computer Vision Tasks: Methodologies, Applications, and Challenges. Applied Sciences 12:16 (8103). Online publication date: 12-Aug-2022. https://doi.org/10.3390/app12168103
