Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3474085.3475267acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Fast and Accurate Lane Detection via Frequency Domain Learning

Published: 17 October 2021 Publication History

Abstract

It is desirable to maintain both high accuracy and runtime efficiency in lane detection. State-of-the-art methods mainly address the efficiency problem by direct compression of high-dimensional features. These methods usually suffer from information loss and cannot achieve satisfactory accuracy performance. To ensure the diversity of features and subsequently maintain information as much as possible, we introduce multi-frequency analysis into lane detection. Specifically, we propose a multi-spectral feature compressor (MSFC) based on two-dimensional (2D) discrete cosine transform (DCT) to compress features while preserving diversity information. We group features and associate each group with an individual frequency component, which incurs only 1/7 overhead of one-dimensional convolution operation but preserves more information. Moreover, to further enhance the discriminability of features, we design a multi-spectral lane feature aggregator (MSFA) based on one-dimensional (1D) DCT to aggregate features from each lane according to their corresponding frequency components. The proposed method outperforms the state-of-the-art methods (including LaneATT and UFLD) on TuSimple, CULane, and LLAMAS benchmarks. For example, our method achieves 76.32% F1 at 237 FPS and 76.98% F1 at 164 FPS on CULane, which is 1.23% and 0.30% higher than LaneATT. Our code and models are available at https://github.com/harrylin-hyl/MSLD.

References

[1]
Karsten Behrendt and Ryan Soussan. 2019. Unsupervised labeled lane marker dataset generation using maps. In Proceedings of the IEEE International Conference on Computer Vision, Vol. 3. 3--3.
[2]
Zhenpeng Chen, Qianfei Liu, and Chenfan Lian. 2019. PointLaneNet: Efficient end-to-end CNNs for Accurate Real-Time Lane Detection. In 2019 IEEE Intelligent Vehicles Symposium (IV). IEEE, 2563--2568.
[3]
Prema M Daigavane and Preeti R Bajaj. 2010. Road lane detection with improved canny edges using ant colony optimization. In 2010 3rd International Conference on Emerging Trends in Engineering and Technology. IEEE, 76--80.
[4]
Max Ehrlich and Larry S Davis. 2019. Deep residual learning in the jpeg transform domain. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 3484--3493.
[5]
Jonathan Frankle and Michael Carbin. 2018. The lottery ticket hypothesis: Finding sparse, trainable neural networks. arXiv preprint arXiv:1803.03635 (2018).
[6]
Lionel Gueguen, Alex Sergeev, Ben Kadlec, Rosanne Liu, and Jason Yosinski. 2018. Faster neural networks straight from jpeg. Advances in Neural Information Processing Systems, Vol. 31 (2018), 3933--3944.
[7]
S Han, H Mao, and WJ Dally. 2015. Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint (2015).
[8]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[9]
Yuenan Hou, Zheng Ma, Chunxiao Liu, Tak-Wai Hui, and Chen Change Loy. 2020. Inter-region affinity distillation for road marking segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12486--12495.
[10]
Yuenan Hou, Zheng Ma, Chunxiao Liu, and Chen Change Loy. 2019. Learning lightweight lane detection cnns by self attention distillation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 1013--1021.
[11]
Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7132--7141.
[12]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[13]
Xiang Li, Jun Li, Xiaolin Hu, and Jian Yang. 2019. Line-CNN: End-to-End Traffic line detection with line proposal unit. IEEE Transactions on Intelligent Transportation Systems, Vol. 21, 1 (2019), 248--258.
[14]
Zhenguo Li. 2020. CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending. (2020).
[15]
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980--2988.
[16]
Ruijin Liu, Zejian Yuan, Tie Liu, and Zhiliang Xiong. 2021. End-to-end lane shape prediction with transformers. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 3694--3702.
[17]
Tong Liu, Zhaowei Chen, Yi Yang, Zehao Wu, and Haowei Li. 2020. Lane detection in low-light conditions using an efficient data enhancement: Light conditions style transfer. In 2020 IEEE Intelligent Vehicles Symposium (IV). IEEE, 1394--1399.
[18]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21--37.
[19]
Hiren M Mandalia and Mandalia Dario D Salvucci. 2005. Using support vector machines for lane-change detection. In Proceedings of the human factors and ergonomics society annual meeting, Vol. 49. SAGE Publications Sage CA: Los Angeles, CA, 1965--1969.
[20]
Pavlo Molchanov, Arun Mallya, Stephen Tyree, Iuri Frosio, and Jan Kautz. 2019. Importance estimation for neural network pruning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11264--11272.
[21]
Xingang Pan, Jianping Shi, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2018. Spatial as deep: Spatial cnn for traffic scene understanding. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[22]
Jonah Philion. 2019. FastDraw: Addressing the long tail of lane detection by adapting a sequential prediction network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11582--11591.
[23]
Zequn Qin, Huanyu Wang, and Xi Li. 2020 a. Ultra fast structure-aware deep lane detection. arXiv preprint arXiv:2004.11757 (2020).
[24]
Zequn Qin, Pengyi Zhang, Fei Wu, and Xi Li. 2020 b. FcaNet: Frequency Channel Attention Networks. arXiv preprint arXiv:2012.11879 (2020).
[25]
Joseph Redmon and Ali Farhadi. 2018. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018).
[26]
Lucas Tabelini, Rodrigo Berriel, Thiago M Paixao, Claudine Badue, Alberto F De Souza, and Thiago Oliveira-Santos. 2020 a. PolyLaneNet: Lane estimation via deep polynomial regression. arXiv preprint arXiv:2004.10924 (2020).
[27]
Lucas Tabelini, Rodrigo Berriel, Thiago M Paix ao, Claudine Badue, Alberto F De Souza, and Thiago Olivera-Santos. 2020 b. Keep your Eyes on the Lane: Attention-guided Lane Detection. arXiv preprint arXiv:2010.12035 (2020).
[28]
TuSimple. 2020. Tusimple benchmark. https://github.com/TuSimple/tusimple-benchmark. Accessed (2020).
[29]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. arXiv preprint arXiv:1706.03762 (2017).
[30]
Kuan Wang, Zhijian Liu, Yujun Lin, Ji Lin, and Song Han. 2019. Haq: Hardware-aware automated quantization with mixed precision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8612--8620.
[31]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV). 3--19.
[32]
Seungwoo Yoo, Hee Seok Lee, Heesoo Myeong, Sungrack Yun, Hyoungwoo Park, Janghoon Cho, and Duck Hoon Kim. 2020. End-to-end lane marker detection via row-wise classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 1006--1007.
[33]
Tu Zheng, Hao Fang, Yi Zhang, Wenjian Tang, Zheng Yang, Haifeng Liu, and Deng Cai. 2020. RESA: Recurrent Feature-Shift Aggregator for Lane Detection. arXiv preprint arXiv:2008.13719 (2020).

Cited By

View all
  • (2024)Simple Yet Effective: Structure Guided Pre-trained Transformer for Multi-modal Knowledge Graph ReasoningProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681112(1554-1563)Online publication date: 28-Oct-2024
  • (2024)Masked frequency-color fusion network for video instance-level hazy lane detectionThe Visual Computer10.1007/s00371-024-03671-1Online publication date: 14-Oct-2024
  • (2023)Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic SegmentationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611792(2199-2210)Online publication date: 26-Oct-2023
  • Show More Cited By

Index Terms

  1. Fast and Accurate Lane Detection via Frequency Domain Learning

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '21: Proceedings of the 29th ACM International Conference on Multimedia
      October 2021
      5796 pages
      ISBN:9781450386517
      DOI:10.1145/3474085
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 17 October 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. discrete cosine transform
      2. feature diversity
      3. frequency domain learning
      4. lane detection
      5. multi-frequency analysis

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      MM '21
      Sponsor:
      MM '21: ACM Multimedia Conference
      October 20 - 24, 2021
      Virtual Event, China

      Acceptance Rates

      Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)25
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 23 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Simple Yet Effective: Structure Guided Pre-trained Transformer for Multi-modal Knowledge Graph ReasoningProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681112(1554-1563)Online publication date: 28-Oct-2024
      • (2024)Masked frequency-color fusion network for video instance-level hazy lane detectionThe Visual Computer10.1007/s00371-024-03671-1Online publication date: 14-Oct-2024
      • (2023)Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic SegmentationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3611792(2199-2210)Online publication date: 26-Oct-2023
      • (2023)DQFORMER: Dynamic Query Transformer for Lane DetectionICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP49357.2023.10097047(1-5)Online publication date: 4-Jun-2023
      • (2023)RGB No More: Minimally-Decoded JPEG Vision Transformers2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52729.2023.02139(22334-22346)Online publication date: Jun-2023
      • (2022)Deep Active Learning for Computer Vision Tasks: Methodologies, Applications, and ChallengesApplied Sciences10.3390/app1216810312:16(8103)Online publication date: 12-Aug-2022

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media