High-Speed VLSI Implementation of an Improved Parallel Delayed LMS Algorithm

Liu, Ming; Guan, Mingxiang; Wu, Zhou; Sun, Chongwu; Zhang, Weifeng; Wang, Mingjiang

doi:10.1007/s11036-021-01877-4

High-Speed VLSI Implementation of an Improved Parallel Delayed LMS Algorithm

Published: 11 January 2022

Volume 27, pages 1593–1603, (2022)
Cite this article

Mobile Networks and Applications Aims and scope Submit manuscript

Ming Liu¹,
Mingxiang Guan¹,
Zhou Wu ORCID: orcid.org/0000-0002-1211-334X¹,
Chongwu Sun¹,
Weifeng Zhang¹ &
…
Mingjiang Wang²

400 Accesses
Explore all metrics

Abstract

Motivated by improvement of convergence rate and throughput performance, this work develops a systematic high-speed VLSI implementation of the adaptive filter based on the improved 2-parallel delayed LMS (PDLMS) algorithm. The proposed design uses a novel hardware-efficient architecture for weight updating based on parallel adaptive 2-by-2 algorithm. Compared with the conventional filter structure, the parallel filter has higher throughput rate and lower power dissipation. To improve the convergent characteristic of the adaptive digital filter, we have selected one branch from two weight update branches which has better system performance. The fine-grained arithmetic operation unit and the retiming technology are employed to reduce the delay of critical path effectively. From the ASIC synthesis results we find that the proposed architecture of an 8-tap filter has nearly 24% less power and nearly 18% less area-delay-product (ADP) than the best existing structure. Thus it can be seen that the proposed design has the important practice instruction significance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Optimized VLSI Implementation of the Least Mean Square (LMS) Adaptive Filter Architecture on the Basis of Distributed Arithmetic Approach

Article 15 March 2024

High Throughput VLSI Architectures for CRC-12 Computation

VLSI Implementation for Noise Suppression Using Parallel Median Filtering Technique

References

Meher PK, Park SY (2014) Critical-path analysis and low-complexity implementation of the LMS adaptive algorithm. IEEE Trans Circuits Syst I, Reg Papers 61(3):778–788
Article Google Scholar
Liu D, Wang M (2016) Delay-optimized floating point fused add-subtract unit. IEICE Electron Express 12(1):1–12
MathSciNet Google Scholar
Yi Y, Woods R, Ting LK, Cowan CFN (2005) High speed FPGA-based implementations of delayed-LMS filters. Journal of VLSI signal processing 39(2):113–131
Article Google Scholar
Meher PK, Maheshwari M (2011) A high-speed FIR adaptive filter architecture using a modified delayed LMS algorithm. IEEE International Symposium of Circuits and Systems (ISCAS). IEEE, Rio de Janeiro, Brazil, pp 121–124
Google Scholar
Shanbhag NR, Parhi KK (1994) Pipelined adaptive Digtal filters. Kluwer Academic Publishers, Dordrecht
Book Google Scholar
Douglas SC, Zhu Q, Smith KF (1998) A pipelined LMS adaptive FIR filter architecture without adaptation delay. IEEE trans. Signal Process 46(3):775–779
Google Scholar
Ting LK, Woods R, Cowan CFN (2005) Virtex FPGA implementation of a pipelined adaptive LMS predictor for electronic support measures receivers. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 13(1):86–95
Article Google Scholar
Park Y, Meher PK (2013) Low-power, high-throughput, and low-area adaptive FIR filter based on distributed arithmetic. IEEE Transactions on Circuits and Systems II: Express Briefs 60(6):346–350
Google Scholar
Mohanty BK, Meher PK (2009) Delayed block LMS algorithm and concurrent architecture for high-speed implementation of adaptive FIR filters. Proc. IEEE Region 10 TENCON2008 Conference. IEEE, Hyderabad, pp 1–5
Google Scholar
Van LD, Feng WS (2001) An efficient systolic architecture for the DLMS adaptive filter and its applications. IEEE trans. Circuits Syst. II, analog and digital. Signal Process 48(4):359–366
Google Scholar
Liu X, Zhang X (2020) NOMA-based resource allocation for cluster-based cognitive industrial internet of things. IEEE Trans. Industrial Informatics 16(8):5379–5388
Article Google Scholar
Liu X, Zhang X, Lu W (2021) QoS-guarantee resource allocation for multibeam satellite industrial internet of things with NOMA. IEEE Trans Industrial Informatics 17(3):2052–2061
Article Google Scholar
Feng Li KY, Lam and Xin Liu. (2018) Joint pricing and power allocation for multibeam satellite systems with dynamic game model. IEEE Trans Vehicular Technology 67(3):2398–2408
Article Google Scholar
Liu X, Zhang X, Jia M (2018) 5G-based green broadband communication system design with simultaneous wireless information and power transfer. Physical Communication 28:130–137
Article Google Scholar
Y.-C Tsao and K. Choi.: Area efficient parallel fir digital filter structures for symmetric convolutions based on fast fir algorithm” IEEE Trans VLSI Syst 20(2), 366–371 (2012)
Srinivasan S, Bhudiya K, Ramanarayanan R et al (2013) Split-path fused floating point multiply accumulate (FPMAC). IEEE 21st Symposium on Computer Arithmetic. IEEE, Austin, pp 17–24
Google Scholar
Long G, Ling F, Proakis JG (1989) The LMS algorithm with delayed coefficient adaptation. IEEE Trans Accoust, Speech, and Signal Processing 37(9):1397–1405
Article Google Scholar

Download references

Acknowledgments

This work is supported by the project of shenzhen science and technology innovation committee (JCYJ20180307123857045), the scientific research project in school-level (SZIIT2019KJ026) and the project of guangdong provincial department of education (2019GKQNCX122).

Author information

Authors and Affiliations

School of Microelectronics, Shenzhen Institute of Information Technology, Shenzhen, 518000, China
Ming Liu, Mingxiang Guan, Zhou Wu, Chongwu Sun & Weifeng Zhang
Faculty of Electronics and information Engineering, Harbin Institute of Technology, Shenzhen, 518000, China
Mingjiang Wang

Authors

Ming Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mingxiang Guan
View author publications
You can also search for this author in PubMed Google Scholar
Zhou Wu
View author publications
You can also search for this author in PubMed Google Scholar
Chongwu Sun
View author publications
You can also search for this author in PubMed Google Scholar
Weifeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Mingjiang Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhou Wu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, M., Guan, M., Wu, Z. et al. High-Speed VLSI Implementation of an Improved Parallel Delayed LMS Algorithm. Mobile Netw Appl 27, 1593–1603 (2022). https://doi.org/10.1007/s11036-021-01877-4

Download citation

Accepted: 03 November 2021
Published: 11 January 2022
Issue Date: August 2022
DOI: https://doi.org/10.1007/s11036-021-01877-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

High-Speed VLSI Implementation of an Improved Parallel Delayed LMS Algorithm

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

An Optimized VLSI Implementation of the Least Mean Square (LMS) Adaptive Filter Architecture on the Basis of Distributed Arithmetic Approach

High Throughput VLSI Architectures for CRC-12 Computation

VLSI Implementation for Noise Suppression Using Parallel Median Filtering Technique

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now