Fast Convolution: Cook-Toom Algorithm
Dr. Arunachalam V
Associate Professor, SENSE
Fast Convolution
• Fast convolution algorithms use fewer multiplication operations.
• These algorithms belong to the class of algorithmic strength reduction.
• The number of strong operations (such as multiplications) is reduced, possibly at the expense of an increase in the number of weaker operations (such as additions).
• These algorithms are well suited to realization in either programmable or dedicated hardware.
Multiplication of two complex numbers in (x + jy) form
• Assume (a + jb)(c + jd) = (e + jf), where (a + jb) is a signal sample and (c + jd) is a coefficient.
• This is expressed in matrix form:
$$\begin{bmatrix} e \\ f \end{bmatrix} = \begin{bmatrix} c & -d \\ d & c \end{bmatrix} \begin{bmatrix} a \\ b \end{bmatrix}$$
• This direct implementation requires 4 multiplications and 2 additions.
$$e = ca - db = a(c - d) + d(a - b)$$
$$f = da + cb = b(c + d) + d(a - b)$$
• The coefficient matrix can be decomposed as the product of matrices as shown below:
$$\begin{bmatrix} c & -d \\ d & c \end{bmatrix} = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 1 & 1 \end{bmatrix} \begin{bmatrix} c - d & 0 & 0 \\ 0 & c + d & 0 \\ 0 & 0 & d \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 0 & 1 \\ 1 & -1 \end{bmatrix}$$
What is the effect?
• (c-d) and (c+d) are assumed to be pre-computed.
• Thus the algorithmic complexity has been reduced to 3 multiplications and 3 additions.
• One multiplication has been traded off for one addition.
• This leads to a reduction in hardware area; a quick software check of the scheme is shown below.
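As an illustration, here is a minimal Python sketch of the 3-multiplication scheme (the function name and test values are only illustrative, not from the slides):

```python
def complex_mult_3m(a, b, c, d):
    """(a + jb)(c + jd) = (e + jf) using 3 real multiplications.
    In hardware, (c - d) and (c + d) are precomputed per coefficient."""
    t = d * (a - b)          # shared product d(a - b)
    e = a * (c - d) + t      # e = ca - db
    f = b * (c + d) + t      # f = da + cb
    return e, f

# Verify against Python's built-in complex multiplication
a, b, c, d = 3.0, 4.0, 5.0, 6.0
e, f = complex_mult_3m(a, b, c, d)
assert complex(e, f) == complex(a, b) * complex(c, d)
```

The precomputation is what makes the trade worthwhile: when one factor is a fixed coefficient, (c − d) and (c + d) cost nothing at run time.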
Lagrange interpolation theorem
• The Cook-Toom algorithm is a linear convolution algorithm for polynomial multiplication.
• It is based on the Lagrange interpolation theorem.
• For a polynomial f(p) of degree n, given its values at n + 1 distinct points β0, β1, …, βn, f(p) can be uniquely reconstructed as:
$$f(p) = \sum_{i=0}^{n} f(\beta_i) \prod_{\substack{0 \le j \le n \\ j \ne i}} \frac{p - \beta_j}{\beta_i - \beta_j}$$
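A direct Python transcription of this formula (a small illustrative helper, using exact rationals so the divisions introduce no rounding):

```python
from fractions import Fraction

def lagrange_eval(points, p):
    """Evaluate at p the unique degree-n polynomial through the
    n + 1 given (beta_i, f(beta_i)) pairs."""
    total = Fraction(0)
    for i, (bi, fi) in enumerate(points):
        term = Fraction(fi)
        for j, (bj, _) in enumerate(points):
            if j != i:
                term *= Fraction(p - bj, bi - bj)
        total += term
    return total

# f(p) = p^2 + 1 sampled at beta = 0, 1, -1; interpolate back at p = 2
print(lagrange_eval([(0, 1), (1, 2), (-1, 2)], 2))  # -> 5, i.e. 2^2 + 1
```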
Convolution of x(p) & h(p)
• Consider an N-point sequence h = {h0, h1, …, hN−1} and an L-point sequence x = {x0, x1, …, xL−1}.
• The linear convolution of h and x can be expressed in terms of polynomial multiplication as s(p) = h(p) x(p) (verified numerically below), where
$$h(p) = h_{N-1} p^{N-1} + \cdots + h_1 p + h_0$$
$$x(p) = x_{L-1} p^{L-1} + \cdots + x_1 p + x_0$$
$$s(p) = s_{L+N-2} p^{L+N-2} + \cdots + s_1 p + s_0$$
• The output polynomial s(p) has degree L + N − 2 and L + N − 1 coefficients.
• It can be uniquely determined by its values at L+N-1 different points.
• Let β0, β1, β2, …, βL+N−2 be a set of L + N − 1 different real numbers.
• If s(βi), for i = 0, 1, 2, …, L + N − 2, are known, then s(p) can be computed using the Lagrange interpolation theorem as:
$$s(p) = \sum_{i=0}^{L+N-2} s(\beta_i) \prod_{j \ne i} \frac{p - \beta_j}{\beta_i - \beta_j}$$
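A quick numerical check (values chosen arbitrarily) that the coefficients of h(p) x(p) are exactly the linear convolution of h and x:

```python
import numpy as np

h = [3, 1]       # h(p) = 3 + p
x = [2, 5, 4]    # x(p) = 2 + 5p + 4p^2
print(np.convolve(h, x))  # [ 6 17 17  4]
# Matches (3 + p)(2 + 5p + 4p^2) = 6 + 17p + 17p^2 + 4p^3
```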
Cook-Toom algorithm
1. Choose L + N − 1 different real numbers β0, β1, β2, …, βL+N−2.
2. Compute h(βi) and x(βi), for i = 0, 1, 2, …, L + N − 2.
3. Compute s(βi) = h(βi) · x(βi), for i = 0, 1, 2, …, L + N − 2.
4. Compute s(p) using
$$s(p) = \sum_{i=0}^{L+N-2} s(\beta_i) \prod_{j \ne i} \frac{p - \beta_j}{\beta_i - \beta_j}$$
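Taken together, the four steps can be prototyped in a few lines of Python (an illustrative reference model with exact rational arithmetic; a hardware design would instead hardwire the β-dependent pre- and post-addition matrices):

```python
from fractions import Fraction

def cook_toom(h, x, betas):
    """Linear convolution of an N-point h and an L-point x via the
    four Cook-Toom steps; betas must hold L + N - 1 distinct values."""
    n = len(h) + len(x) - 1
    assert len(betas) == n == len(set(betas))

    def poly_eval(coeffs, b):            # Horner's rule
        acc = Fraction(0)
        for c in reversed(coeffs):
            acc = acc * b + c
        return acc

    # Steps 2-3: evaluate h and x at each beta, multiply pointwise
    s_at = [poly_eval(h, b) * poly_eval(x, b) for b in betas]

    # Step 4: Lagrange interpolation recovers the coefficients of s(p)
    s = [Fraction(0)] * n
    for i, bi in enumerate(betas):
        num = [Fraction(1)]              # running product of (p - beta_j)
        denom = Fraction(1)
        for j, bj in enumerate(betas):
            if j == i:
                continue
            num = [Fraction(0)] + num    # num(p) *= p, then ...
            for k in range(len(num) - 1):
                num[k] -= bj * num[k + 1]   # ... subtract bj * num(p)
            denom *= bi - bj
        for k in range(n):
            s[k] += s_at[i] * num[k] / denom
    return s

# (1 + 2p)(3 + 4p) = 3 + 10p + 8p^2
print([int(c) for c in cook_toom([1, 2], [3, 4], [0, 1, -1])])  # [3, 10, 8]
```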
The effect of the CT algorithm
• The goal of the fast convolution algorithm is to reduce the multiplication
complexity.
• If βi, for i = 0, 1, 2, 3, …, L+N-2 are chosen properly, the computation in step
2 for evaluating h(βi) and x(βi) will involve some additions and multiplications
by small constants (such as positive and negative powers of 2).
• These multiplications can be ignored when the βi's are small numbers, since multiplication by a power of 2 is just a shift in hardware.
• However, such operations may contribute to increased complexity for larger problem sizes.
• The number of multiplications has been reduced from O(LN) to L + N − 1, at the expense of an increase in the number of additions. For example, for L = N = 8, direct convolution needs 64 multiplications, while Cook-Toom needs only 15.
Example 1
Construct a 2×2 convolution algorithm using the Cook-Toom algorithm with β = {0, 1, −1}.
Solution:
• Write the 2×2 convolution in polynomial multiplication form as s(p) = h(p) · x(p),
• where h(p) = h0 + h1 p, x(p) = x0 + x1 p, and s(p) = s0 + s1 p + s2 p².
• A direct implementation can be expressed in matrix form as follows:
$$\begin{bmatrix} s_0 \\ s_1 \\ s_2 \end{bmatrix} = \begin{bmatrix} h_0 & 0 \\ h_1 & h_0 \\ 0 & h_1 \end{bmatrix} \begin{bmatrix} x_0 \\ x_1 \end{bmatrix}$$
• With β = {0, 1, −1}, evaluating h(p) and x(p) at each βi and applying Lagrange interpolation leads to the decomposed form:
$$\begin{bmatrix} s_0 \\ s_1 \\ s_2 \end{bmatrix} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & -1 \\ -1 & 1 & 1 \end{bmatrix} \begin{bmatrix} h_0 & 0 & 0 \\ 0 & \frac{h_0 + h_1}{2} & 0 \\ 0 & 0 & \frac{h_0 - h_1}{2} \end{bmatrix} \begin{bmatrix} 1 & 0 \\ 1 & 1 \\ 1 & -1 \end{bmatrix} \begin{bmatrix} x_0 \\ x_1 \end{bmatrix}$$
Computation steps
The computation is carried out as follows:
1. H0 = h0, H1 = (h0 + h1)/2, H2 = (h0 − h1)/2 (precomputed)
2. X0 = x0, X1 = x0 + x1, X2 = x0 − x1 (2 additions)
3. S0 = H0 X0, S1 = H1 X1, S2 = H2 X2 (3 multiplications)
4. s0 = S0, s1 = S1 − S2, s2 = −S0 + S1 + S2 (3 additions)
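The same four steps written out in Python (illustrative; in a real filter, step 1 lives in precomputed coefficient registers):

```python
def fast_conv_2x2(h0, h1, x0, x1):
    """2x2 Cook-Toom convolution with beta = {0, 1, -1}:
    3 multiplications, 2 pre-additions, 3 post-additions."""
    H0, H1, H2 = h0, (h0 + h1) / 2, (h0 - h1) / 2   # step 1: precomputed
    X0, X1, X2 = x0, x0 + x1, x0 - x1               # step 2: 2 additions
    S0, S1, S2 = H0 * X0, H1 * X1, H2 * X2          # step 3: 3 multiplications
    return S0, S1 - S2, -S0 + S1 + S2               # step 4: 3 additions

# (1 + 2p)(3 + 4p) = 3 + 10p + 8p^2
print(fast_conv_2x2(1, 2, 3, 4))  # (3, 10.0, 8.0)
```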
Example 2
Construct a 2×3 convolution algorithm using the Cook-Toom algorithm with β = {0, 1, −1, 2}.
Solution:
• Here h(p) = h0 + h1 p, x(p) = x0 + x1 p + x2 p², and s(p) = s0 + s1 p + s2 p² + s3 p³, so L + N − 1 = 4 interpolation points are needed.
• Applying the Lagrange interpolation theorem at β = {0, 1, −1, 2}:
$$s(p) = s(0)\frac{p^3 - 2p^2 - p + 2}{2} + s(1)\frac{p^3 - p^2 - 2p}{-2} + s(-1)\frac{p^3 - 3p^2 + 2p}{-6} + s(2)\frac{p^3 - p}{6}$$
• Collecting powers of p:
$$s(p) = s(0) + p\left(-\frac{s(0)}{2} + s(1) - \frac{s(-1)}{3} - \frac{s(2)}{6}\right) + p^2\left(-s(0) + \frac{s(1)}{2} + \frac{s(-1)}{2}\right) + p^3\left(\frac{s(0)}{2} - \frac{s(1)}{2} - \frac{s(-1)}{6} + \frac{s(2)}{6}\right)$$
$$= s_0 + s_1 p + s_2 p^2 + s_3 p^3$$
Matrix-vector form
$$\begin{bmatrix} s_0 \\ s_1 \\ s_2 \\ s_3 \end{bmatrix} = \begin{bmatrix} 2 & 0 & 0 & 0 \\ -1 & 2 & -2 & -1 \\ -2 & 1 & 3 & 0 \\ 1 & -1 & -1 & 1 \end{bmatrix} \begin{bmatrix} \frac{s(0)}{2} \\ \frac{s(1)}{2} \\ \frac{s(-1)}{6} \\ \frac{s(2)}{6} \end{bmatrix}$$
Substituting s(βi) = h(βi) x(βi) gives the full decomposition:
$$\begin{bmatrix} s_0 \\ s_1 \\ s_2 \\ s_3 \end{bmatrix} = \begin{bmatrix} 2 & 0 & 0 & 0 \\ -1 & 2 & -2 & -1 \\ -2 & 1 & 3 & 0 \\ 1 & -1 & -1 & 1 \end{bmatrix} \begin{bmatrix} \frac{h_0}{2} & 0 & 0 & 0 \\ 0 & \frac{h_0 + h_1}{2} & 0 & 0 \\ 0 & 0 & \frac{h_0 - h_1}{6} & 0 \\ 0 & 0 & 0 & \frac{h_0 + 2h_1}{6} \end{bmatrix} \begin{bmatrix} 1 & 0 & 0 \\ 1 & 1 & 1 \\ 1 & -1 & 1 \\ 1 & 2 & 4 \end{bmatrix} \begin{bmatrix} x_0 \\ x_1 \\ x_2 \end{bmatrix}$$
Computation steps
The computation is carried out as follows:
1. H0 = h0/2, H1 = (h0 + h1)/2, H2 = (h0 − h1)/6, H3 = (h0 + 2h1)/6 (precomputed)
2. X0 = x0, X1 = (x0 + x2) + x1, X2 = (x0 + x2) − x1, X3 = x0 + 2x1 + 4x2 (5 additions)
3. S0 = H0 X0, S1 = H1 X1, S2 = H2 X2, S3 = H3 X3 (4 multiplications)
4. s0 = 2S0, s1 = −(S0 + S3) + 2(S1 − S2), s2 = −2S0 + S1 + 3S2, s3 = (S0 + S3) − (S1 + S2) (7 additions)
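These steps in Python (illustrative; Fractions keep the 1/2 and 1/6 scalings exact, where a fixed-point design would fold them into the precomputed coefficients):

```python
from fractions import Fraction

def fast_conv_2x3(h0, h1, x0, x1, x2):
    """2x3 Cook-Toom convolution with beta = {0, 1, -1, 2}:
    4 multiplications, 5 pre-additions, 7 post-additions
    (multiplications by powers of 2 are shifts and are not counted)."""
    # Step 1: precomputed
    H0, H1 = Fraction(h0, 2), Fraction(h0 + h1, 2)
    H2, H3 = Fraction(h0 - h1, 6), Fraction(h0 + 2 * h1, 6)
    # Step 2: 5 additions (x0 + x2 is shared)
    t = x0 + x2
    X0, X1, X2, X3 = x0, t + x1, t - x1, x0 + 2 * x1 + 4 * x2
    # Step 3: 4 multiplications
    S0, S1, S2, S3 = H0 * X0, H1 * X1, H2 * X2, H3 * X3
    # Step 4: 7 additions
    s0 = 2 * S0
    s1 = -(S0 + S3) + 2 * (S1 - S2)
    s2 = -2 * S0 + S1 + 3 * S2
    s3 = (S0 + S3) - (S1 + S2)
    return s0, s1, s2, s3

# (1 + 2p)(3 + 4p + 5p^2) = 3 + 10p + 13p^2 + 10p^3
print([int(v) for v in fast_conv_2x3(1, 2, 3, 4, 5)])  # [3, 10, 13, 10]
```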