Professional Documents
Culture Documents
Manuscript received Feb. 20, 2014. The rest of this brief is organized as follows. In
S.Nazneen Shagufa, S.Sufia Anjum, E.C.E Dept., Dr.K.V.Subba Reddy
Section II, we present a new algorithm for computing an N
College Of Engineering For Women, India
S.Sufia Anjum, E.C.E Dept., Dr.K.V.Subba Reddy College Of Engineering -point DHT. In Section III, we present an algorithm for a
For Women, India small-length DHT. In Section IV, we analyze the arithmetic
B.Raghunath Reddy, E.C.E Dept., Dr.K.V.Subba Reddy Institute of cost, and in Section V, we present some examples of our
technology, India
200 www.erpublication.org
DHT ALGORITHM FOR HIGHLY PARALLEL IMPLENTATION OF VLSI ARCHITECTURE
Where cas(x)=cos(x)+sin(x).
XN(k){x(i)}= XN/2(k){x(2i)}+u(0).sin(2k/N) +
[XN/2(k){u(i)}-u(0)/2].2.cos(2k/N)------------------(2)
XN(N/2+k){x(i)}=
XN/2(k){x(2i)}-u(0).sin(2k/N)-[XN/2(k){u(i)}-u(0)/2].2.cos(
2k/N) for k=0,1,.N/4 -------------- (3)
XN(N/2-k){x(i)}=
III. ALGORITHM FOR A SMALL DHT
XN/2(N/2-k){x(2i)}+u(0).sin(2k/N)-[XN/2(N/2-k){u(i)}-u(0) An efficient implementation of a fast DHT algorithm
/2].2.cos(2k/N) for k=0,1,.N/4 --- (4) closely depends on an efficient algorithm for a small DHT.
We present here an efficient DHT algorithm for a length N=8
201 www.erpublication.org
International Journal of Engineering and Technical Research (IJETR)
ISSN: 2321-0869, Volume-2, Issue-2, February 2014
Table I lists the required number of multiplications very good potential for using hardware sharing techniques,
and additions for the proposed algorithm, the Sorensen one and many sub expressions have been used in common. We
and Bi algorithm, where rotations are implemented with four can thus significantly reduce the hardware complexity of the
multiplications and two additions (Radix-2 [13] ) and with VLSI implementation. Moreover, due to the fact that the same
three multiplications and three additions (Radix-2 [13] ). constant is used in several multiplications, we can use the
The values of M in the proposed algorithm are computed technique of sharing the multipliers with the same constant.
considering that the multipliers with the same constant are Having only multiplications with a constant, we can
shared. The number of multipliers in Sorensen algorithm [11] efficiently implement these multipliers in VLSI.
is significantly greater than that in the proposed one. The
number of multipliers for Bi algorithm where rotations are
implemented with four multiplications and two additions is VI. HIGHLY PARALLEL VLSI ARCHITECTURE
greater than the necessary number of multipliers for our
algorithm and slightly smaller when the rotations are
implemented with three multiplications and three additions.
However, the split-radix algorithm has an irregular structure In order to clearly illustrate the features and
and is difficult to be implemented in hardware as opposed to advantages of the proposed algorithm, the VLSI architecture
our algorithm that has a regular and modular structure and can for a DHT of length N=32 is presented in Fig. 1(a) and (b). It
be very easily implemented in parallel as it will be shown in can be seen that the proposed architecture is highly parallel
Section VI for a DHT of length N=32 . Moreover, the number and has a modular and regular structure being formed of only
of multipliers in the proposed implementation can be a few blocks: U, MUL, ADD/SUB, XCH, and a few
significantly further reduced by sharing multiplications as additional adders/subtracters. The U blocks implement
shown in Section IV. (20), XCH blocks interchange the values and are simply
implemented in hardware by appropriate wiring, and MUL
blocks are used to implement the shared multipliers with a
constant. This block contains four multipliers with a constant.
V. EXAMPLE OF THE PROPOSED ALGORITHM Each multiplier is shared by four input sequences that are
multiplied with the same constant in an interleaved manner
using multiplexers and demultiplexers controlled by two
We shall illustrate the main features of the proposed clocks. One of the advantages of this algorithm and
algorithm considering a DHT of length N =32 architecture is the fact that the multiplications with the same
A.DHT of Length N=32 constant are shared in the MUL blocks. Thus, the number of
We first compute recursively the auxiliary input multipliers is significantly less than the value 40 given in
sequences Table I which has become now only 16. The final values Y(k)
of Section A and Y0(k) of Section B are finally added to
obtain the output sequence Y(k) using an additional adder not
presented in Fig. 1 for simplicity.
202 www.erpublication.org
DHT ALGORITHM FOR HIGHLY PARALLEL IMPLENTATION OF VLSI ARCHITECTURE
+ }.2.cos(2k/16)]+x(0)/2+ .sin(2k/16)+
Sin(2k/32)+ { + { .2.cos(2k/16)].2.cos((2k/32)+ .2.sin(
2k/16)cos(2k/32).--------------------------------------------------------------------------------------------------------------(21)
}.2.cos(2k/16)]+x(0)/2- .sin(2k/16)+
sin(2k/32)- { { .2.cos(2k/16)].2.cos((2k/32)+ .2.sin(2k/1
6) cos(2k/32).-------------------------------------------------------------------------------------------------------------------(26)
203 www.erpublication.org
International Journal of Engineering and Technical Research (IJETR)
ISSN: 2321-0869, Volume-2, Issue-2, February 2014
REFERENCES
204 www.erpublication.org
DHT ALGORITHM FOR HIGHLY PARALLEL IMPLENTATION OF VLSI ARCHITECTURE
Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 39, no.
10, pp. 723733, Oct. 1992.
[21] P. K. Meher, LUT optimization for memory-based
computation, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 57,
no. 4, pp. 285289, Apr. 2010.
[22] P. K. Meher, New approach to look-up table design and
memory-based realization of FIR digital filters, IEEE Trans.
Circuits Syst. I, Reg. Papers, vol. 57, no. 3, pp. 592-603, Mar. S.SufiaAnjum, completed her Bachelor of
2010. Technology from Vaagdevi Institute of Technology and Science .She has
[23] D. F. Chiper and P. Ungureanu, Novel VLSI algorithm and completed her Masters degree with specialization in VLSI system Design
architecture with good quantization properties for a high She has a teaching experience of two years.Her areas of interest are VLSI
throughput area efficient systolic array implementation of DCT, and Digital Signal Processing.
EURASIP J. Adv. Signal Process., vol. 2011, no.1, pp. 114, Jan.
2011.
[24] R. I. Hartley, Subexpression sharing in filters using canonic
signed digit multipliers, IEEE Trans. Circuits Syst. II, Analog
Digit. Signal Process., vol. 43, no.10, pp. 677688, Oct. 1996.
[25] K. K. Parhi, VLSI Digital Signal Processing. New York, NY,
USA: Wiley, 1999
205 www.erpublication.org