Professional Documents
Culture Documents
Lecturer: Arvind
TA: K. Elliott Fleming
Assistant: Sally Lee
Why?
Power savings: Specialized hardware for a video
decoder (H.264) may consume 1/100th to 1/1000th
the power of a software implementation
Cost
Performance
Size …
advanced microprocessors
Standard-Cell-Based ASICs
High volume, moderate performance: Graphics chips, network
chips, cell-phone chips
Intel 8080A, 1974 Intel 8086, 1978, 33mm2 Intel 80286, 1982, 47mm2 Intel 386DX, 1985, 43mm2
3Mhz, 6K transistors, 6u 10Mhz, 29K transistors, 3u 12.5Mhz, 134K transistors, 1.5u 33Mhz, 275K transistors, 1u
Intel 486, 1989, 81mm2 Intel Pentium, 1993/1994/1996, 295/147/90mm2 Intel Pentium II, 1997, 203mm2/104mm2
50Mhz, 1.2M transistors, .8u 66Mhz, 3.1M transistors, .8u/.6u/.35u 300/333Mhz, 7.5M transistors, .35u/.25u
2001
1996
2000
1993
1995
1997
1998
1999
2001
2002
2002
Front-end is designing the logic (RTL)
Back-end is fitting all the gates and wires on the
chip; meeting timing specifications; wiring up
power, ground, and clock
February 4, 2009 http://csg.csail.mit.edu/6.375/ L01-8
Design Cost Impacts Chip Cost
An Altera study
Non-Recurring Engineering (NRE) costs for a
90nm ASIC is ~ $30M
59% chip design (architecture, logic & I/O design,
product & test engineering)
30% software and applications development
11% prototyping (masks, wafers, boards)
Bluespec
February 4, 2009 http://csg.csail.mit.edu/6.375/ L01-12
Bluespec: Enabling High-level
Synthesis
Bluespec SystemVerilog source what we did until
last year in 6.375
Bluespec Compiler
what we plan to
do this year
C Verilog 95 RTL
Bluesim Cycle
Accurate
Verilog sim RTL synthesis
Debussy Power
estimatio FPGA
Visualization
n tool
Team:
2(IBM) + 2(MIT) + Linux and FPGA help
Example OFDM
WiMAX: 256pt based protocols
@ 0.03MHz
Pilot &
TX FEC CP
MAC Scrambler
Encoder
Interleaver Mapper Guard IFFT
Insertion D/A
Controller
WUSB: 128pt 8MHz Insertion
standard specifi
Convolutional
WiFi:x7+x 4
+1reuse
potential
Parse Inverse
NAL
+ Quant
unwrap
Bits
CAVLC Transformation
Frames
Inter Intra Deblock
Prediction Prediction Filter
Ref
Frames
Different requirements for different environments
- QVGA 320x240p (30 fps) May be implemented in hardware
- DVD 720x480p or software depending upon ...
- HD4,DVD
February 2009 1280x720p http://csg.csail.mit.edu/6.375/
(60-75 fps) L01-20
H.264 in Bluespec
Initial Design: Base profile
Eight man-months
8K lines of Bluespec
in contrast to 80K lines of C standard
Decoded 720p @ 32FPS