Professional Documents
Culture Documents
Outline:
the ARM 3-stage pipeline
the ARM7TDMI core
the ARM 5-stage pipeline
the ARM9TDMI core
the ARM10TDMI core
fetch
decode
execute
the operands are read from the register
bank, shifted, combined in the ALU and the
result written back
Cores - v4 - 1
execute
fetch
decode
execute
fetch
decode
3
instruction
decode
execute
fetch STR
time
Cores - v4 - 2
execute
decode
fetch ADD
decode
fetch ADD
execute
decode
execute
execute
instruction
time
2001 PEVEIT Unit - ARM System Design
Cores - v4 - 3
PC behaviour
ARM components:
register bank
2 read ports, 1 write port
plus additional read and write ports for r15
Cores - v4 - 4
Cores - v4 - 5
barrel shifter
ALU
address register and incrementer
memory data registers
instruction decoder and control
Cores - v4 - 6
A[31:0]
control
address register
P
C
3-stage
ARM
organization
incrementer
PC
register
bank
decode
A
L
U
b
u
s
multiply
register
&
b
u
s
b
u
s
barrel
shifter
Outline:
the ARM 3-stage pipeline
the ARM7TDMI core
the ARM 5-stage pipeline
the ARM9TDMI core
the ARM10TDMI core
instruction
control
ALU
data in register
D[31:0]
Cores - v4 - 7
The ARM7TDMI
Cores - v4 - 8
ARM7TDMI organization
scan chain 2
extern0
extern1
scan chain 0
Embedded
ICE
opc, r/w,
mreq, trans,
mas[1:0]
A[31:0]
processor
core
D[31:0]
Din[31:0]
Dout[31:0]
other
signals
scan chain 1
bus
splitter
JTAG TAP
controller
described later
TCK TMSTRST TDI TDO
2001 PEVEIT Unit - ARM System Design
Cores - v4 - 9
clock
control
mclk
wait
eclk
configuration
bigend
interrupts
irq
fiq
isync
The
ARM7TDMI
core
interface
signals
initialization
reset
bus
control
enin
enout
enouti
abe
ale
ape
dbe
tbe
busen
highz
busdis
ecapclk
debug
dbgrq
breakpt
dbgack
exec
extern1
extern0
dbgen
rangeout0
rangeout1
dbgrqi
commrx
commtx
coprocessor
interface
opc
cpi
cpa
cpb
power
Vdd
Vss
Cores - v4 - 10
A[31:0]
Din[31:0]
Dout[31:0]
D[31:0]
memory
interface
bl[3:0]
r/w
mas[1:0]
mreq
seq
lock
ARM7TDMI
core
ARM7TDMI
trans
mode[4:0]
abort
MMU
interface
Tbit
state
tapsm[3:0]
ir[3:0]
tdoen
tck1
tck2
screg[3:0]
TAP
information
drivebs
ecapclkbs
icapclkbs
highz
pclkbs
rstclkbs
sdinbs
sdoutbs
shclkbs
shclk2bs
boundary
scan
extension
TRST
TCK
TMS
TDI
TDO
JTAG
controls
Cores - v4 - 11
ARM7TDMI characteristics:
Process
Metal layers
Vdd
0.35 m
3
3.3 V
Transistors
Core area
Clock
74,209
2
2.1 mm
0 to 66 MHz
MIPS
Power
MIPS/W
60
87 mW
690
Cores - v4 - 12
ARM7TDMI
Outline:
the ARM 3-stage pipeline
the ARM7TDMI core
the ARM 5-stage pipeline
the ARM9TDMI core
the ARM10TDMI core
Cores - v4 - 13
Fetch
Decode
Execute
Memory
Cores - v4 - 15
Write-back
Cores - v4 - 16
ARM9TDMI
Cores - v4 - 14
Cores - v4 - 17
The ARM9TDMI is
a classic Harvard architecture 5-stage
pipeline
separate instruction and data memory ports
Cores - v4 - 18
ARM9TDMI
pipeline
ARM9TDMI pipeline
next
pc
+4
I-cache
fetch
pc + 4
pc + 8
I decode
r15
ARM7TDMI:
Fetch
Decode
instruction
fetch
Thumb
decompress
Execute
ARM
decode
reg
read
reg
write
shift/ALU
ARM9TDMI:
r. read
instr uction
fetch
decode
Fetch
Decode
shift/ALU
data memor y
access
Execute
Memory
reg
write
very similar to
StrongARM
immediate
elds
mul
LDM/
STM
see CPU
section
no separate
branch adder
instruction
decode
register read
+4
postindex
reg
shift
shift
pre-index
execute
ALU
forwarding
paths
mux
B, BL
MOV pc
SUBS pc
byte repl.
Write
buffer/
data
D-cache
load/store
address
rot/sgn ex
LDR pc
register write
Cores - v4 - 19
ARM9TDMI
write-back
Cores - v4 - 20
ARM9TDMI
EmbeddedICE
as ARM7TDMI, plus:
hardware single-stepping
breakpoints on exceptions
Process
Metal layers
Vdd
0.25 m
3
2.5 V
Transistors
Core area
Clock
111,000
2
2.1 mm
0-200 MHz
MIPS
Power
MIPS/W
220
150 mW
1,500
Cores - v4 - 21
ARM10TDMI
Cores - v4 - 22
ARM10TDMI pipeline
The ARM10TDMI is
aimed at significantly higher performance
than the ARM9TDMI
achieved through use of:
higher clock rate
64-bit I- and D-memory buses
branch prediction
hit-under-miss D-memory interface
addr.
calc.
branch
prediction
instruction
fetch
decode
Fetch
Issue
data memory
access
r. read
decode
shift/ALU
multiply
multiplier
par tials add
Decode
Execute
Memory
data
write
reg
write
Write
6-stage pipeline
2001 PEVEIT Unit - ARM System Design
Cores - v4 - 23
Cores - v4 - 24