Professional Documents
Culture Documents
ALGEBRA
Almost
AXD
Diagonal
ITS XPPLIC4TIONS
Matrices
with
Multiple
or Close Eigenvalues*
J. H. WILKISSOS
~Vational
Physical
Teddington,
Laboratory
Eragland
1. INTRODUCTIOX
In a number
latter
of algorithms
is reduced
to almost
by an iterative
diagonal
form.
sequence
(assuming
exact
of almost diagonal
of a matrix
of similarity
eigenvalue
computation).
matrices
with multiple
interest
the convergence
for reducing
a matrix
2. THE HERMITIAN
eigenvalues.
which are of
of iterative
procedures
form.
CASE
We first consider
A be hermitian
to diagonal
this is true
We are interested
considerable
as regards
A,, the
transformations
hermitian
matrices
with eigenvalues
with multiple
eigenvalues.
Let
the argument.)
and let
A=D+E,
(3)
* This work was done while the author was Visiting Professor in the Cornputer Science Department
of Stanford University.
It was supported
by the
Xational Science Foundation and the Office of Naval Research.
Lznear
Copyright
4lgcbm
1968 by American
asd
Its
Applications
Elsevier
Publishing
1, I-12
Company,
(1968)
Inc.
J. H. WILKIXSON
2
where D is the diagonal
of A.
Suppose
we have
ilE;lF = E < 6
(3)
so that
(APi-
ai;)
(4)
of A similarly
< &2.
Y eigenvalues
loss of generality
inequality
numbering
(4) becomes
We write
A=GTH
i
(6)
where F is an Y x Y matrix.
If the eigenvalues
of H are I:+,,
numbering
of the Ai
(7)
Hence
I& -
& = I& -
aii + u,i -
GE+&<28
(8)
and
Linear
1, l-12
ilil
(1968)
ALMOST
DIAGONAL
The matrix
H -
MATKICES
Ail is therefore
nonsingular,
i.e., it is of rank
this means
A -
that F is especially
F -
If we premultiply
A -
alI =
A -
iliT by
;i,I
G7
be true
r.
(10)
&I
the derived
Since H -
We partition
(11)
matrix
AlI)-Gr
I$
H -
GT
is also of rank n -
to G and H.
H -
- G(H - AlI)-
iill - G(H -
is already
I,I
Y.
and hence
F -
related
n -
Y. We shall show
(12)
AlI
of rank n -
Y this can
only if
F -
il,I -
G(H -
il,I)plGT = 8,
;i,I)pG?
= ,?,I + M
(13)
i.e.,
F = 1,I + G(H Now the elements
of G are a subset
of those
(say).
(14)
of E and hence
while
(H where R is unitary.
norm
and from
= R diag(A, -
&I)-
A,)-iRH,
invariance
(16)
of the Frobenius
(17)
We see then
bounded
that
diagonal
elements
of F differ
elements
from A, by quantities
are bounded
by e2/6.
the largest
method
matrices.
matrix
is annihilated
elements
This
tending
simple
the classical
observation
annihilated
strategy
the annihilated
3.
removes
remark
is used [9].
elements
In practice
element
shows that
is never associated
after
with two
root.
a difficulty
applies
in demonstrating
quadratically
to the serial
that
convergent
Jacobi
CLOSE
method
which is
errors.
the quantity
matrix
is made on a matrix
merely
In discussing
has very
the convergence
elements
IiOOTS
stage
root.
when a transformation
the transformed
rounding
hermitian
element in
PATHOLOGICALLY
roots,
in connection
norm of off-diagonal
tending
element
consequences
of A is
associated
Jacobi
element
elements
the current
certain
Jacobi
off-diagonal
having multiple
close roots
of the
because
Jacobi
of
method
of very
of A are
a,;
a,,a,
,...)
I.,, ,,...,
a,,,
(W
where
ii =
and the ei are very small.
close.
Define
a+
Ei,
i-
l,...,v,
(19)
pathologically
6 by the relation
(20)
and assume
that
Now A may
be expressed
in the form
A = RD,RH,
where
D, = diag(Q
and D,
can
(22)
be separated
D, = diag(2, A, . . ., 1, a,,,,
into
D, and D, where
(23)
. . . , An),
(24)
+ RD,RH
= B + C
(say).
(25)
elements.
/JElI,+
The Frobenius
IICj!F=
root
corresponding
is therefore
elements
Suppose
llE~I~_l- (~E~~)~~~==.z<~.
the multiple
I -
elements
bounded
by ?/6
of A is bounded
for example
B = A -
a matrix
(W),
of B associated
and hence
by &Z/C?
+ (2
that
with
of the
E~~)~.
1 -
10-10,
1,
4,
and
IIEl,fl + 211(1()-10) E
The
have
off-diagonal
elements
a Frobenius
and therefore
element
they
of A.
the current
positions
than
the largest
Jacobi
off-diagonal
method
or
associated
with
method
Provided
b\
serial Jacobi
serial Jacobi
of A associated
bounded
Hence
the threshold
tion above,
norm
10-5.
since
elements
is concentrated
from
the main
on fewer elements.)
weight
(In fact, it
in the off-diagonal
J. H. WILKINSOS
6
4. NONHERhlITIAX
The above
associated
result
MATRICES
proofs
that
with hermitian
specifically
matrices.
Again
let A have
the roots
matrix
the result
having
an r-fold
d = D -t E
above
root.
Let
(D diagonal),
(26)
(28)
iiEl!,=E<6.
Suppose
[2]*
&I is of rank n -
there
the last
n) include
the Ai (i=r+l,...,
that A -
are at least
i for which
following
from
(28).
This
it appears
if so let us associate
possible
that
there
precisely
n -
and columns
the southeast
Y diagonal
&I
that
there
s;
so that these n -
Now permute
Y diagonal
elements
rows
are in
As in the symmetric
A -
elements
of A similarly
corner.
means
1, and radius e.
a,, with $.
s so
s indices
two inequalities
is
case, A -
A,1 is of rank +z -
Y and partitioning
in the form
G
H -
il,I
we have
I; -
l,Z -
G(H -
,?,I)-K
= 0,
(31)
* I would like to thank Dr. B. Levinger of Case Institute for drawing my attention
to this result while we were both enjoying
Division,
*Argonne Xatlonal
Laboratory.
the hospitality
of :\pplied Mathematics
ALMOST
DIAGONAL
provided
H -
MATRICES
A,1 is nonsingular.
Now
of those
of E.)
laiz -
elements of H.
(34
(These elements
are
Since
a,1 2
/Ai-
A,/ -
(aii -
A,(
(i=r+l,...,n),
> 2s
D, is nonsingular
(say),
(33)
and
H -
&I = D, jI f
D,-lL].
(34)
Now
(35)
and hence
z
Equation
(31) therefore
1
-.
i)
(36)
gives
F = A,1 + G(H -
;2,1)-1K
= R,I + M
(say),
where
(37)
Liwav
_zilgebra and
Its
Applications
1, l-
12 (1968)
J.
n -
l,...,
on 1,.
the level
r diagonal
elements
Again off-diagonal
well below
problem
of the largest
off-diagonal
elements
elements
In this respect
have
2.
this
deceptive
PATHOLOGICALLY
In the hermitian
CLOSE
The deceptive
nature
lIEllm < E
X-lBX
problem
then,
= A such that A
quantity,
we shall,
if B is ill-conditioned
case the hypothesis
than
does not
feature.
ROOT5
IN
NONHERMITIAN
of the result
eigenvalue
transformation
E << d.
(though not as
it is the hypothesis
if it is well-conditioned.
when
diagonal
in general,
Indeed
If B has an ill-conditioned
which is deceptive.
asso-
of the eigenvalzbe
with the
are all in
H. WILKIXSOS
having
becomes
CASE
apparent
as soon as we
as its columns
n independent
eigenvectors
we have
A = X diag(&)X-l.
Using a similar notation
to that in Section
(33)
close roots
A = XD,X-l
+ XD,X-1
(39)
= B i_ C,
In the hermitian
case X is unitary
(40)
involved.
number
of a multiple
[l].
and
large.
Its Applications
though,
is irrelevant.
Algebra
is
It should be emphasized,
X that is relevant
Linear
and
1, I-
12
(1968)
Provided
6.
ITERATIVE
The
REFINEMENT
above
procedures
results
OF
have
EIGENSYSTEM
important
consequences
of a computed
In such procedures
AN
in connection
eigensystem
with
of a matrix
set of eigen-
If the system
neither
diag(,u;) = X-1R
were exact
both
rounding
with
errors
but
error.
S
If the computed
for calculating
small compared
(S -
.q).
is obtained
(13)
(Note S IS computed
for S -
an almost
of
an .q is determined
a bound
In practice
we have
system is accurate
1IS[I.
(42)
procedures
A .Y = diag(,u;) + . f
with
of (43) is therefore
exactly
Hence
R and X-lR
= S.
with well-designed
a low relative
(41)
S.)
diagonal
explicitly
The matrix
matrix
but a bound
which is exactly
similar
to A.
Now when A has a multiple
result
shows that
the off-diagonal
provided
elements
root corresponding
s is small
(and hence
of 9 associated
S -
elements
roots
find typically
of A is ill conditioned
binary
we shall
s/j,
of 5.
will be approximately
computer).
The
5 is very small),
diagonal
elements
that
2-%
permutations
elements
if /IsI/, = E
(with a t-digit
of diag&)
off-diagonal
roots will be
+ s
of the order
the form
Linear
.4lgebra
12
(196X)
10
J. H. WILKINSON
modifies
matrix
to the form
EEL WM
I I
;Ar
and because
multiple
of this Gerschgorins
roots
what
which
theorem
as for well-separated
Forgetting
(45)
EP
roots.
can be achieved
with
can be expressed
it is interesting
an approximate
matrix
to consider
X of eigenvectors
in the form
Z = X(1 + EE),
where
for
(46)
of exact
normalized
eigenvectors.
We have
X-rAx
= (1 + &E)-X-r/fX(I
= (I -
FE + ,s2E2 -
EE)
. * a) diag(&)(I
= dia.&A,) + EF + terms
+ EE)
in .9, etc.,
(47)
where
the elements
elements
Notice
that
coincident,
the
A has eigenvalues
associated
similarity
Ai = A,.
eigenvalues
which,
off-diagonal
elements
transformations
smaller
are again
of Gerschgorins
gives bounds
The weakest
bounds
being
than
When
whichever
Linear
Algebra
the system
and
Its
using
eigenis the
are themselves
merely
transformations.
the procedure
then provided
truly
E, (48)
appreciably
theorem
the off-
Hence
are of order 3.
while not
diagonal
(48)
ilieLj.
with multiple
have separations
shows that
smaller
associated
when
Ajeij
for refining
an eigensystem
Af$&ations
1, l-
12 (1968)
is used iteratively,
the final eigensystem
ALMOST
DIAGOSAL
11
MATRICES
is correct
to working accuracy.
computed
system
of vectors
= (X-i
X-iEX-i
= diag(&) Equation
. . .)AX(I
of a prescribed
are certainly
the quadratic
the bounds
bounded
and diagonal
consists
transformations
of multiple
with a minimum
consists
separation
order of fiz/S,.
into
large
using
three
we
(51)
Gerschgorins
theorem
in the following
groups.
The first
form.
group
of eigenvalues
group
separations
separation
of
of order S,
if
of
Writing
For a member
with
elements
max(Aij = fi,
For an eigenvalue
in E.
terms
eigenvalues;
separation
(50)
accuracy
off-diagonal
can be expressed
be divided
of the remainder.
a minimum
The
the eigenvalues
+ ...
and higher-order
attainable
+ X-1E)
on the attainable
precision.
2%. 2-1j[Xj/,I(X-11j,
Let
(4%
diag(&) + diag(&)X-lE
X-lE
computation
ignore
of the form
we have
X-ijfX
X-lAX
a relation
where
X=_X+E
Hence
Generally
satisfies
the
bounds
In general
better
than
2-t maxIAil
of p.
in practice-multiple
eigenvalues
eigen-
values.
REFERESCES
1 F. L. Bauer.
Optimally
Algebra
einer Matrix,
Izv.
1, l-12
Akad.
(1968)
J. H. WILKINSON
12
eigenvalucs
6 C.
G.
Duke
and H. W. Wielandt,
Math.
J. Jacobi,
J.
20(1953),
real symmetric
11-18.
8 H. P. M. van
X.
of the spectrum
of a normal
der Sacular-
9(1966),
On the quadratic
Numer. Math.
4( 1957))
Verfahren,
with non-distinct
Kempken,
method,
The variation
On the convergence
matrices
Math.
7 H. P. M. van Kempken,
Mach.
Jacobi methods
37-38.
stBrungen vorkommenden
51-94.
Jacobi
matrices,
144-162.
5 A. J. Hoffman
matrix,
of Hermitian
convergence
method for
Numer. Math.
9(1966),
of the special
cyclic
19-22.
Maximizing
functions
of rotations,
J. Ass.
Camp.
459-466.
Schtinhage,
Zur Konvergenz
des Jacobi-Verfahrens,
Numer.
Math. 3(1961),
374380.
11 A. Schonhage,
Math.
12
Zur quadratischen
6(1964),
J. Varah,
Ph. D. Thesis,
13 J. H. Wilkinson,
4(1961),
Konvergenz
Stanford
Numer.
(1967).
eigensystems,
Computev J.
230-241.
14 J. H. Wilkinson,
Numer. Math.
16 J. H. Wilkinson,
The QR algorithm
eigenvalues,
Computer
Received September
Algebra
convergence
4( 1962)) 296-300.
15 J. H. Wilkinson,
Lilaear
des Jacobi-Verfahrens,
410-412.
14.
Eigenvalue
J. 8(1963),
Problem,
85-87.
1967
Oxford
1, l-12
(1968)