Content Asp PDF

Machine Vision and Applications (1998) 10: 308–320 Machine Vision and
Applications

c Springer-Verlag 1998
Extracting characters of license plates from video sequences

Yuntao Cui, Qian Huang
Siemens Corporate Research, 755 College Road East, Princeton, NJ 08536, USA; e-mail: {cui,huang}@scr.siemens.com
Received: 13 August 1997 / Accepted: 7 October 1997
Abstract. In this paper, we present a new approach to ex- low resolution of the characters on the plate due to the appli-
tract characters on a license plate of a moving vehicle, cation requirement that the entire car has to be visible in the
given a sequence of perspective-distortion-corrected license image; 2) global thresholding or enhancement methods work
plate images. Different from many existing single-frame ap- well only when the plate is uniformly illuminated and not
proaches, our method simultaneously utilizes spatial and too noisy, which usually is not the case in real applications.
temporal information. We first model the extraction of char- Takahashi et al. [19] proposed a morphology-based thresh-
acters as a Markov random field (MRF), where the random- olding method to improve the performance in extracting
ness is used to describe the uncertainty in pixel label assign- characters from license plates. They viewed a character to
ment. With the MRF modeling, the extraction of characters is be the combinations of “ditches”, where a ditch is formed by
formulated as the problem of maximizing a posteriori prob- two edges with opposite direction. Then, they designed ap-
ability based on a given prior knowledge and observations. propriate morphological operators to enhance the area within
A genetic algorithm with local greedy mutation operator is the ditch (between two edges). Although this method is suit-
employed to optimize the objective function. Experiments able in the case when the contrast between the character and
and comparison study were conducted and some of our ex- the background is strong, it is very difficult to pick up the
perimental results are presented in the paper. It is shown that correct locations of edges in low-contrast images, a situation
our approach provides better performance than other single often occurs in real applications.
frame methods. Another set of solutions to extract the characters of a
license plate is to use adaptive thresholding (see [20] for
Key words: Document analysis – Binarization – Image se- a survey of binarization methods). Unlike global methods,
quence analysis adaptive approaches find the thresholds based on the infor-
mation from local regions. Therefore, they are capable of
dealing with non-uniformly illuminated license plate images.
However, the performance of these adaptive thresholding
methods depends on the selection of the local regions. An-
1 Introduction other problem is that these methods tend to generate broken
characters as a single character may belong to different local
Automatic recognition of car license plates plays an impor- regions that have different local thresholds. To deal with this
tant role in traffic surveillance systems. Recently, we have problem, some algorithms apply region growing to fill the
seen quite a few computer-vision-based systems that recog- holes [11, 23].
nize the license plates [2, 8, 9, 13]. Most existing systems Another alternative to improve the extraction is to use
focus on the development of a reliable optical character rec- additional data constraints from multiple frames. Multiframe
ognizer (OCR). However, prior to the recognition an OCR methods have been shown to be effective in image restora-
system performs, the characters have to be extracted from tion tasks [16, 21]. In this paper, we address the issue of
license plates. To simplify the extraction problem, the exist- extracting characters by simultaneously utilizing spatial and
ing systems assume that a license plate is a rectangular area temporal information. In our approach, the extraction of
containing a number of dark characters on a white back- characters from license plate images is modeled as a 3D
ground. With this assumption, various approaches exist that Markov random field (MRF), where the randomness is used
extract characters using global threshold methods [8, 9, 13] to describe the uncertainty in the label assignment of pix-
with sometimes global contrast enhancement prior to the ex- els. As a result, the prior knowledge which promotes the
traction [2]. Unfortunately, these methods do not work well consistency between adjacent pixels can be represented in
in most real applications because of the following factors: 1) terms of the clique functions associated with the underlying
Gibbs probability distribution function (pdf) describing the
Correspondence to: Y. Cui
309
MRF. Under the MRF modeling assumption, the extraction

problem can be formulated as an optimization problem of B1 B2
maximizing the a posteriori probability given prior knowl-
edge and observations. Then, a genetic algorithm with local
greedy mutation operator is employed to optimize the ob-
jective function. B3 B4
The paper is organized as follows. In Sect. 2, we use the
MRF model to formulate the character extraction problem
Fig. 1. A four-node clique
as an optimization problem. Then, we apply a genetic algo-
rithm with our local greedy mutation operator in Sect. 3 to
optimize the objective function. Section 4 is related to the
preprocessing of how to extract license plates from images
in a sequence. In Sect. 5, we show the experimental results.
Finally, we draw our conclusions in Sect. 6.
Localize the plate of one image
2 The MRF-model-based character extraction
The MRF model, as an extension of the one-dimensional
Markov process, has attracted much attention in the image
processing and computer vision community (e.g., [3, 4]).
MRF models can be used to incorporate prior contextual Extract feature points from the plate
information or constraints in a quantitative way. Local spa-
tial/contextual dependencies can be utilized to perform bi-
narization [7]. Another advantage of the MRF-model is that
it tends to be local, hence it is suitable for parallel hardware
implementation. In this section, we present an MRF model- Track the feature points
based approach to extract characters from multiple frames.
The model combines the prior knowledge and observations
from both spatial and temporal dimensions into a unified
framework.
Estimate motion parameters
2.1 Problem statement
Our problem is to extract characters from a moving vehicle.
Such extracted results can be fed into an OCR to perform
automatic recognition. The developed approach can be used Correct the perspective distortion
in many applications, including automatic traffic violation
control, automatic parking lot billing, etc.. Formally, assume
yl be the lth frame which contains a license plate, where l =
{1, 2, · · · , n}. Our goal is to extract one rectangular image
Fig. 2. The diagram of the plate localization and the correction of the
I of the license plate with size N1 × N2 . Each pixel zi,j ∈ I perspective distortion
is labeled either as 1 (on the character) or 0 (background
pixel).
2.3 The MRF model
2.2 Motion model
After correction of the perspective distortion, we have a
Multiple frames of license plates are obtained during differ- stack of equally sized license plate images, based on which,
ent time intervals. Since the vehicle is moving and the view we intend to obtain I using and MRF-model-based approach.
of camera is not necessarily perpendicular to the incoming Let S be a finite set of N1 × N2 sites. Consider z to be a
vehicle, the license plates from different frames have not binary random field and g = {gi,j , (i, j) ∈ S} to be a neigh-
only different size but also different perspective distortion. borhood system on S, such that
The first thing we need to do is transform each license plate
y(l) to a rectangular image fl with size N1 × N2 . For any 1. (i, j) is not in gi,j ,
l
pixel yi,j on yl , we can map it onto fil0 ,j 0 using a planar 2. (i, j) ∈ gk,l , if and only if (k, l) ∈ gi,j , for all (i, j), (k, l) ∈
surface motion model [1], where S.
i0 = i + p1 i + p2 j + p5 + p7 i2 + p8 ij , We say z is an MRF with respect to g if and only if [4]:
j 0 = j + p3 i + p4 j + p6 + p7 ij + p8 j 2 . (1) 1. P (z = s) > 0, for any realization s of z,
The coefficients pi can be solved if four correspondences 2. P (zi,j = si,j |zk,l = sk,l , (i, j) =
/ (k, l)) = P (zi,j =
are available. We will discuss this in detail in Sect. 4. si,j |zk,l = sk,l , (k, l) ∈ gi,j ),
310
Fig. 3. Five images of a moving vehicle
where P (·) and P (·|·) are the joint and conditional pdfs, where
respectively. X
An important feature of the MRF model is that its joint U (s) = Vc (s) (3)
c∈C
pdf has a general form, known as the Gibbs distribution
defined based on the concept of cliques [4]. A clique is a is the Gibbs energy function and Vc (s) is called the clique
subset C ⊆ S if and only if every pair of distinct sites in C potential, T is the “temperature” parameter. Finally,
are neighbors. Another important feature of the MRF model X 0
is that z is an MRF on S with respect to the neighborhood Z= exp−U (s )/T (4)
system g if and only if the probability distribution is a Gibbs all s0
distribution based on the cliques. A Gibbs distribution can is the normalization factor. Notice that the preceding MRF
be represented as follows: pdf is quite rich in that the clique functions can be arbitrary
as long as they depend only on the nodes in the correspond-
1 ing cliques. Therefore, the MRF-model-based approach pro-
P (s) = exp−U (s)/T , (2) vides potential advantages in the problem of the character
Z
311
Table 1. Clique energies
Configuration of B1 B2 B3 B4 Energy
0000 1.0
0001 6.0
0010 6.0
0011 3.0
0100 6.0
0101 3.0
0110 18.0
0111 6.0
1000 1.0
1001 18.0
1010 3.0
1011 6.0
1100 3.0
1101 6.0
Fig. 4. The results of the localization is shown using white rectangular
1110 6.0
1111 1.0
extraction, for example, we can define the clique functions to

promote the consistence of the labeling between neighboring
pixels.
2.4 The MRF-model-based formulation
We formulate the character extraction from multiframes as

a Bayesian MAP estimation problem. The MAP estimate is
at the maximum of the posterior probability P (z|{fl }), or
equivalently, at the maximum of the log-likelihood function
z = arg max log P (z|f1 , f2 , · · · , fn ). (5)
fl Fig. 5. The top 30 features within the license plate region
Applying Bayes’ theorem, we have
z = arg max{log P (z) + log P (f1 , f2 , · · · , fn |z)}. (6) where N is a zero mean white Gaussian random field with
fl
variance σl for each variable in N. Then, we have
The prior probability can be written as kz−zl k2
1 −
1 P (fl |z) = exp 2σ 2
l , (10)
P (z = s) = exp−U (s)/T 2
(2πσl ) N1 N2 /2
Z
X
U (s) = Vc (s) (7) for l = {1, 2, · · · , n}. Incorporate the prior and the condi-
c∈C
tional density into Eq. 6, we have
X n
X
The parameter T is assumed to be 1 for simplicity. Here, kz − zl k2
the clique energies are chosen to encourage the consistence z = arg min{ Vc (z) + }. (11)
z
c∈C l=1
2σl2
of the labeling between neighboring pixels. Encouragement
or discouragement is done by assigning some energy values The above objective function is not well behaved. Many
with a clique. We use a four node clique as shown in Fig. 1. gradient-based techniques cannot be applied here since the
For the extraction problem, each pixel Bi on the clique function is not differentiable. The genetic algorithm (GA)
is assigned either 1 or 0. Therefore, B1 B2 B3 B4 is a hex- is an adaptive search algorithm based on the mechanics of
adecimal number which has 16 different choices. Table 1 natural selection and natural genetics [5]. GA requires only
shows one set of energy which was used in the experiments. objective function values to perform an effective search. This
The observation between frames is assumed to be in- characteristic makes a GA a more canonical method than
dependent, so that the complete conditional density can be many other search schemes.
written as
n
Y
n
P (f , f , · · · , f |z) =
1 2
P (fl |z). (8) 3 Optimization using GA
l=1
GAs were introduced by Holland [6] as a computational
Let zl be the extraction result based on the single frame l.
analog of adaptive systems. Given the objective function
Assume the observation model is given by
(11), GAs can be used to find solution zi to minimize
z = zl + N, (9) the fitness value g(zi ). With GAs, a set of fixed size is
312
Fig. 6. The tracking results
Fig. 7. The results of the mapping
prepared, called a population, consisting of M individuals 3. Mutation. Given a mutation probability µ, randomly
zi = zi,1 zi,2 · · · zi,N1 ×N2 , where zi,j = 0, 1. The fitness func- choose an individual from the population and position,
tion g is used to evaluate individuals. and then invert the bit.
3.1 Simple GA
3.2 Greedy mutation operator
A simple GA is composed of three operators:
1. Selection. It is a process in which individuals are re- In the simple GA, the fittest individual in a population is
produced according to their fitness values. Intuitively, not guaranteed to survive into the next generation, and thus
we would like to duplicate the individuals whose fitness an extremely good solution to the fitness function may be
values are lower with higher probability for our minimiz- discovered and subsequently lost. One way to avoid this
ing problem. The probability of an individual zi being problem is to use elitist selection in which the best individ-
reproduced in the next generation is defined as ual in the population survives with probability one [15, 18].
1/g(zi ) Since the simple GA runs with a non-zero mutation rate, it is
Pzi = PM . (12) trivial to show that a global optimum will be reached when
j=1 (1/g(zj )) the GA is left to run infinitely. However, the crossover and
2. Crossover. It selects two individuals from the current mutation operators randomly explore the solution space in
population with a probability χ. Then, it mates them by the simple GAs. The random search is not very efficient and
exchanging the 1 ≤ l ≤ N1 × N2 right-most bits of the it makes the GAs converge very slowly, especially when the
two individuals, where the number of the exchanged bits, entire solution space is large, like our case. In this paper, we
l, is chosen uniformly at random from [1, N1 × N2 ]. use a local greedy mutation operator to speed up the con-
313
Fig. 8. The results of the character extraction using Park’s

method
Fig. 9. The results of the character extraction using Yanowitz

and Bruckstein’s method
the objective function. A simple probabilistic Markov chain

is adopted to model the behavior of the algorithm. Consider
Fig. 10. The result after 20 iterations a stochastic process {zi , i = 0, 1, 2, · · · , N }. If zn = i, then
the process is said to be in state i at time n. We assume that,
whenever the process is in state i, there is a fixed probability
vergence. Let zi,j be the bit selected to flip. We define the Pi,j that it will be in state j. Such a stochastic process is
local greedy flip probability P f to be known as a Markov chain [14].

µ1 if g(zi ) < g(z0i ) Theorem 1. Let E be a simple greedy problem and N be
Pf = , (13)
µ2 otherwise the number of possible populations. The expected generation
where z0i is zi with the jth bit flipped and µ1 > µ2 > 0. number that the GA population will contain a copy of the
A non-zero µ2 is necessary to prevent the GA from being optimum is O(log N ) when the greedy mutation probability
stuck on the local minima. Due to the localalism of both µ1 = 1.0 and the crossover probability χ = 0.0.
the prior (cliques) and the conditional density, only a few Proof. Since E is a simple greedy problem, at any time
local pixels are needed to compute the difference between the population has jth best solution, then after the greedy
g(zi ) and g(z0i ). This greedy mutation operator utilizes the mutation, the population should have a better solution. As-
problem information, thus making the solution search more sume that the new solution is equally likely to be any of
efficient. the j − 1 best. This can be modeled by a Markov chain for
which P1,1 = 1 and
3.3 Expected convergence rate 1
Pi,j = , (15)
with the greedy mutation operator i−1
where j = 1, 2, · · · , i−1, i > 1. Let Ti denote the number of
Let l = N1 × N2 be the length of the binary strings, then transitions needed to go from state i to state 1. A recursive
r = 2l is the total number of possible strings. If n is the formula for E(Ti ) can be obtained by conditioning on the
population size, then the number of possible populations N initial transition:
is [10]
i−1
1 X
n+r−1 E(Ti ) = 1 + E(Tj )
N= . (14) i−1
n j=1
i−1
X
As can be seen, even for a relatively small image 64×64, N
can be very large. So, with the random crossover and mu- = 1/j. (16)
tation operators, the GAs converge very slowly. The greedy j=1
mutation operator which utilizes the problem information Since

is expected to be more efficient. In this section, we use a Z N N −1 Z N −1
Markov chain model to see how the greedy mutation oper- dx X dx
< 1/j < 1 + (17)
ator moves in the solution space and answer the following 1 x 1 x
1
question: what is the expected generation number that the
GA population will contain a copy of the optimum. The or
closed form analysis is difficult in general. Here, we con- N
X −1
sider a type of problems called simple greedy problems. log N < 1/j < 1 + log(N − 1), (18)
1
Definition 1. Let zi be a realization in the solution space of
a problem E, which is defined to minimize g(zi ). If, for any and so
zi , there exists a j such that g(z0i ) < g(zi ), where z0i is zi with E(Ti ) = O(log N ). (19)
the jth bit flipped, then P is a simple greedy problem.
Theorem 1 indicates that the greedy mutation converges
For a simple greedy problem, we can use the greedy much faster than random search for simple greedy problems.
mutation operator, where µ1 = 1.0 and χ = 0.0, to minimize What if the problem is not a simple greedy problem? This
314
Fig. 11. Five images of the second sequence of a moving vehicle
means that the search is stuck in a situation in which a single Figure 2 illustrates the scheme of the plate localization and
flip cannot minimize the objective function. In that case, we the correction of the perspective distortion.
have to use a non-zero µ2 to pull out from the local minima. In this scheme, we first localize the license plate in an
Once it is out of the local minima, we can continue to apply image. Next, we identify distinct point features from the
the greedy mutation operations. detected license plate region. The feature points are tracked
in subsequent frames. Based on tracked correspondences,
warping parameters can be computed and used to map all
4 License plate localization and distortion correction the detected license plate to specified windows of fixed size.
Prior to applying our algorithm to extract characters from 4.1 Localization of the license plate
the license plates, we first have to localize the license plate
in each frame of a sequence and then to warp them to a We assume that region of the license plate consists of dark
fixed-size window by correcting the perspective distortion. characters on top of a light background. The localization of
315
Fig. 11. (continued) Experimental results of the second sequence. a The extracted license plates of third
sequence. b The results of the character extraction using Park’s method. c The results of Yanowitz and
d Bruckstein’s method. d The result after 20 iterations of our method
the license plate can be considered as finding text in images. 4.3 Feature tracking
In our current implementation, we use the spatial variance
method proposed by Zhong et al. [24] to locate the text We assume that vehicles are moving along the road. The
region in the input image. The main idea of this method can trajectories of points on the license plates are parallel in
be explained as follows. If we compute the spatial variance 3D space. These parallel trajectories have the same vanish-
along the line of the license plate direction, we see that the ing point in the image plane [12]. Let V = (Vx , Vy ) be the
regions with high variance corresponding to the text region, vanishing point in the image plane and P = {(xi , yi )|i =
and regions with low variance corresponding to the non-text 1, 2, · · · , n} be the points at the current frame. Our goal is
region. Given the fact that the camera is mounted at a fixed find t,r,c, such that
position, we roughly know what area in the image the license n
X
plates should appear which is independent of the motion of (kI(xi , yi ) − I(x0i + r, yi0 + c)k2
vehicles. Additionally, the range of the size of license plates i=1
can be used to discard some spurious detections.
+kG(xi , yi ) − G(x0i + r, yi0 + r)k2 ) (21)
is minimized. Here,
x0i = Vx + (1 − t)xi ,
4.2 Feature extraction yi0 = Vy + (1 − t)yi . (22)
Here, I is the intensity and G is the gradient. Both intensity
and edgeness are used in the tracking as described in [22].
For each pixel in the detected text region, we compute its Conditions −3 ≤ r ≤ 3 and −3 ≤ c ≤ 3 are used to
feature using a window of size 5 × 5. Shi and Tomasi’s [17] do some local refinement in the case where the vehicle has
method was used. Let (x, y) be a pixel location within a slight direction change and the vanish point estimation has
detected text region and Ix (x, y) and Iy (x, y) be its partial errors. The search of the solution is unidirectional and very
derivatives. At each pixel, we have a matrix whose elements fast.
are sums of the products of the partial derivatives taken in
a 5 × 5 window.
4.4 Correction of the perspective distortion
P P
Ix2 (x, y) IP
x (x, y)Iy (x, y) Tracked feature points provide correspondences needed to
A= P . (20)
Ix (x, y)Iy (x, y) Iy2 (x, y) solve motion parameters. Since license plates are planar ob-
jects, we need only to solve for warping parameters in or-
der to correct perspective distortion of different frames. The
The second eigenvalue of this matrix is used as the rank at planar surface motion model is described in Sect. 2.2. Us-
pixel (x, y). Across the entire text region, we use this rank to ing warping parameters, we map all the license plate images
rate all the pixels. Currently, the top 30 among all such rated onto fixed-sized rectangular windows, providing a stack of
pixels are chosen to be the set of features that represents the license plate images for the MRF-model-based character ex-
text region and will be tracked along the sequence. traction mechanism.
316
Fig. 12. Six images of the third sequence of a moving vehicle
5 Experimental results Yanowitz and Bruckstein’s [23] methods. These two meth-
ods are among the best according to [20].
We implemented our MRF-model-based approach to extract Figure 3 shows five images of the first sequence of a
characters from image sequences. The sequences of the mov- moving vehicle.
ing vehicles were obtained from a stationary video camera. The top 30 features of Fig. 4 are shown in Fig. 5.
The experimental results from three different sequences are The tracking results are shown in Fig. 6.
shown and discussed here. These sequences are representa- The tracking results were used to compute the motion
tive, because they vary in terms of the resolution of the parameters. These motion parameters were then used to cor-
license plate images and the illumination conditions. We rect the perspective distortion. After perspective correction,
show them in a descending order, which means the first we have a stack of equally sized license plate images as
sequence is the best in terms of the resolution and the illu- shown in Fig. 7.
mination conditions and the third is the worst. For compari- Now, we are ready to apply our algorithm to extract the
son, we also show the results obtained using Park’s [11] and characters from the sequence of rectangular frames. Park’s
317
Fig. 12. (continued) Experimental results of the third sequence. a The extracted license plates of third
sequence. b The results of the character extraction using Park’s method. c The results of Yanowitz and
d Bruckstein’s method. d The result after 20 iterations of our method
[11] method was used to extract the characters from each characters, but also has quite a few noises because of spu-
individual frame. The results are shown in Fig. 8. We also rious edges generated by the edge detector. The size of the
show the results of Yanowitz and Bruckstein’s [23] method plate image is 40 × 280. The block size which we use for
in Fig. 9. Park’s method is 40 × 35, which is roughly the size of one
In the current implementation, we set the mutation prob- character in the plate.
ability µ1 = 0.9 and µ2 = 0.01 and the crossover probability The next two sequences have poorer resolutions of the
χ = 0.001. We use µ1 = 0.9 instead of µ1 = 1.0 to avoid the license plate images and the illumination conditions are also
oscillation between two individual bits. The population size worse than in the first sequence. Same procedures were ap-
was 100. The use of the greedy mutation operator makes plied to these two sequences. Figures 11 and 12 show the
convergence very fast. Figure 10 shows the result after 20 results. The third sequence has very poor resolutions and il-
generations. The result is clearly better than both Park’s and lumination conditions. Both Park’s and Yanowitz and Bruck-
Yanowitz and Bruckstein’s single-frame methods. Yanowitz stein’s methods failed. However, our method still gives rea-
and Bruckstein’s method generates the thin skeletons of the sonable results.
318
Fig. 13. Five images of a moving vehicle
Finally, we applied our approach to extract the numbers clique functions associated with the underlying Gibbs pdf
from containers. A sequence of images of a container is describing the MRF. Under the MRF modeling assumption,
shown in Fig. 13. The results are shown in Fig. 13a–d. extraction is then formulated as the optimization problem of
maximizing the a posteriori probability given prior knowl-
edge and observations. Then, a genetic algorithm with local
6 Conclusions greedy mutation operator is employed to optimize the ob-
jective function. Our experimental results have shown better
In this paper, we present a new approach to extract charac- performance than other single-frame methods.
ters from license plates given a sequence of images. In our By using the greedy mutation operator, we can dramat-
approach, the extraction of characters from license plate im- ically reduce the computation cost. However, it is still far
ages is modeled as an MRF, where the randomness is used from the real-time requirement. Fortunately, the evaluation
to model the uncertainty in the assignment of the pixels. As a of our objective function is local, and it is suitable for par-
result, the prior knowledge which promotes the consistency allel hardware implementation.
between adjacent pixels can be represented in terms of the
319
Fig. 13. (continued) Experimental results of the second sequence. a The extracted container numbers. b The results of
the character extraction using Park’s method. c The results of Yanowitz and Bruckstein’s method. d The result after 20
d iterations of our method
Acknowledgements. We would like to thank Dr. Shang-Hong Lai for sug- 4. Geman S, Geman D (1984) Stochastic relaxation, Gibbs distribution
gestions on MRF and GAs, and Dr. Marie-Pierre M. Jolly for providing us and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach
the codes of Yanowitz and Bruckstein’s method. Intell 6(6):721–741
5. Goldberg DE (1989) Genetic Algorithm in Search, Optimization and
Machine Learning. Addison-Wesley, Reading, Mass.
6. Holland JH (1975) Adaptation in Natural and Artificial Systems. Uni-
versity of Michigan Press, Ann Arbor, Mich.
References 7. Mardia KV, Hainsworth TJ (1988) A spatial thresholding method for
image segmentation. IEEE Trans Pattern Anal Mach Intell 10(6):919–
927
1. Adiv G (1985) Determining three-dimensional motion and structure 8. Nijhuis JAG, ter Brugge MH, Helmholt KA, Pluim JPW, Spaanenburg
from optical flow generated by several moving objects. IEEE Trans L, Venema RS, Westenberg MA (1995) Car license plate recognition
Pattern Anal Mach Intell 7(4):384–401 with neural networks and fuzzy logic. In: Proceedings of IEEE In-
2. Comelli P, Ferragina P, Granieri MN, Stabile F (1995) Optical recog- ternational Conference on Neural Networks, Perth, Western Australia,
nition of motor vehicle license plates. IEEE Trans Vehicular Technol pp 2185–2903
44(4):790–799 9. Nishiyama K, Kato K, Hinenoya T (1991) Image processing system
3. Cross GR, Jain AK (1983) Markov random field texture models. IEEE for traffic measurement. In: Proceedings of International Conference
Trans Pattern Anal Mach Intell 5(1):25–39
320
on Industrial Electronics, Control and Instrumentation, Kobe, Japan, Yuntao Cui received his BS degree
pp 1725–1729 from Shanghai Jiao Tong University
10. Nix AE, Vose MD (1992) Modelling genetic algorithm with Markov in 1988, MS degree from Southeastern
chains. Ann Math Artif Intell 5:79–88 Massachusetts University in 1991, and
11. Park JR (1991) Gray-level thresholding in badly illuminated images. PhD from Michigan State University in
IEEE Trans Pattern Anal Mach Intell 13(8):813–819 1996, all in Computer Science. He is
12. Rangarajan K, Shah M (1992) Interpretation of motion trajectories currently with Siemens Corporate Re-
using focus of expansion. IEEE Trans Pattern Anal Mach Intell search, Inc., Princeton, NJ. His current
14(2):1205–1210 research interests are video processing
13. Raus M, Kreft L (1996) Reading car license plates by the use of arti- applications for surveillance, multime-
fical neural networks. In: Proceedings of 38th Midwest Symposium on dia, and visualization.
Circuits, Systems, USA, pp 538–541
14. Ross SM (1993) Introduction to Probability Models. Academic Press,
San Diego, Calif.
15. Rudolph G (1994) Convergence analysis of canonical genetic algo-
rithms. IEEE Trans Neural Networks 5(1):96–101
16. Schultz RR, Stevenson RL (1996) Extraction of high-resolution frames
from video sequences. IEEE Trans Image Process 5(6):996-1011 Qian Huang got her BS degree in
17. Shi J, Tomasi C (1994) Good features to track. In: IEEE Conference Computer Engineering from Shanghai,
on Computer Vision and Pattern Recognition Seattle, WA, pp 593–600 China. She obtained her MS degree in
18. Suzuki J (1995) A Markov chain analysis on simple genetic algorithms. both computer science and industrial
IEEE Trans Syst Man Cybernetics 25(4):655–659 engineering from Kansas State Univer-
19. Takahashi Y, Shio A, Ishii K (1993) Morphology based thresholding sity. Dr Huang’s PhD is in the domain
for character extraction. IEICE Trans Inf Syst E76-D(10):1208–1215 of computer vision and pattern recog-
20. Trier OD, Jain AK (1995) Goal-directed evaluation of binarization nition from Michigan State University.
methods. IEEE Trans Pattern Anal Mach Intell 17(12):1191–1201 Her current research interests include
21. Tsai RY, Huang TS (1984) Multiframe image restoration and registra- real time video processing, content based
tion. In: Tsai RY, Huang TS (eds) Advances in Computer Vision and visual information archiving, indexing,
Image Processing, volume 1. pp 317–339 and retrieval, new media systems, and
22. Weng J, Ahuja N, Huang TS (1992) Matching two perspective views. integration of visual and non-visual in-
IEEE Trans Pattern Anal Mach Intell 14(8):806–825 formation in multimedia applications.
23. Yanowitz SD, Bruckstein AM (1989) A new method for image seg-
mentation. Comput Vision Graphics Image Process 46(1):82–95
24. Zhong Y, Karu K, Jain AK (1995) Locating text in complex color
images. Pattern Recognition 28(10):1523–1535

Content Asp PDF

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Content Asp PDF

Uploaded by

Copyright:

Available Formats

Machine Vision and Applications (1998) 10: 308–320 Machine Vision and

Extracting characters of license plates from video sequences

Received: 13 August 1997 / Accepted: 7 October 1997

MRF. Under the MRF modeling assumption, the extraction

Fig. 3. Five images of a moving vehicle

Table 1. Clique energies

extraction, for example, we can define the clique functions to

2.4 The MRF-model-based formulation

We formulate the character extraction from multiframes as

Fig. 6. The tracking results

Fig. 7. The results of the mapping

Fig. 8. The results of the character extraction using Park’s

Fig. 9. The results of the character extraction using Yanowitz

the objective function. A simple probabilistic Markov chain

mutation operator which utilizes the problem information Since

Fig. 11. Five images of the second sequence of a moving vehicle

Fig. 12. Six images of the third sequence of a moving vehicle

Fig. 13. Five images of a moving vehicle

You might also like