Professional Documents
Culture Documents
I. INTRODUCTION
Tracking points on the face image from a video
sequence is the object to provide possibility for
computers to generate descriptive vectors on the
position of these points over time.
The input of such system is a set of points on the
face of the first frame of a video sequence (chosen
on the face) and the output is to present the vectors
describing the new position points followed by
succession the video images over time. The major
challenge of tracking is to identify the most similar
region to the original points surrounded regions of
face.
Were interested in our work to track feature
point on the face using Lucas-Kanade methode. A
lot of methods have been developed in this research
area including 2D and 3D reconstruction. All these
methods used the principal of motion estimation
(ME).
Several studies have been implemented to
tracking video and 3D facial reconstruction [21],
[22], [23].
Motion estimation (M.E.) has been studied by
many authors like Wang, Weiss, Adelson and
Bergen [5, 6, 7], Heeger [8], Horn and Schunk [9]
Weber and Malik [10], Wu and Kanade [11], and
more recently Leduc [12], Bernard [13], Lee [14].
It should be noted first that the motion estimation
is a quantification of the simple motion, directed by
IEEE Card
CANON Camera
Video Record.
Hard disk
Fig.1 Block diagram of the experiment used for video
capture.
Fig 3.
Center of projection
Virtual image of
I x (q n )Vx + I y (q n )Vy = - I t (q n )
Fig 4. transparent
Facial
image
Retroprojector
with grid
Retro-projector
Fig. 2
Mirror
Iy(q1 )
I x (q1 )
I t (q1 )
Iy(q2 )
I x (q2 )
I t (q2 )
V
, v x , b ...........
A ................
Vy
................
...........
I x (qn )
I t (qn )
Iy(qn )
Camera
Fig. 5
Fig. 3
i=1
j=1
Apply the algorithme : locate the region j looks
like the model j in image i, and then define the
new center of point j (Pi,j).
j=j+1
j<=N
i=i+1
i<=M
Fig. 6
V. FEATURE EXTRACT
We took several videos. Each speaker repeat the
sentence (on est le 11 juin 2011, il est 15 heure : its
June 11, 2011, it is 15 oclick ) more times.
Fig. 9
Fig. 8
VII.
VIII.
CONCLUSION
We propose in this paper a method of tracking
feature points on face. Using a video acquisition
and a grid, it makes easier to locate key points on
the studied face.
A lot of points have been used in order to have
good performance to represent correctly the face
shape using MPEG-4 key points.
After selection of key points on the first image of
the video, then we apply the lucas-kanade
algorithm, an optical flow estimation, which
assumes that the flow is essentially constant in a
local neighbourhood of the pixel under
consideration. To verify the position i on the picture
we have make a label on all selected points, where
good results were obtained by using a window of
21 pixels. The points inside the lip contour havent
been correctly tracked because of the loss
information during opening and closing lip.
This work tracks the movement of facial talking
face, so we create a 3D matrix containing the
position (x, y) of each point of standard MPEG-4 at
different moments. The second task of this work is
to generate a mesh grid to reconstruct models of
faces using the method of triangulation delauny.
The results are satisfactory. From a neutral face, we
can generate new positions of key points.
The future work is to analyse several visual
phonemes then create a database containing the
position of each point in order to generate a talking
face. Our method can be used in analysis/synthesis
of talking face.
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]
[10]
Fig. 14 Application of the Delaunay triangulation on points
tracked in an image.
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]