Professional Documents
Culture Documents
Bertrand Merlier
Frieder Weiss
Aurachweg 6
90449 Nrnberg
RFA
http://www.frieder-weiss.de/
info@frieder-weiss.de
ABSTRACT
This paper takes stock of a decade performing
electroacoustic music thanks to a spatial gesture
interface (3DGC), designed in 2002 for a complex
instrumental control. 10 years later, motion capture
technology has seriously improved, yet our device has
not aged a bit.
The following lines recall its main features.
Highlighted specificities appear to be perfectly suited
to instrumental music, in a comprehensive and timeless
way. The author shows that musical gesture control is
not really a hardware problem, but rather a software
problem i.e. making the good choices for gesture
detection and analysis. The 3DGC gives many ideas on
this topic.
Excerpt of 2004 paper introduction:
Both hands gestures are analyzed by means of a camera
and by a motion capture software. Contrary to classical
instrument interfaces, neither any mechanical organ nor
any mark is visible! This concept offers important
freedom of gestural control in any 3 dimensions of
space. However it requires reconsidering the relation
between cause and effect, i.e. between gesture and
sound result. We shall present several specific features
linked to lack of any visual mark or mechanical
constraints. Then we shall underline the interest of
programmability and simultaneous control of several
parameters in only one simple gesture.
First use of 3DGC is to control generation,
transformation and spatialization of live electroacoustic
music. Many other artistic applications are imaginable
in the multimedia area, such as live video pictures
generation and transformation.
Keywords: HCI, Human Computer Interface, virtual
instrument, computer-based musical instrument,
gestural controller, motion capture software,
electroacoustic music live performance, hand gesture
analysis by means of a camera, Eyecon, MAX/MSP.
1. CONTEXT
1.1. Aim of our research
Our research fits into the context of creating innovative
electronic music to be performed live. Composition and
interpretation of objets sonores or musique de
timbre frequently require a special job of lutherie
informatique, in order to be able to simultaneously and
effectively control all the numerous sound parameters.
1.2. Gestures types and uses
The present interface uses both hand gestures to
accurately and effectively control1 a sampler playing
complex electroacoustic sounds (rich timbres whose
heights are not necessarily defined) with multiple
modulations possibilities (see pictures 1 and 3).
Using both hands motion tracking aims at replacing
the synthesizer keyboard, leaving the musical keyboard
paradigm2 and the world of mechanics in order to obtain
a new control interface with more degrees of freedom
and greater wealth of play.
This virtual controller - detached from any
mechanical constraint - has many degrees of freedom in
three dimensions, such as: detection of "polyphonic"
positions, accelerations, surface, form factors,
symmetry, distance between both hands...
Picture 4: the 2 MIDI settings panels (notes and controllers) associated to a detector
2.1.1.
1.3. Birth of the device
In 2002, winner of an artist residency funded by the city
of Dresden and the Art Festival CyNet, I designed a Tridimensional Gestures Controller (3DGC): a virtual
instrument based on a sampler, driven by a both hands
motion tracking system. This device has been used in
dozens of concerts in Europe, in solo, in duets (music +
video) or in trios. Project 4 Hands a Multimedia
instrumental duet used two 3DGC for creating sound
and live video. It was presented in September 2002 at
Ars Electronica Festival (Linz Austria).
The 3DGC was the subject of several publications,
including Sound and Music Computing at IRCAM Paris in 2004.
Interests of the device are: affordability, simple
standard and proven technology, small size, portability,
reliability and accuracy in detecting the gesture...
1.4. 4 Hands
[ 4 Hands ] is a multimedia instrumental duet using 2
3DGC. Two performers on stage are playing each their
own invisible instrument. Jean-Marc DUCHENNE video creator - performs live video sequences, cast on a
main screen hanging at the rear of the stage and
Bertrand MERLIER - composer - performs live
electroacoustic music on 2 or 4 loudspeakers.
Four hands spin, caress, knock, scratch, stretch,
describe complex arabesques, strange figures. The
gestures of the two performers obviously control images
and sounds, but no instrument is visible!
The pictures of the two performers' hands are also
projected on two lateral screens. Soft lightings may also
create hand shadows on the walls or on the ceiling.
The apparent simplicity of the equipment and of the
stage design, the use of intimist lightings lead the
audience to specially focus on the gestures themselves
and on the main elements of the multimedia discourse.
The "4 Hands" project was presented in September
2002 at Ars Electronica Festival (Linz - Autriche), and
then in several European electronic arts festivals.
Some presentation elements (music, pictures and
video excerpts) can be seen at: http://tc2.free.fr/4hands/
2. 10 YEARS LATER
2.1. A brief overview of the technology
As motion capture is concerned, we must distinguish
two levels: the gesture capture device (input device) and
the gesture analysis or interpretation algorithm.
New sensors
1 http://www.xbox.com/fr-FR/kinect/home
or
http://www.microsoft.com/en-us/kinectforwindows/
2 http://leapmotion.com/
3 http://www.eyesight-tech.com/
4 See for example: Proceedings of the International Workshop on
Recognition, Analysis, and Tracking of Faces and Gestures in RealTime Systems, 1999 or 2001, IEEE Conference Publications.
http://ieeexplore.ieee.org/
Or else: Proceeedings of Workshop on Motion and Video Computing,
2009, IEEE Conference Publications.
Rehg, J.M. and Kanade, T., DigitEyes: vision-based hand tracking for
human-computer interaction, Proceedings of the 1994 IEEE
Workshop on Motion of Non-Rigid and Articulated Objects, 11-12
Nov 1994, p. 16 22.
5 http://live.leapmotion.com/about.html
6 Note the interesting OpenCV library (Open Computer Vision) for
Python language: a free graphics library, originally developed by
Intel, that specializes in realtime picture processing. Since 2010, the
Willow Garage Robotics Society provides has been supporting for this
Library. http://opencv.willowgarage.com/wiki/
7 IRISIS software developed by Collectif & Cie in Annecy France: http://collectifetcie.free.fr/irisis.htm
Cyclops :Analyze and Track Live Video, developed by Eric Singer:
http://cycling74.com/products/cyclops/
Reverse engineering has determined that the Kinect sensor outputs video at a
frame rate of 30 Hz. The RGB video stream uses 8-bit VGA resolution (640
480 pixels) with a Bayer color filter, while the monochrome depth sensing video
stream is in VGA resolution (640 480 pixels) with 11-bit depth, which provides
2,048 levels of sensitivity. The Kinect sensor has a practical ranging limit of 1.2
3.5 m (3.911 ft) distance when used with the Xbox software. The area required
to play Kinect is roughly 6 m2, although the sensor can maintain tracking through
an extended range of approximately 0.76 m (2.320 ft). The sensor has an
angular field of view of 57 horizontally and 43 vertically, while the motorized
pivot is capable of tilting the sensor up to 27 either up or down. The horizontal
field of the Kinect sensor at the minimum viewing distance of ~0.8 m (2.6 ft) is
therefore ~87 cm (34 in), and the vertical field is ~63 cm (25 in), resulting in a
resolution of just over 1.3 mm (0.051 in) per pixel.1
1
2
http://en.wikipedia.org/wiki/Kinect
http://cycling74.com/products/cyclops/
= 240
1000
ms
2000
ms
4000
ms
8000
ms
500
ms
1000
ms
2000
ms
4000
ms
250
ms
500
ms
1000
ms
2000
ms
125
ms
250
ms
500
ms
1000
ms
62,5
ms
125
ms
250
ms
500
ms
31,25
ms
62,5
ms
125
ms
250
ms
= 120
= 60
= 30
g) symmetry detection;
'
'
4. REFERENCES
4.1.1.
Books
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
4.1.2.
[9]
Papers
Bertrand MERLIER
(Bertrand.Merlier@univ-lyon2.fr)
Electroacoustic and instrumental composer
Lecturer in Computer Music at the Lyon 2 University France.
co-director of professional master MAAAV (Music
Applied to Visual Arts)
For several years, he has been conducting researches on
gestural interfaces in electroacoustic music, on
interactive music and sound spatialization...
Author of "vocabulary of space" published in 2006 by
Delatour Editions.
http://tc2.free.fr/merlier/