Professional Documents
Culture Documents
Multimodal
Interfaces
Overview
Multimedia Content Representation
Introduction,
definitions, etc.
MMI_05
Multimodal
Interfaces
Overview
Information Visualization
Introductio,
Definitions
The Power of Information Visualization
Visualization for What ?
techniques
MMI_05
Multimodal
Interfaces
Whats Multimedia?
Multi: Many
Media:
Multimedia:
MMI_05
Multimodal
Interfaces
MMI_05
Multimodal
Interfaces
Representation Media
How
Presentation Media
Physical
MMI_05
Multimodal
Interfaces
Transmission Media
Physical
MMI_05
Multimodal
Interfaces
Context
Recent advances in the technologies of communication,
computer science and electronics have facilitated production,
and distribution of multimedia data
MMI_05
Multimodal
Interfaces
MMI_05
Multimodal
Interfaces
Text-based retrieval
Approach
MMI_05
10
Multimodal
Interfaces
Content-based retrieval
Approach
MMI_05
11
Multimodal
Interfaces
Generation
production/authoring
Representation
compression
Storage
file
Transmission
networking
search
database
and retrieval
delivery
of multimedia
information
and formats
system design
server
MMI_05
tools
issues
management
design, streaming
12
Multimodal
Interfaces
Actions
Tree-based indexing
Text database
Transmission
Multimedia analysis
Multimedia data
Multimedia Indexing
Actions
Transmission
Multimedia database
MMI_05
13
Multimodal
Interfaces
Audio
Sound Fundamentals
Sound
MMI_05
14
Multimodal
Interfaces
MMI_05
15
Multimodal
Interfaces
Problems
Spyware,
MMI_05
16
Multimodal
Interfaces
feature extraction
Audio classification and Retrieval
and expensive
Insufficient
MMI_05
17
Multimodal
Interfaces
Audio Repository
Classification: Male,
Laughing,
Features Extraction
Indexing:Using
feature describe
audio unit
Audio Example
Keywords
Features
Extraction
Retrieval
User
Interface
Audio Database
Browsing
MMI_05
18
Multimodal
Interfaces
Features
Frequency-domain Features
Audio classification
Goal
MMI_05
19
Multimodal
Interfaces
Conclusions
MMI_05
20
Multimodal
Interfaces
Image
Whats an Image?
Pixel
9 Picture elements in digital images, it usually indicate a point in an image.
Image Resolution
9 The number of pixels in a digital image.
Depth
9 The number of bit used to characterize each pixel information.
Bit Map: 1 bit/pixel, Gray scale: 2-8bits/pixel, Full color: 24 bits/ pixel, Color
mapped: 8 bits/ pixel
Image Depth, Monochrome/Bit-Map, Dithering, Gray Scale Images, 8-bit/24bit Color Images, Image Format, etc.
MMI_05
21
Multimodal
Interfaces
Image Retrieval
Text-based retrieval
Using
MMI_05
22
Multimodal
Interfaces
Image retrieval
MMI_05
23
Multimodal
Interfaces
Image retrieval
MMI_05
24
Multimodal
Interfaces
Image retrieval
Content-based image retrieval (CBIR)
Examples
Challenges
MMI_05
25
Multimodal
Interfaces
MMI_05
26
Multimodal
Interfaces
MMI_05
27
Multimodal
Interfaces
MMI_05
28
Multimodal
Interfaces
Variety of Similarity
Degree of difficulty
Histogram matching
Texture analysis
Similar shape/pattern
Image Segmentation,
Pattern recognition
MMI_05
29
Multimodal
Interfaces
Indexing
Use any text available: Title, Subject, Caption
Use content information: Colour histogram, Shape, Texture
MMI_05
30
Multimodal
Interfaces
Image retrieval
Demonstrations
http://zomax.wins.uva.nl:5345/ret_user/
http://www.ifp.uiuc.edu/~nakazato/CBIR/
Other demos
http://eidetic.ai.ru.nl/egon/cogw/co440/CBIR_Demo-s.html
http://www.ee.surrey.ac.uk/Research/VSSP/imagedb/demo.html
http://www.fb9-ti.uni-duisburg.de/rotdemo.html
http://mmdb.ece.ucsb.edu/~demo/corelacm/
Conclusions
Image
Retrieval
Content-based Image Retrieval (CBIR)
General Measures:
9 Gray intensity, Color, Texture, Shape
Distances
Measures:
MMI_05
31
Multimodal
Interfaces
Video
Video consists of images
Whats the interval of images?
Sampling rates must be high enough to avoid motion "aliasing.
1. At
least 15 frames/Sec
2. 30 frames/ Sec appears smoothly
3. At least 50 frames/ sec needed in the ideal case
MMI_05
32
Multimodal
Interfaces
Surveillance
On-Demand
MMI_05
33
Multimodal
Interfaces
Sample Query
Text : Find pictures of George Washington
Image:
Video:
MMI_05
34
Multimodal
Interfaces
User
Result
MMI_05
35
Multimodal
Interfaces
MMI_05
36
Multimodal
Interfaces
A set of
shots
Keyframe browser
combined with
transcript or objectbased search
The MMI team
MMI_05
37
Multimodal
Interfaces
Information Need
Video Structure
Result
The MMI team
MMI_05
38
Multimodal
Interfaces
Ideal solution
Video Database
User
Information Need
Video Structure
Understanding the
semantic meaning and
retrieve
Result
The MMI team
MMI_05
39
Multimodal
Interfaces
Ideal solution
However,
1. Hard to represent query in natural
language and for computer to understand
2. Computers have no experience
3. Other representation restriction like
position, time
Video Database
User
Information Need
Video Structure
Understanding the
semantic meaning and
retrieve
Result
The MMI team
MMI_05
40
Multimodal
Interfaces
Alternative Solution
Video Database
User
Video Structure
Provide evidence of
relevant information ( text,
image, audio)
Information Need
Result
The MMI team
MMI_05
41
Multimodal
Interfaces
information
Image information
Motion information
Audio information
MMI_05
42
Multimodal
Interfaces
Text
Information
Keyword
Information Need
Video Structure
Image
Information
Query
Images
Motion
Information
Motion
Audio
Information
Audio
MMI_05
43
Multimodal
Interfaces
MMI_05
44
Multimodal
Interfaces
Video retrieval
Primitives of Color Moments Method
http://debut.cis.nctu.edu.tw/Demo/ContentBasedVideoRetrieval/CBV
R/PrimitivesE/index.html
R/DominantE/index.html
Combination Method
http://debut.cis.nctu.edu.tw/Demo/ContentBasedVideoRetrieval/CBV
R/demoE.html
MMI_05
45
Multimodal
Interfaces
Information Need
Video Structure
Understanding the
semantic meaning and
retrieve
Result
The MMI team
MMI_05
46
Multimodal
Interfaces
TSR Study
Retrieval
Production
TV news
Indexing
MMI_05
47
Multimodal
Interfaces
Production
Script
Subtitle
Teletext
Edited Video
Described rushes
Journalist
commentaries
Described video
relevant to the
query
Ineffective
information
exchange
Indexing
MMI_05
Retrieval
Video
segments
described
following the
TSR scheme
(places,
events,
persons,
dates, etc.)
48
Multimodal
Interfaces
MPEG-7 in Practice
Library of audiovisual descriptions
Coverage
MMI_05
49
Multimodal
Interfaces
MMI_05
50
Multimodal
Interfaces
<Mpeg7>
<StillRegion id = news>
</StillRegion>
</Mpeg7>
Title
MMI_05
51
Multimodal
Interfaces
<Mpeg7>
<StillRegion id = news>
<SpatialDecomposition>
<StillRegion id = background>
Back ground
<VisualDescriptor
features
xsi:type=DominantColorType>
110 108 140
</VisualDescriptor>
<StillRegion id = speaker>
</SpatialDecomposition>
</StillRegion>
</Mpeg7>
MMI_05
52
Multimodal
Interfaces
<StillRegion id = speaker>
<TextAnnotation>
<FreeTextAnnotation> Journalist
Anna Blanco
More features
</FreeTextAnnotation>
</TextAnnotation>
<Mask xsi:type="SpatialMaskType">
<SubRegion>
<Poly>
<Coords> 80 288, 100 200, ,
352 288
</Coords>
</Poly>
</SubRegion>
</Mask>
</StillRegion>
</Mpeg7>
MMI_05
53
Multimodal
Interfaces
Provide a common TV news retrieval platform for professional and nonprofessional users
Design
Example
Find A news item in the context of Euro 2000 football games containing a shot of at least 5 seconds
showing a French football supporter saying que le meilleur gagne
MMI_05
54
Multimodal
Interfaces
Its duration
Physical is
at leastView
5 seconds
It is
in the
context
Thematic
of EURO
View2000
football games
A video segment
I can hear
Audio
Que le meilleur
View
gagne!
It is
Production
a news
Viewitem
It contains a shot
Visual
showing an French
View
football supporter
MMI_05
55
Multimodal
Interfaces
COALA
Audiovisual
Repository
system
Description
system
TV news
MPEG-7 corpus
Retrieval
system
Visualization
system
MMI_05
56
Multimodal
Interfaces
Indexing tool
Demo
The MMI team
MMI_05
57
Multimodal
Interfaces
Indexing tool
MMI_05
58
Multimodal
Interfaces
Five Views
ViewDescriptions
BasicViewEntities
InterViewRelations
IntraViewRelations
The MMI team
MMI_05
59
Multimodal
Interfaces
Demo
MMI_05
60
Multimodal
Interfaces
MMI_05
61
Multimodal
Interfaces
MMI_05
62
Multimodal
Interfaces
Information visualization
What is Information Visualization?
Visualize:
Visualize:
Transformation
...
finding the artificial memory that best supports our natural means
of perception.' (Bertin, 1983)
The
MMI_05
63
Multimodal
Interfaces
etc.
MMI_05
64
Multimodal
Interfaces
about Information
Communicate
Explain
Make
Decisions
Reason about Information
visual comparisons
Tell stories about the data
The MMI team
MMI_05
65
Multimodal
Interfaces
MMI_05
66
Multimodal
Interfaces
9
9
9
9
approaches.
Distortion-oriented
Approaches:
MMI_05
67
Multimodal
Interfaces
MMI_05
68
Distortion-based Techniques
Multimodal
Interfaces
function.
Combination of detailed view and two distorted side views.
MMI_05
69
Distortion-based Techniques
Multimodal
Interfaces
MMI_05
70
Distortion-based Techniques
Multimodal
Interfaces
Fisheye View
Basic
MMI_05
71
Multimodal
Interfaces
an overview of a collection
show user what aspects of their interests are present in a collection
help user understand why documents retrieved as a result of a
query
MMI_05
72
Multimodal
Interfaces
http://www.kartoo.com/
Grokker
http://www.groxis.com/service/grok
MMI_05
73
Multimodal
Interfaces
MMI_05
74