Professional Documents
Culture Documents
the Cloud
Jianlin Feng
School of Software
SUN YAT-SEN UNIVERSITY
Jun 5, 2009
What is the Cloud?
Fault Tolerance
If a query must restart each time a node fails, then long, complex
queries are difficult to complete.
Efficiency
MapReduce is good for brute-force scan over unstructured
data such as text documents.
Parallel DBMS is good for selective access of structured
data.
Fault Tolerance
MapReduce takes it as a high priority.
Most parallel DBMS restart a query upon a faiure.
Ability to run in a heterogeneous environment.
MapReduce does well.
Parallel DBMS are generally designed to run in a
homogeneous environment.
MapReduce vs. Parallel DBMS (2)