You are on page 1of 3

Hadoop Basic------------Introduction to Big Data and Hadoop:

- What is Big Data?


- Big Data in Use cases
- Big data 3 Vs concepts
- Big data diff data types--- structured, unstructured and semi structured
- RDBMS vs Hadoop
- Hadoop Ecosystem
- Various Hadoop Distributions
- Hadoop Cluster(HDFS & MapReduce concepts)
Apache Hadoop Installation on Ubuntu 14.4.1
- Installation of Hadoop on Ubuntu
-Installation of Mysql
-Installation of Eclipse and Maven
HDFS: Hadoop Distributed File System:
- HDFS Architecture
- HDFS Writes
- HDFS Reads
- Rack awareness
- Fault Tolerance
- NameNode
- Secondary NameNode
- Interact with HDFS
- HDFS Commands
- HDFS Java API
- HDFS REST API

- Hands on Exercises
MapReduce:
- Mapreduce architecture
- Mapreduce Model
- Mapreduce Framework and Various phases
- Input/Output formats
- Partiioner and combiner concepts with example
- Map Reduce Hands On Programming with word count example run on Ubuntu
- Distributed Cache
- Map side join Vs Reduce Side Join
- Hands on Exercises
Pig:
- Pig philosophy and architecture
- Installation of PIG
- Grunt shell
- Loading data
- Exploring Pig Latin commands
- Pig Transformations
--- UDF function in java in Pig
- Hands on Exercises
Hive:
- Hive architecture
-- Installation of Hive
- Hive vs RDBMS
- HiveQL and the Hive shell
- Data types and schemas

- Creating tables (external vs managed)


- Creating Partitions
-- creating bucketing
- Creating Views and Indexes
-- UDF function in java in Hive
-- Hive storage format-- RCfile,ORCfile,JSON,XML

You might also like