You are on page 1of 6

Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh

Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386


Email: contact@rstrainings.com

Course Content:

Course Objective Summary

During this course, you will learn:

Introduction to Big Data and Analytics
Introduction to Hadoop
Hadoop ecosystem - Concepts
Hadoop Map-reduce concepts and features
Developing the map-reduce Applications
Pig concepts
Hive concepts
Sqoop concepts
Flume Concepts
Oozie workflow concepts
Impala Concepts
Hue Concepts
HBASE Concepts
ZooKeeper Concepts
Real Life Use Cases

Reporting Tool

Tableau

1. Virtualbox/VM Ware

Basics
Installations
Backups
Snapshots

2. Linux

Basics
Installations
Commands

3. Hadoop

Why Hadoop?
Scaling
Distributed Framework

Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh
Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386
Email: contact@rstrainings.com

Hadoop v/s RDBMS
Brief history of hadoop

4. Setup hadoop

Pseudo mode
Cluster mode
Ipv6
Ssh
Installation of java, hadoop
Configurations of hadoop
Hadoop Processes ( NN, SNN, JT, DN, TT)
Temporary directory
UI
Common errors when running hadoop cluster, solutions

5. HDFS- Hadoop distributed File System

HDFS Design and Architecture
HDFS Concepts
Interacting HDFS using command line
Interacting HDFS using Java APIs
Dataflow
Blocks
Replica

6. Hadoop Processes

Name node
Secondary name node
Job tracker
Task tracker
Data node

7. Map Reduce

Developing Map Reduce Application
Phases in Map Reduce Framework
Map Reduce Input and Output Formats
Advanced Concepts
Sample Applications
Combiner

8. Joining datasets in Mapreduce jobs

Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh
Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386
Email: contact@rstrainings.com


Map-side join
Reduce-Side join

9. Map reduce customization

Custom Input format class
Hash Partitioner
Custom Partitioner
Sorting techniques
Custom Output format class

10. Hadoop Programming Languages :-

I.HIVE

Introduction
Installation and Configuration
Interacting HDFS using HIVE
Map Reduce Programs through HIVE
HIVE Commands
Loading, Filtering, Grouping.
Data types, Operators..
Joins, Groups.
Sample programs in HIVE

II. PIG

Basics
Installation and Configurations
Commands.

OVERVIEW HADOOP DEVELOPER

11. Introduction

12. The Motivation for Hadoop

Problems with traditional large-scale systems
Requirements for a new approach

13. Hadoop: Basic Concepts


Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh
Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386
Email: contact@rstrainings.com

An Overview of Hadoop
The Hadoop Distributed File System
Hands-On Exercise
How MapReduce Works
Hands-On Exercise
Anatomy of a Hadoop Cluster
Other Hadoop Ecosystem Components

14. Writing a MapReduce Program

The MapReduce Flow
Examining a Sample MapReduce Program
Basic MapReduce API Concepts
The Driver Code
The Mapper
The Reducer
Hadoops Streaming API
Using Eclipse for Rapid Development
Hands-on exercise
The New MapReduce API

15. Common MapReduce Algorithms

Sorting and Searching
Indexing
Machine Learning With Mahout
Term Frequency Inverse Document Frequency
Word Co-Occurrence
Hands-On Exercise.

16.PIG Concepts..

Data loading in PIG.
Data Extraction in PIG.
Data Transformation in PIG.
Hands on exercise on PIG.

17. Hive Concepts.

Hive Query Language.
Alter and Delete in Hive.
Partition in Hive.
Indexing.
Joins in Hive.Unions in hive.
Industry specific configuration of hive parameters.

Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh
Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386
Email: contact@rstrainings.com

Authentication & Authorization.
Statistics with Hive.
Archiving in Hive.
Hands-on exercise

18. Working with Sqoop

Introduction.
Import Data.
Export Data.
Sqoop Syntaxs.
Databases connection.
Hands-on exercise

19. Working with Flume

Introduction.
Configuration and Setup.
Flume Sink with example.
Channel.
Flume Source with example.
Complex flume architecture.

20. OOZIE Concepts
21. IMPALA Concepts
22. HUE Concepts
23. HBASE Concepts
24. ZooKeeper concepts

Reporting Tool..

Tableau

This course is designed for the beginner to intermediate-level Tableau user. It is for anyone who works
with data regardless of technical or analytical background. This course is designed to help you
understand the important concepts and techniques used in Tableau to move from simple to complex
visualizations and learn how to combine them in interactive dashboards.

Course Topics

Overview

What is visual analysis?

Address: FlatNO: 402, Nukala Residency, Jaihind Enclave, Madhapur, Hyderabad-81, Andhra Pradesh
Phone No: INDIA: Tel 040-64104169 Mobile +91-9052699906, USA: 909-666-5386
Email: contact@rstrainings.com

Strengths/weakness of the visual system.

Laying the Groundwork for Visual Analysis

Analytical Process
Preparing for analysis

Getting, Cleaning and Classifying Your Data

Cleaning, formatting and reshaping.
Using additional data to support your analysis.
Data classification

Visual Mapping Techniques

Visual Variables : Basic Units of Data Visualization
Working with Color
Marks in action: Common chart types

Solving Real-World Problems with Visual Analysis

Getting a Feel for the Data- Exploratory Analysis.
Making comparisons
Looking at (co-)Relationships.
Checking progress.
Spatial Relationships.
Try, try again.

Communicating Your Findings

Fine-tuning for more effective visualization
Storytelling and guided analytics
Dashboards

You might also like