Hadoop Online Training

Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity hardware. Browse through the contents of the program.

Hadoop Course Contents
VirtualBox/VMware
Basics
Installations
Backups
Snapshots
Cloudera VM
Installations
Hadoop
Why Hadoop
Scaling
Distributed Framework
Hadoop vs. RDBMS
Brief history of Hadoop
Problems with traditional large-scale systems
Requirements for a new approach
Anatomy of a Hadoop cluster
Other Hadoop Ecosystem components
Setup Hadoop
Pseudo mode
Cluster mode
Installation of Java
Hadoop
Configurations of Hadoop
Hadoop Processes (NN, SNN, JT, DN, TT)
Temporary directory
UI
Common errors when running Hadoop cluster
Solutions
HDFS: Hadoop Distributed File System
HDFS design and architecture
HDFS concepts
Interacting with HDFS using the command line
Dataflow
Blocks
Replicas
Hadoop Processes
Name node
Secondary name node
Job tracker
Task tracker
Data node
Developing MapReduce applications
Phases in MapReduce framework
MapReduce input and output formats
Advanced concepts
Sample applications
Combiner
Writing a MapReduce Program
The MapReduce flow
Examining a sample MapReduce program
Basic MapReduce API concepts
Driver code
Mapper
Reducer
Hadoop’s streaming API
Using Eclipse for rapid development
Hands-on exercise
New MapReduce API
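As a preview of the MapReduce flow covered above (mapper, shuffle, reducer, driver), here is a minimal word-count sketch in plain Python. It does not use Hadoop or its API; the function names map_phase, shuffle_phase, reduce_phase and word_count are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle and sort: group all values by key, with keys in sorted order."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return sorted(groups.items())


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Driver: wire the three phases together for a list of input lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle_phase(mapped))


if __name__ == "__main__":
    print(word_count(["big data on Hadoop", "Hadoop stores big data"]))
```

In a real cluster each phase runs in parallel across many nodes; this single-process version only shows how data moves between the phases.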
Common MapReduce Algorithms
Sorting and Searching, Indexing
Term Frequency – Inverse Document Frequency
Word Co-occurrence
Hands-on exercise
Writing advanced MapReduce programs
Building multivalue writable data
Accessing and using counters
Partitioner: HashPartitioner
Hands-on exercises
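To give a feel for the partitioner topic, the sketch below mimics in plain Python the rule Hadoop's HashPartitioner applies: clear the sign bit of the key's hash code, then take it modulo the number of reduce tasks. As an assumption for illustration, keys are hashed the way Java hashes a String (Hadoop's Text type hashes its bytes differently), and only characters in the Basic Multilingual Plane are handled. The names java_string_hash and hash_partition are hypothetical.

```python
def java_string_hash(text):
    """Emulate Java's String.hashCode(): h = 31 * h + char, in signed 32-bit.

    Assumption: every character is in the Basic Multilingual Plane, so one
    Python character corresponds to one Java char.
    """
    h = 0
    for ch in text:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF  # keep the low 32 bits
    # Reinterpret the unsigned 32-bit value as a signed Java int.
    return h - 0x100000000 if h >= 0x80000000 else h


def hash_partition(key, num_reduce_tasks):
    """Pick a reducer index: (hash & Integer.MAX_VALUE) % numReduceTasks."""
    return (java_string_hash(key) & 0x7FFFFFFF) % num_reduce_tasks


if __name__ == "__main__":
    for word in ["hadoop", "hive", "pig", "hbase"]:
        print(word, "-> reducer", hash_partition(word, 4))
```

Masking the sign bit keeps the index non-negative even when the hash code is negative, and every occurrence of the same key always lands on the same reducer.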
Hadoop Programming Languages
Hive: Introduction
Installation
Configuration
Interacting with HDFS using Hive
MapReduce programs through Hive
Hive commands
Loading, Filtering
Grouping
Data types
Operators
Joins
Groups
Sample programs in Hive
Pig: Basics
Configuration
Commands
Loading
Filtering
Grouping
Data types
Operators
Joins
Groups
Sample programs in Pig
HBase
What is HBase
HBase architecture
HBase API
Managing large data sets with HBase
Using HBase in Hadoop applications
Integrating Hadoop into the Enterprise Workflow
Integrating Hadoop into an Existing Enterprise
Loading Data from an RDBMS into HDFS by Using Sqoop
Managing Real-Time Data Using Flume
Contact us today at info@techflames.com or call us at +91-9100947004.