Hadoop Development

Hadoop Development

Course Curriculum

Introduction to Big data &Hadoop
Hadoop and its features
Hadoop Ecosystem
Hadoop 1.x core components (Daemons)
Hadoop 2.x core components (Daemons)
Read and Write anatomy in Hadoop
Understanding Rack Awareness concept
Understanding Hadoop 2.x Cluster Architecture &different cluster modes
Hadoop 2.x Cluster Architecture –Federation &High Availability
Understanding parameters in Hadoop 2.x configuration files
Password-less SSH on Hadoop cluster
Understanding Hadoop 2.x environment
Implement basic Hadoop commands on Terminal MR dump
Introduction to MapReduce and its applications
MapReduce paradigm
Hadoop 2.x MapReduce architecture and its components
Execution flow of YARN MapReduce application
Running a MapReduce Program
Basic & Advance level of MapReduce programming
Combiner & Partitioner
Data Encryption
Map-side Vs. Reduce-side Joins
Custom Data Types
Input & Output Formats
Hadoop Counters
Distributed Cache
MRUnit Testing
Introduction to PIG
PIG Use cases
Understanding PIG Data flow &Program Structure
PIG –Running Modes
PIG Latin Language
Basic Data Types &Data Models
Pig Latin Relational Operators
Diagnostic Operators
PIG UDF Statements
Introduction to Hive and its Use Cases
Difference between Hive and PIG
Understanding Hive Architecture and Hive Components
Limitations of Hive
Primitive and Complex types in Hive
HiveData Models
BasicHiveoperations Hive scripts
Hive UDFs
Introduction to HBase
CAP Theorem &NoSQL Landscape
HBase use cases
HBase Major Components
HBase Storage Architecture
Read & Write anatomy of HBase
Simple Cluster Deployment
Understanding Compactions
HBase Attributes
Data Model and Physical Storage in HBase
HBase shell
Data Loading Techniques in HBase
Implement HBase API
Zookeeper Data Model and its Services
Relationship between HBase and Zookeeper Advance HBase Actions
Data loading techniques (Flume &Sqoop)
Apache Oozie
Search Course
Quick Contact
Top