Hadoop Online Training
Get a Call
Welcome to the Hadoop Online Training of TGC India. of TGC India would be conducted in conjunction with our training partners, iMentors.
iMentors is an independent subsidiary of TGC INDIA that specialized in short term online training. Through the iMentors platform, the course of education and training has been greatly reduced and more tutors have been engaged from across the world.
The Hadoop Online Training include:
1. HADOOP ECOSYSTEM & CLUSTER
- Available version Hadoop 1.x & 2
- Available Distributions of Hadoop (Cloudera, Hortonworks)
- Hadoop Projects & Components
- Architecture of Hadoop & Planning for cluster
- The Hadoop Distributed File System (HDFS)
- Cluster Daemons & Its Functions.
- Name Node
- Secondary Node
- Data Nodes
- Application Master and Task Tracker
- YARN Responsibilities
- Deployment of Hadoop Cluster
2. CLOUDERA SANDBOX OR QUICK START
- Installation of cloudera quick start
- Difference in sandbox and distributed environment
- Overview of apache HUE
3. MAP-REDUCE, MAP-REDUCE STEAMING (IN JAVA)
- All Map-Reduce API Concepts
- Architecture of Map-Reduce
- Writing Map-Reduce Drivers, Mappers, and Reducers in Java
- Speeding Up Hadoop Development by Using Eclipse
- Differences between the Old and New Map-Reduce APIs
- Writing Mappers and Reducers with the Streaming API
- Different question raised for Map-Reduce
4. HBASE: THE HADOOP DATABASE
- Problems with RDBMS
- Introduction to HBase
- Non-RDBMS, Not-Only SQL or No-SQL
- Installation HBase& Deployment
- Types CRUD & Batch Operations
- Filters, Counters, Pool
- Rest Interface & Web-UI
5. Hadoop Shell and Commands
- Hadoop Developer commands using shell
- Map-Reduce job deployment
- Oozie workflow design
- Different Components Jobs design.
6. APACHE DRILL – REPLACEMENT OF MAP-REDUCE
- Installation of Drill
- Query data using apache drill
- Query data from Hadoop/HDFS file system
- Drill &Hbase integration
- Drill & Hive integration & Replacement
7. HCATALOG OR METASTORE TABLES
- Introduction of apache Hcatalog
- Creating tables using Hcatalog
- Bulk uploads using MetaStore Tables
- Play with semi-structured data
- Integration of Hcatalog with Hive
- Hive SQL query analysis
8. HIVE
- Problems with No-SQL Database
- Introduction & Installation Hive
- Hive Schema and Data Storage
- Data Types & Introduction to SQL
- Hive-SQL: DML & DDL
- Hive-SQL: Views & Indexes
- Explain and use the various Hive file formats
- Use Hive to run SQL-like queries to perform data analysis
- Use Hive to join data sets using a variety of techniques, including Mapside joins and Sort-Merge-Bucket joins
Integration to HBase& Cassandra - Sentiment Analysis and N-Grams
- Hive Thrift Service
9. FLUME
- Installation of Flume
- Ingesting Data from External Sources with Flume
- Configuration for flume
- REST Interfaces
- Best Practices for Importing Data
10. SQOOP
- Installation of Sqoop
- Ingesting Data from External (RDBMS) Sources with Sqoop
- Ingesting Data from/to Relational Databases with Sqoop
- Integration of Sqoop and HBase
- Integration of Sqoop and Hive
- Best Practices for Importing Data
11. CONCLUSION & FAQS
Note:
- Every Topic has practical session
- Hadoop uses different components which discussed in required
Sessions
- Hue
- Cloudera Manager
- Zookeeper
- Ooozie
- etc
This course is best suited to developers and engineers who have some or little bit programming experience. Knowledge of Java is not mandatory, Any programming language can be used with Hadoop and is required to complete the hands-on exercises.
Course |
Duration |
Course Fee (INR) |
Early bird Offer |
16 virtual classes |
25,000/- |
20,000/- |
Register Now & avail attractive discounts !!
Upcoming Batches:
Course Reviews
No Reviews found for this course.
0 Responses on Hadoop Online Training"