Hadoop Administration Certification Training

Live Online & Classroom Certification Training

So your organization or your client has already installed a Hadoop Cluster and now looking to manage it throughout it's lifecycle ? Or you simply want to upgrade your skills to become a hadoop admin. Whatever may be the case, this course is an excellent introduction and deep dive into all the aspects of Hadoop Administration. Back up recovery, balancer, distcp and rack awareness are some of the topics deal with in a practical manner in this course.

(4.5) 127 Learners
Instructed by SPRINGPEOPLE
INDIA
  • 29
    Jul
    4 Days
    Online, 29-Jul to 06-Aug (Saturday - Sunday), LVC (09:30 AM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00
  • 14
    Aug
    6 Days
    Online, 14-Aug to 19-Aug (Monday - Saturday), LVC (08:30 PM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00
  • 21
    Aug
    2 Days
    Bangalore, 21-Aug to 22-Aug (Monday - Tuesday), Classroom (09:00 AM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00
  • 26
    Aug
    4 Days
    Online, 26-Aug to 03-Sep (Saturday - Sunday), LVC (09:30 AM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00
  • 21
    Sep
    6 Days
    Online, 21-Sep to 27-Sep (Thursday - Wednesday), LVC (07:00 AM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00
  • 25
    Sep
    6 Days
    Online, 25-Sep to 30-Sep (Monday - Saturday), LVC (08:30 PM Start) ₹19,950.00  Early Bird Offer: ₹17,950.00

Course Description

Overview

Hadoop Administration training for System Administrators is designed for technical operations personnel whose job is to install and maintain production Hadoop clusters in real world. We will cover Hadoop architecture and its components installation process monitoring and troubleshooting of the complex Hadoop issues. The training is focused on practical hands-on exercises and encourages open discussions of how people are using Hadoop in enterprises dealing with large data sets.

Objective

At the end of Hadoop Administration training course, the participants will:

  • Understand Hadoop main components and Architecture
  • Be comfortable working with Hadoop Distributed File System
  • Understand MapReduce abstraction and how it works
  • Plan Hadoop cluster
  • Deploy and administer Hadoop cluster
  • Optimize Hadoop cluster for the best performance based on specific job requirements
  • Monitor a Hadoop cluster and execute routine administration procedures
  • Deal with Hadoop component failures and recoveries
  • Get familiar with related Hadoop projects: Hbase, Hive and Pig
  • Know best practices of using Hadoop in enterprise world

Suggested Audience

System Administrators and Support Engineers who will maintain and troubleshoot Hadoop clusters in production or development environments.

Duration - 2 Days

Prerequisites

Basic knowledge of unix and system administration. Prior knowledge of Hadoop is not required.

Course Curriculum

Expand All
  • The amount of data processing in today's life
  • What Hadoop is why it is important?
  • Hadoop comparison with traditional systems
  • Hadoop history
  • Hadoop main components and architecture
  • HDFS overview and design
  • HDFS architecture
  • HDFS file storage
  • Component failures and recoveries
  • Block placement
  • Balancing the Hadoop cluster
  • Planning a Hadoop cluster and its capacity
  • Hadoop software and hardware configuration
  • HDFS Block replication and rack awareness
  • Network topology for Hadoop cluster
  • Different Hadoop deployment types
  • Hadoop distribution options
  • Hadoop competitors
  • Hadoop installation procedure
  • Distributed cluster architecture
  • Lab: Hadoop Installation
  • Ways of accessing data in HDFS
  • Common HDFS operations and commands
  • Different HDFS commands
  • Internals of a file read in HDFS
  • Data copying with 'distcp'
  • Lab: Working with HDFS
  • What MapReduce is and why it is popular
  • The Big Picture of the MapReduce
  • MapReduce process and terminology
  • MapReduce components failures and recoveries
  • Working with MapReduce
  • Hadoop configuration overview and important configuration file
  • Configuration parameters and values
  • HDFS parameters MapReduce parameters
  • Hadoop environment setup
  • 'Include' and 'Exclude' configuration files
  • Lab: MapReduce Performance Tuning
  • Namenode/Datanode directory structures and files
  • File system image and Edit log
  • The Checkpoint Procedure
  • Namenode failure and recovery procedure
  • Safe Mode
  • Metadata and Data backup
  • Potential problems and solutions / what to look for
  • Adding and removing nodes
  • Lab: MapReduce File system Recovery
  • Best practices of monitoring a Hadoop cluster
  • Using logs and stack traces for monitoring and troubleshooting
  • Using open-source tools to monitor Hadoop cluster
  • How to schedule Hadoop Jobs on the same cluster
  • Default Hadoop FIFO Schedule
  • Fair Scheduler and its configuration
  • Hadoop Multi Node Cluster Setup using Amazon ec2 - Creating 4 node cluster setup
  • Running Map Reduce Jobs on Cluster

Certification

SpringPeople works with top industry experts to identify the leading certification bodies on different technologies - which are well respected in the industry and globally accepted as clear evidence of a professional’s “proven” expertise in the technology. As such, these certification are a high value-add to the CVs and can give a massive boost to professionals in their career/professional growth.

Our certification courses are fully aligned to these high-profile certification exams; at the end of the course, participants will have detailed knowledge, be eligible and be fully ready take up these certification exams and pass with flying colours.

 

Resources

Hadoop Administration Introduction Slides

Technology Introduction Slides

Course Testimonial

SpringPeople Corporate Learning Center

Job Trends

About the Instructor

Founded in 2009, SpringPeople is a global premier eLearning marketplace for Online Live, Instructor-led classes in the region. It is a certified training delivery partner of leading technology creators, namely Pivotal, Elastic, Lightbend, EMC, VMware, MuleSoft, RSA, and... Read More


Course Rating and Reviews

4.5

Average Rating
5 Stars
3
4 Stars
3
3 Stars
0
2 Stars
0
1 Star
0

SPRINGPEOPLE SpringPeople Trainer

Amit Kumar Das

DBA2
INTUIT
Course:
Instructor:
Course Material:
Class Experience:
NA

SPRINGPEOPLE SpringPeople Trainer

Lokesh

Course:
Instructor:
Course Material:
Class Experience:
NA

SPRINGPEOPLE SpringPeople Trainer

Tapas Mishra

Intuit
Course:
Instructor:
Course Material:
Class Experience:
Trainer is exceptionally well. Very good teaching style.

This class is intended for participants with some prior exposure to the technology and are now looking to build up their expertise on the topic.

On successful completion of the course, participants will be eligible to sit of the related certification exam (see course overview). All participants receive a course completion certificate, demonstrating their expertise on the subject.

Total duration of the online, live instructor led sessions. Sessions are typically delivered as short lectures (2-hrs weekdays/3-hrs weekends) and detailed hands-on guidance.

Expected offline lab work hours that participants will need to complete and submit to the trainer, during and after the instructor-led online sessions.

  1. We are happy to refund full fee paid - no questions asked - should you feel that the training is not up to your expectations.
  2. Our dedicated team of expert training enablement advisors are available on email, phone and chat to assist you with your queries.
  3. All courseware, including session recordings, will always be available to access to you for future reference and rework.

Contact Us

+91-80-6567-9700 (BLR)

training@springpeople.com

Request Call Back

Related Courses

Recently Viewed