Hortonworks Certified HDP Developer: Java Training

Live Online & Classroom Training/Tutorial

Get certified with Hortonworks as HDP Developer: Java. This course covers all the aspect of the framework like MapReduce, HDFS, Pig, Hive, Hbase and MR Unit etc.

(4.7) 57 Learners
Instructed by Springpeople
INDIA
  • 27
    Feb
    4 Days
    Bangalore, 27-Feb to 02-Mar (Monday - Thursday), Classroom (09:00 AM Start) ₹64,950.00  Early Bird Offer: ₹59,950.00
  • 20
    Mar
    12 Days
    Online, 20-Mar to 01-Apr (Monday - Saturday), LVC (08:30 PM Start) ₹64,950.00  Early Bird Offer: ₹59,950.00

View All

Course Description

Overview

This advanced course provides Java programmers a deep-dive into Hadoop application development. Students will learn how to design and develop efficient and effective MapReduce applications for Hadoop using the Hortonworks Data Platform including how to implement combiners partitioners secondary sorts custom input and output formats joining large datasets unit testing and developing UDFs for Pig and Hive. Labs are run on a 7-node HDP 2.1 cluster running in a virtual machine that students can keep for use after the training.

Objective

At the completion of the course students will be able to:

  • Describe Hadoop 2 and the Hadoop Distributed File System
  • Describe the YARN framework
  • Develop and run a Java MapReduce application on YARN
  • Use combiners and in-map aggregation
  • Write a custom partitioner to avoid data skew on reducers
  • Perform a secondary sort
  • Recognize use cases for built-in input and output formats
  • Write a custom MapReduce input and output format
  • Optimize a MapReduce job
  • Configure MapReduce to optimize mappers and reducers
  • Develop a custom RawComparator class
  • Distribute files as LocalResources
  • Describe and perform join techniques in Hadoop
  • Perform unit tests using the UnitMR API
  • Describe the basic architecture of HBase
  • Write an HBase MapReduce application
  • List use cases for Pig and Hive
  • Write a simple Pig script to explore and transform big data
  • Write a Pig UDF (User-Defined Function) in Java
  • Write a Hive UDF in Java
  • Use JobControl class to create a MapReduce workflow
  • Use Oozie to define and schedule workflows

Prerequisites

Students must have experience developing Java applications and using a Java IDE. Labs are completed using the Eclipse IDE and Gradle. No prior Hadoop knowledge is required.

Course Curriculum

Expand All
  • At the completion of the course students will be able to:
  • Describe Hadoop 2 and the Hadoop Distributed File System
  • Describe the YARN framework
  • Develop and run a Java MapReduce application on YARN
  • Use combiners and in-map aggregation
  • Write a custom partitioner to avoid data skew on reducers
  • Perform a secondary sort
  • Recognize use cases for built-in input and output formats
  • Write a custom MapReduce input and output format
  • Optimize a MapReduce job
  • Configure MapReduce to optimize mappers and reducers
  • Develop a custom RawComparator class
  • Distribute files as LocalResources
  • Describe and perform join techniques in Hadoop
  • Perform unit tests using the UnitMR API
  • Describe the basic architecture of HBase
  • Write an HBase MapReduce application
  • List use cases for Pig and Hive
  • Write a simple Pig script to explore and transform big data
  • Write a Pig UDF (User-Defined Function) in Java
  • Write a Hive UDF in Java
  • Use JobControl class to create a MapReduce workflow
  • Use Oozie to define and schedule workflows
  • Configuring a Hadoop Development Environment
  • Putting data into HDFS using Java
  • Write a distributed grep MapReduce application
  • Write an inverted index MapReduce application
  • Configure and use a combiner
  • Writing custom combiners and partitioners
  • Globally sort output using the TotalOrderPartitioner
  • Writing a MapReduce job to sort data using a composite key
  • Writing a custom InputFormat class
  • Writing a custom OutputFormat class
  • Compute a simple moving average of stock price data
  • Use data compression
  • Define a RawComparator
  • Perform a map-side join
  • Using a Bloom filter
  • Unit testing a MapReduce job
  • Importing data into HBase
  • Writing an HBase MapReduce job
  • Writing User-Defined Pig and Hive functions
  • Defining an Oozie workflow

Certification

SpringPeople works with top industry experts to identify the leading certification bodies on different technologies - which are well respected in the industry and globally accepted as clear evidence of a professional’s “proven” expertise in the technology. As such, these certification are a high value-add to the CVs and can give a massive boost to professionals in their career/professional growth.

Our certification courses are fully aligned to these high-profile certification exams; at the end of the course, participants will have detailed knowledge, be eligible and be fully ready take up these certification exams and pass with flying colours.

 

Resources

SpringPeople Corporate Learning Center

Job Trends

About the Instructor

Founded in 2009, SpringPeople is a global premier eLearning marketplace for Online Live, Instructor-led classes in the region. It is a certified training delivery partner of leading technology creators, namely Pivotal, Elastic, Lightbend, EMC, VMware, MuleSoft, RSA, and... Read More


Rating and Reviews

4.7

Average Rating
5 Stars
28
4 Stars
12
3 Stars
1
2 Stars
0
1 Star
0

Rating and Reviews

SPRINGPEOPLE SpringPeople Trainer

Harekrushna

Course:
Instructor:
Course Material:
Class Experience:
i am happy

SPRINGPEOPLE SpringPeople Trainer

Nithyanand Vasudevan

Senior Software Engineer
Mindtree
Course:
Instructor:
Course Material:
Class Experience:
Network issues sometimes. But otherwise, it was well presented

SPRINGPEOPLE SpringPeople Trainer

Pankaj Rana

SSE
Mindtree Ltd
Course:
Instructor:
Course Material:
Class Experience:
Superb Teaching

This class is intended for participants with some prior exposure to the technology and are now looking to build up their expertise on the topic.

On successful completion of the course, participants will be eligible to sit of the related certification exam (see course overview). All participants receive a course completion certificate, demonstrating their expertise on the subject.

Total duration of the online, live instructor led sessions. Sessions are typically delivered as short lectures (2-hrs weekdays/3-hrs weekends) and detailed hands-on guidance.

Expected offline lab work hours that participants will need to complete and submit to the trainer, during and after the instructor-led online sessions.

  1. We are happy to refund full fee paid - no questions asked - should you feel that the training is not up to your expectations.
  2. Our dedicated team of expert training enablement advisors are available on email, phone and chat to assist you with your queries.
  3. All courseware, including session recordings, will always be available to access to you for future reference and rework.

Contact Us

+91-80-6567-9700 (BLR)

training@springpeople.com

Contact Us

Recently Viewed