Hortonworks Certified HDP Developer: Enterprise Apache Spark I Training

Live Online & Classroom Certification Training

This course is for you if you want to become a Hortonworks Certified Spark Developer. The course will give you both, the fundamental concepts and hands on practical experience in Hortonworks Development platform (HDP), specially on the Apache Spark framework. Technical aspects of the framework like RDDs, Actions, Transformations, single and multi node cluster installations are taken up in great detail in this course

(4.7) 165 Learners
Instructed by SPRINGPEOPLE
INDIA

No Public/Open-house class on the topic scheduled at the moment!

Course Description

Overview

This course is designed as an entry point for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. Topics include: An overview of the Hortonworks Data Platform (HDP) including HDFS and YARN; using Spark Core APIs for interactive data exploration; Spark SQL and DataFrame operations; Spark Streaming and DStream operations; data visualization reporting and collaboration; performance monitoring and tuning; building and deploying Spark applications; and an introduction to the Spark Machine Learning Library.

Objective

  • Describe Hadoop, HDFS, YARN, and the HDP ecosystem
  • Describe Spark use cases
  • Explore and manipulate data using Zeppelin
  • Explore and manipulate data using a Spark REPL
  • Explain the purpose and function of RDDs
  • Use Spark Streaming stateless and window transformations
  • Visualize data, generate reports, and collaborate using Zeppelin
  • Monitor Spark applications using Spark History Server
  • Learn general application optimization guidelines\/tips
  • Use data caching to increase performance of applications
  • Explain and use the various Hive file formats
  • Build and package Spark applications
  • Use Hive to run SQL-like queries to perform data analysis
  • Deploy applications to the cluster using YARN
  • Understand the purpose of Spark MLlib

Prerequisites

Students should be familiar with programming principles and have previous experience in software development using either Python or Scala. Previous experience with data streaming, SQL, and HDP is also helpful, but not required.

Course Curriculum

Expand All
  • Describe Hadoop, HDFS, YARN, and the HDP ecosystem
  • Describe Spark use cases
  • Explore and manipulate data using Zeppelin
  • Explore and manipulate data using a Spark REPL
  • Explain the purpose and function of RDDs
  • Use Spark Streaming stateless and window transformations
  • Visualize data, generate reports, and collaborate using Zeppelin
  • Monitor Spark applications using Spark History Server
  • Learn general application optimization guidelines/tips
  • Use data caching to increase performance of applications
  • Explain and use the various Hive file formats
  • Build and package Spark applications
  • Use Hive to run SQL-like queries to perform data analysis
  • Deploy applications to the cluster using YARN
  • Understand the purpose of Spark MLlib
  • Labs can be performed using either Python or Scala
  • Use common HDFS commands
  • Use a REPL to program in Spark
  • Use Zeppelin to program in Spark
  • Perform RDD transformations and actions
  • Perform Pair RDD transformations and actions
  • Utilize Spark SQL
  • Perform stateless transformations using Spark Streaming
  • Perform window-based transformations
  • Use Zeppelin for data visualization and reporting
  • Monitor applications using Spark History Server
  • Cache and persist data
  • Configure checkpointing, broadcast variables, and executors
  • Build and submit a Spark application to YARN
  • Run Spark MLlib applications

Certification

SpringPeople works with top industry experts to identify the leading certification bodies on different technologies - which are well respected in the industry and globally accepted as clear evidence of a professional’s “proven” expertise in the technology. As such, these certification are a high value-add to the CVs and can give a massive boost to professionals in their career/professional growth.

Our certification courses are fully aligned to these high-profile certification exams; at the end of the course, participants will have detailed knowledge, be eligible and be fully ready take up these certification exams and pass with flying colours.

 

Resources

About the Instructor

Founded in 2009, SpringPeople is a global premier eLearning marketplace for Online Live, Instructor-led classes in the region. It is a certified training delivery partner of leading technology creators, namely Pivotal, Elastic, Lightbend, EMC, VMware, MuleSoft, RSA, and... Read More


Course Rating and Reviews

4.7

Average Rating
5 Stars
28
4 Stars
12
3 Stars
1
2 Stars
0
1 Star
0

SPRINGPEOPLE SpringPeople Trainer

Narasimha

Lead
TTT
Course:
Instructor:
Course Material:
Class Experience:
Excellent training with good walkthrough of many examples.Trainer is very helpful in clearing our doubts.

SPRINGPEOPLE SpringPeople Trainer

Mahender Pandiri

Course:
Instructor:
Course Material:
Class Experience:
NA

SPRINGPEOPLE SpringPeople Trainer

Vinayak Suryawanshi

Experis
Course:
Instructor:
Course Material:
Class Experience:
Trainer can ask few queries related to subject to students randomly during session so that they will attend carefully. :)

This class is intended for participants with some prior exposure to the technology and are now looking to build up their expertise on the topic.

On successful completion of the course, participants will be eligible to sit of the related certification exam (see course overview). All participants receive a course completion certificate, demonstrating their expertise on the subject.

Total duration of the online, live instructor led sessions. Sessions are typically delivered as short lectures (2-hrs weekdays/3-hrs weekends) and detailed hands-on guidance.

Expected offline lab work hours that participants will need to complete and submit to the trainer, during and after the instructor-led online sessions.

  1. We are happy to refund full fee paid - no questions asked - should you feel that the training is not up to your expectations.
  2. Our dedicated team of expert training enablement advisors are available on email, phone and chat to assist you with your queries.
  3. All courseware, including session recordings, will always be available to access to you for future reference and rework.

Contact Us

+91-80-6567-9700 (BLR)

training@springpeople.com

Request Call Back

Related Courses

Recently Viewed