Apache Spark and Scala Certification Training

Live Online & Classroom Certification Training

Apache Spark is a fast, in-memory distributed data processing framework written in Scala. In this Apache Spark and Scala training course, you will learn Scala's programming model in detail and gain hands-on experience with near-real-time data analytics through practical examples in Spark and Scala.

(4.9) 56 Learners
Instructed by JAYMIN
INDIA

No Public/Open-house class on the topic scheduled at the moment!

Course Description

Overview

The Apache Spark and Scala training module teaches you to build Spark applications in Scala. It gives a clear comparison between Spark and Hadoop and covers techniques for improving application performance and enabling high-speed processing.

Objective

After completing the 'Apache Spark & Scala' course, you will be able to:

  • Understand Scala and its implementation
  • Apply lazy values, control structures, loops, collections, etc.
  • Learn the concepts of traits and OOP in Scala
  • Understand functional programming in Scala
  • Get an insight into Big Data challenges
  • Understand how Spark acts as a solution to these challenges
  • Install Spark and run Spark operations in the Spark shell
  • Understand what RDDs are in Spark
  • Run a Spark application on YARN (Hadoop)
  • Analyze the Hive and Spark SQL architecture

Duration - 5 Days

Prerequisites

  • Basic familiarity with Linux or Unix
  • Intermediate-level knowledge of Hadoop

Course Curriculum

  • Why second generation frameworks?
  • Introduction to Spark
  • Scala shell
  • Spark Architecture
  • Spark on Cluster
  • Spark Core
  • SparkSQL
  • Spark Streaming
  • Cluster Managers
  • Spark Users
  • What is Spark used for?
  • Spark Versions
  • Spark Storage Layers
  • Download Spark
  • A. Why second generation frameworks?
  • B. The Driver
  • C. Executors
  • D. Execution components: jobs, tasks, stages
  • E. Spark Web UI
  • A. Standalone Cluster Manager
  • B. Hadoop YARN
  • C. Apache Mesos
  • D. Amazon EC2
  • E. Which Cluster Manager?
  • F. Spark-submit for deploying applications
  • G. Using Maven for a Java Spark application
  • H. Using SBT for a Scala application
  • Different file formats:
  • Text files
  • JSON
  • Comma- and tab-separated values
  • Object files
  • Sequence files
  • Input/output formats
  • Spark SQL for structured data
  • What is RDD
  • Why RDD?
  • RDD operations
  • Transformations
  • Actions
  • Lazy Evaluation
  • Basic RDDs
  • Caching
  • Converting between RDD types
  • Spark API support for Python, Java, and Scala
  • Working with key-value pairs
  • Creating key-value pair RDDs
  • Aggregations
  • Grouping data
  • Joins
  • Sorting data
  • RDD’s partitioner
  • Operations from partitioning
  • Page Rank example
  • Aggregate
  • Fold
  • Map partitions
  • Glom
  • Accumulators
  • Broadcast variables
  • Anatomy of a spark RDD
  • Splits
  • Localization
  • Serialization
  • Transformations Vs. Actions
  • Spark SQL in applications
  • Spark SQL initialization
  • Spark SQL basic query
  • SchemaRDDs
  • Caching
  • Load data from Hive
  • Load data from JSON
  • Load data from RDDs
  • Beeline
  • Long-lived tables and queries
  • Query hands-on
  • Spark SQL UDFs
  • Performance
  • Streaming Architecture
  • Two types of transformations:
  •    1. Stateless transformations
  •    2. Stateful transformations
  • Streaming UI
  • Sources: Input
  • Core Sources
  • Additional Sources
  • Multiple Sources
  • Cluster Sizing
  • Driver Fault Tolerance
  • Worker Fault Tolerance
  • Receiver Fault Tolerance
  • 24/7 operation
  • Performance
  • Garbage collection
  • Memory Usage

Certification

SpringPeople works with top industry experts to identify the leading certification bodies for different technologies: those that are well respected in the industry and globally accepted as clear evidence of a professional’s proven expertise in the technology. As such, these certifications add significant value to a CV and can give professionals a substantial boost in their career growth.

Our certification courses are fully aligned to these high-profile certification exams; at the end of the course, participants will have detailed knowledge of the subject and be eligible and fully prepared to take these certification exams and pass with flying colours.

Course Rating and Reviews

4.9 Average Rating

  • 5 Stars: 12
  • 4 Stars: 2
  • 3 Stars: 0
  • 2 Stars: 0
  • 1 Star: 0

JAYMIN SpringPeople Trainer

Prathima G S

Enterprise Integration With Spring (SpringSource Certified)
Brocade Communications Systems Private Ltd
Excellent training on Spark for beginners, with in-depth explanation of HDFS and hands-on work with MapReduce and Spark.

Akash B

Big Data Specialist
Wipro Technologies

Ranajit Jana

Good training overall. Jaymin has deep knowledge of the ecosystem. Thanks for sharing the same. The instructor’s capability did make a difference. Thanks for doing all the groundwork.

This class is intended for participants who have some prior exposure to the technology and are now looking to build up their expertise on the topic.

On successful completion of the course, participants will be eligible to sit for the related certification exam (see course overview). All participants receive a course completion certificate, demonstrating their expertise on the subject.

Total duration of the online, live, instructor-led sessions. Sessions are typically delivered as short lectures (2 hours on weekdays, 3 hours on weekends) with detailed hands-on guidance.

Expected hours of offline lab work that participants will need to complete and submit to the trainer during and after the instructor-led online sessions.

  1. We are happy to refund the full fee paid, no questions asked, should you feel that the training is not up to your expectations.
  2. Our dedicated team of expert training enablement advisors is available by email, phone, and chat to assist you with your queries.
  3. All courseware, including session recordings, will always be available to you for future reference and rework.

Contact Us

+91-80-6567-9700 (BLR)

training@springpeople.com
