Hortonworks Certified Hdp Analyst: Data Science Training

Live Online & Classroom Training/Tutorial

This course provides some of the vital aspects of data science, including machine learning and natural language processing. Tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikitlearn), the Natural Language Toolkit (NLTK), and Spark MLlib are also included in this course

(4.7) 216 Learners
Instructed by Springpeople
INDIA
  • 20
    Mar
    3 Days
    Bangalore, 20-Mar to 22-Mar (Monday - Wednesday), Classroom (09:00 AM Start) ₹54,950.00  Early Bird Offer: ₹49,950.00

View All

Course Description

Overview

This course Provides instruction on the processes and practice of data science, including machine learning and natural language processing. Included are: tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikitlearn), the Natural Language Toolkit (NLTK), and Spark MLlib.

Objective

Recognize use cases for data science on Hadoop Describe the Hadoop and YARN architecture Describe supervised and unsupervised learning differences Use Mahout to run a machine learning algorithm on Hadoop Describe the data science life cycle Use Pig to transform and prepare data on Hadoop Write a Python script Describe options for running Python code on a Hadoop cluster Write a Pig User-Defined Function in Python Use Pig streaming on Hadoop with a Python script Use machine learning algorithms Describe use cases for Natural Language Processing (NLP) Use the Natural Language Toolkit (NLTK) Describe the components of a Spark application Write a Spark application in Python Run machine learning algorithms using Spark MLlib Take data science into production

Prerequisites

Students must have experience with at least one programming or scripting language, knowledge in statistics and/or mathematics, and a basic understanding of big data and Hadoop principles. Students new to Hadoop are encouraged to attend the HDP Overview: Apache Hadoop Essentials course.

Course Curriculum

Expand All
  • Recognize use cases for data science on Hadoop
  • Describe the Hadoop and YARN architecture
  • Describe supervised and unsupervised learning differences
  • Use Mahout to run a machine learning algorithm on Hadoop
  • Describe the data science life cycle
  • Use Pig to transform and prepare data on Hadoop
  • Write a Python script
  • Describe options for running Python code on a Hadoop cluster
  • Write a Pig User-Defined Function in Python
  • Use Pig streaming on Hadoop with a Python script
  • Use machine learning algorithms
  • Describe use cases for Natural Language Processing (NLP)"
  • Use the Natural Language Toolkit (NLTK)
  • Describe the components of a Spark application
  • Run a machine learning algorithm on a distributed data set
  • Describe use cases for Natural Language Processing (NLP)
  • Perform sentence segmentation on a large body of text
  • Perform part-of-speech tagging
  • Use the Natural Language Toolkit (NLTK)
  • Write a Spark application in Python
  • Write a Spark application in Python
  • Run machine learning algorithms using Spark MLlib
  • Take data science into production
  • Lab: Setting Up a Development Environment
  • Demo: Block Storage
  • Lab: Using HDFS Commands
  • Demo: MapReduce
  • Lab: Using Apache Mahout for Machine Learning
  • Demo: Apache Pig
  • Lab: Getting Started with Apache Pig
  • Lab: Exploring Data with Pig
  • Lab: Using the IPython Notebook
  • Demo: The NumPy Package
  • Demo: The pandas Library
  • Lab: Data Analysis with Python
  • Lab: Interpolating Data Points
  • Lab: Defining a Pig UDF in Python
  • Lab: Streaming Python with Pig
  • Demo: Classification with Scikit-Learn
  • Lab: Computing K-Nearest Neighbor
  • Lab: Generating a K-Means Clustering
  • Lab: POS Tagging Using a Decision Tree
  • Lab:Using NLTK for Natural Language Processing
  • Lab: Classifying Text using Naive Bayes
  • Lab: Using Spark Transformations and Actions
  • Lab Using Spark MLlib
  • Lab: Creating a Spam Classifier with MLlib

Certification

SpringPeople works with top industry experts to identify the leading certification bodies on different technologies - which are well respected in the industry and globally accepted as clear evidence of a professional’s “proven” expertise in the technology. As such, these certification are a high value-add to the CVs and can give a massive boost to professionals in their career/professional growth.

Our certification courses are fully aligned to these high-profile certification exams; at the end of the course, participants will have detailed knowledge, be eligible and be fully ready take up these certification exams and pass with flying colours.

 

Resources

About the Instructor

Founded in 2009, SpringPeople is a global premier eLearning marketplace for Online Live, Instructor-led classes in the region. It is a certified training delivery partner of leading technology creators, namely Pivotal, Elastic, Lightbend, EMC, VMware, MuleSoft, RSA, and... Read More


Rating and Reviews

4.7

Average Rating
5 Stars
31
4 Stars
10
3 Stars
2
2 Stars
0
1 Star
0

Rating and Reviews

SPRINGPEOPLE SpringPeople Trainer

Saed Hammad

Course:
Instructor:
Course Material:
Class Experience:
overall is very good training :)

This class is intended for participants with some prior exposure to the technology and are now looking to build up their expertise on the topic.

On successful completion of the course, participants will be eligible to sit of the related certification exam (see course overview). All participants receive a course completion certificate, demonstrating their expertise on the subject.

Total duration of the online, live instructor led sessions. Sessions are typically delivered as short lectures (2-hrs weekdays/3-hrs weekends) and detailed hands-on guidance.

Expected offline lab work hours that participants will need to complete and submit to the trainer, during and after the instructor-led online sessions.

  1. We are happy to refund full fee paid - no questions asked - should you feel that the training is not up to your expectations.
  2. Our dedicated team of expert training enablement advisors are available on email, phone and chat to assist you with your queries.
  3. All courseware, including session recordings, will always be available to access to you for future reference and rework.

Contact Us

+91-80-6567-9700 (BLR)

training@springpeople.com

Contact Us

Related Courses

Recently Viewed