Microtek Learning Logo

HDP Developer Apache Pig and Hive Training

4.7
(4.7)

This training is recommended for developers who know how to create apps in analyzing Big data stored in Apache Hadoop by utilizing Hive and Pig.

  • Category : Hortonworks

Course Price : $2495 Per Participant

Course Description

This training is recommended for developers who know how to create apps to analyze Big data stored in Apache Hadoop by utilizing Hive and Pig.

The topics also cover HDFS, data ingestion, Hadoop, data ingestion and workflow definition, and utilizing Pig and Hive in performing data analytics stored on Big Data.

The course also covers an introductory course in Spark SQL and Spark Core.

Microsoft Course Microsoft Course
500+

Courses

experience experience
20+

Years of Experience

learners learners
95K+

Global Learners

What you will learn

  • green-tick Describing about YARN and utilize cases for Hadoop.
  • green-tick Describing about Hadoop frameworks and ecosystem tools.
  • green-tick Describing Hadoop frameworks and ecosystem tools.
  • green-tick Describing about HDFS architecture.
  • green-tick Utilizing the Hadoop client and input data into HDFS.
  • green-tick Transferring of data between Hadoopn and relation database.
  • green-tick Explaining MaoReduce architectures and YARN.
  • green-tick Running MapReduce job on YARN.
  • green-tick Utilizing Pig to transform and explore data in HDFS.
  • green-tick Understanding about Hive Tables which are implemented and defined.
  • green-tick Utilizing the functionalities used in Hive Windows.
  • green-tick Utilizing Hive to analyze and explore datasets.
  • green-tick Utilizing and explaining about the several Hive File Formats.
  • green-tick Populating and creating a Hive table that utilized ORC file supported extension.
  • green-tick Utilizing Hive to execute SQL queries in performing data analysis.
  • green-tick Utilizing Hive for joining data sets and utilizing widespread techniques.
  • green-tick Writing effectual Hive queries.
  • green-tick Performing data analytics utilizing the DataFu Pig library.
  • green-tick Explaining the purposes and utilizing HCatalog.
  • green-tick Scheduling and defining about Oozie workflow.
  • green-tick Presenting high-level architecture and Spark ecosystem.
  • green-tick Exploring DataFrame API and Spark SQL.

Prerequisites

  • The recommended prerequisite for this course is familiarity with software development and programming principles. However, any specific knowledge of Hadoop is not required.

Who should attend this course?

  • This training is intended for software developers who want to gain knowledge in developing apps for Hadoop.

Schedules

Oops! For this course, there are currently no public schedules available. Clicking on "Notify Me" will allow you to express your interest.

For dates, times, and location customization of this course, get in touch with us.

You can also speak with a learning consultant by calling 800-961-0337.

Curriculum

  • Use HDFS commands to add/remove files and folders
  • Use Sqoop to transfer data between HDFS and a RDBMS
  • Run MapReduce and YARN application jobs
  • Explore, transform, split and join datasets using Pig
  • Use Pig to transform and export a dataset for use with Hive
  • Use HCatLoader and HCatStorer
  • Use Hive to discover useful information in a dataset
  • Describe how Hive queries get executed as MapReduce jobs
  • Perform a join of two datasets with Hive
  • Use advanced Hive features: windowing, views, ORC files
  • Use Hive analytics functions
  • Write a custom reducer in Python
  • Analyze clickstream data and compute quantiles with DataFu
  • Use Hive to compute ngrams on Avro-formatted files
  • Define an Oozie workflow
  • Use Spark Core to read files and perform data analysis
  • Create and join DataFrames with Spark SQL
  • Course Details

    • enroll enroll-green
      Enrolled: 1242
    • duration duration green
      Duration: 4 Days

    Talk to Learning Advisor