Introduction to Hadoop with Hive 3 Training

Course duration

Course Benefits

  • Learn to get started using Hive.

Course Outline

  1. Why Hadoop?
    1. The motivation for Hadoop
    2. Use cases and case studies about Hadoop
  2. The Hadoop platform
    1. MapReduce, HDFS, YARN
    2. New in Hadoop 3
      1. Erasure Coding vs 3x replication
  3. Hive Basics
    1. Defining Hive Tables
    2. SQL Queries over Structured Data
    3. Filtering / Search
    4. Aggregations / Ordering
    5. Partitions
    6. Joins
    7. Text Analytics (Semi Structured Data)
  4. New in Hive 3
    1. ACID tables
    2. Hive Query Language (HQL)
      1. How to run a good query?
      2. How to trouble shoot queries?
  5. HBase
    1. Basics
    2. HBase tables - design and use
    3. Phoenix driver for HBase tables
  6. Sqoop
    1. Tool
    2. Architecture
    3. Use
  7. Spark
    1. Overview
    2. Spark SQL
  8. The big picture
    1. How Hadoop fits into your architecture
    2. Hive vs HBase with Phoenix vs Excel

Class Materials

Each student will receive a comprehensive set of materials, including course notes and all the class examples.

Class Prerequisites

Experience in the following is required for this Hadoop class:

  • Exposure to SQL
  • Ability to navigate the Linux command line.
Since its founding in 1995, InterSource has been providing high quality and highly customized training solutions to clients worldwide. With over 500 course titles constantly updated and numerous course customization and creation possibilities, we have the capability to meet your I.T. training needs.
Instructor-led courses are offered via a live Web connection, at client sites throughout Europe, and at our Geneva Training Center.