HPE Ezmeral Unified Analytics Data Engineering

H41CVS

Course ID

H41CVS

Duration

1 day

Format

ILT, VILT

Overview

This course teaches you how to implement the HPE Ezmeral Unified Analytics solution. It also introduces EzPresto, a SQL query engine and it covers using data visualizations and dashboards using Apache Superset, data pipeline management using Apache Airflow and Ray framework on HPE Ezmeral Unified Analytics. The course is comprised of of 30% lecture and 70% practical hands-on labs.


Course ID

H41CVS

Duration

1 day

Format

ILT, VILT

  • Audience

    This course is ideal for System Administrators, integrators, data engineers and learners who wants to implement HPE Ezmeral Unified Analytics solution.


  • Prerequisites

    Before attending this course, you should have:

    • An understanding of Kubernetes or any container orchestration software
    • A basic understanding of big data Open-Source Tools and Frameworks
    • A basic understanding of HPE Ezmeral Data Fabric

  • Objectives

    After completing this course, you should be able to:

    • Describe the features and capabilities of HPE EzUA
    • Demonstrate running federated queries across various data sources using HPE EzUA
    • Demonstrate authoring, and monitoring workflows and data pipelines using HPE EzUA
    • Use data visualizations and dashboards with HPE EzUA

Data Analytics
  • Course outline

Module 1: Introduction to HPE Ezmeral Unified Analytics Software (HPE EzUA)


  • Describe the features and capabilities of HPE EzUA
  • Understand data engineering components of HPE EzUA
  • Identify the navigation in the HPE EzUA software
  • Discuss steps to get started with HPE EzUA

Module 2: Introduction to EzPresto


  • Understand what EzPresto is
  • Discuss EzPresto key features
  • Recognize EzPresto architecture
  • Define connect data sources using HPE EzUA
  • Describe steps to connect to external applications through JDBC using HPE EzUA
  • Use Spark to query EzPresto on HPE EzUA
  • Discuss connectivity to EzPresto via Python client using HPE EzUA
  • Define cache data

Module 3: Introduction to Workflows


  • Define Airflow functionality
  • Recognize Airflow architecture and its components
  • Configure Airflow DAGs Git Repository using HPE EzUA
  • Demonstrate Airflow configuration using HPE EzUA

Module 4: Superset Overview


  • Define Superset
  • Demonstrate BI reporting using Superset on HPE EzUA
  • Demonstrate retail store analysis dashboard using Superset on HPE EzUA

5 reasons to choose HPE as your training partner

  1. Learn HPE and in-demand IT industry technologies from expert instructors.
  2. Build career-advancing power skills.
  3. Enjoy personalized learning journeys aligned to your company’s needs.
  4. Choose how you learn: in-person, virtually, or online—anytime, anywhere.
  5. Sharpen your skills with access to real environments in virtual labs.

Explore our simplified purchase options, including HPE Education Learning Credits.

  • Lab outline

Lab 1: Accessing the Lab Environment

Lab 2: Running Federated Queries Across Various Data Sources

Lab 3: Retail Store Analysis Dashboard Using Superset on HPE EzUA

Lab 4: Authoring, Monitoring - Workflows and Data Pipelines with Apache Airflow

Lab 5: Batch and Stream ETL with Apache Spark

Lab 6: Orchestrating Spark Applications with Apache Airflow

Lab 7: ETL with Apache Spark

Lab 8: Querying EzPresto with Apache Spark and Ray

Lab 9: BI Reporting and Analytics with Superset

Recommended for you