Skip to main content

WEKA Data Platform—Deployment, Management, and Performance Engineering

H54GBS

Table of Contents

Table of Contents

    Course ID

    H54GBS

    Duration

    2 days

    Format

    ILT/VILT

    Overview

    This course provides a comprehensive introduction to the HPE WEKA data platform for high-performance AI, machine learning (ML), and high performance computing (HPC) workloads. You learn the core architecture of WEKA, including its front-end and back-end design, data striping mechanisms, and scalability principles. The course covers end-to-end deployment, cluster management, security configuration, tiering to object storage, and integration with compute workloads. You also gain hands-on experience with automation, API-driven operations, and performance optimization in a distributed WEKA environment.

    Course ID

    H54GBS

    Duration

    2 days

    Format

    ILT/VILT

    Audience

    This course is ideal for:

    • Data infrastructure and storage professionals working with high-performance computing or AI/ML workloads
    • System architects and technical leads responsible for designing scalable data platforms
    • DevOps and infrastructure engineers managing distributed or hybrid data environments
    • MLOps, HPC, and AI platform engineers integrating storage systems with GPU and compute clusters
    • IT administrators and solution engineers interested in data tiering, performance tuning, and automation

    Prerequisites

    There are no prerequisites for this course, but we recommend you have the following before attending:

    • Linux scripting knowledge:
      • Familiarity with SSH, networking commands, and basic shell scripting
      • Awareness of mounting file systems, and managing storage volumes
      • Understanding of users, permissions, and process management
    • Basic understanding of storage and data
      • Knowledge of file, block, and object storage
      • Awareness of RAID, NFS and S3-compatible storages
      • Understanding of redundancy, replication, and throughput concepts.
    • Exposure to big data, AI/ML, or HPC workloads
      • Appreciation of GPU acceleration
      • Understanding of big data
      • Awareness of AI/ML lifecycle

    Objectives

    After completing this course, you should be able to:

    • Understand the HPE WEKA Data Platform architecture and its role in high-performance AI, ML, and HPC environments
    • Deploy, configure, and manage WEKA clusters for distributed file and object storage
    • Integrate WEKA with compute workloads, including GPU-accelerated systems, to achieve optimized throughput
    • Secure WEKA environments through authentication, encryption, and access control policies
    • Automate cluster operations and monitoring using the WEKA CLI and REST APIs
    • Implement data tiering and lifecycle management using file-object storage strategies
    Divider

    Course outline

    Module 1: Introduction to WEKA Data Platform and Architecture


    • Overview of WEKA architecture and design principles
    • Frontend and backend node functions
    • Supported interfaces: POSIX, NFS, SMB, and S3
    • High-level data flow within the WEKA cluster

    Module 2: System Architecture and Cluster Design


    • Logical vs. physical WEKA architecture
    • Data striping and replication strategies
    • Frontend-backend separation and balancing
    • Planning for scalability and high availability

    Module 3: Setting Up the Cluster


    • Node and disk preparation steps
    • Cluster initialization and joining backend nodes
    • Client driver installation and configuration
    • Verification and cluster health checks

    Module 4: Cluster Ops and Management


    • Node management and capacity expansion
    • Monitoring tools and dashboards
    • Snapshotting and version control
    • Data protection, recovery, and maintenance operations

    Module 5: Integration with Compute Workloads


    • Configuring WEKA clients on compute nodes
    • Data locality and caching mechanisms
    • Benchmarking throughput and latency
    • Running distributed workloads and performance validation

    Module 6: Tiering and Object Store


    • Tiering architecture and benefits
    • Configuring object storage backends
    • Automating data movement and rehydration
    • Cost-performance optimization strategies

    Module 7: Security and Access Management


    • User and group management
    • Role-based access control (RBAC)
    • Encryption in transit and at rest
    • Audit logging and security best practices

    Module 8: Automation and Performance Optimization


    • Using WEKA CLI and REST API for operations
    • Scripting backups, scaling, and monitoring
    • Performance diagnostics and tuning
    • Troubleshooting and capacity planning best practices

    5 reasons to choose HPE as your training partner

    1. Learn HPE and in-demand IT industry technologies from expert instructors.
    2. Build career-advancing power skills.
    3. Enjoy personalized learning journeys aligned to your company’s needs.
    4. Choose how you learn: in-person , virtually , or online —anytime, anywhere.
    5. Sharpen your skills with access to real environments in virtual labs .

    Explore our simplified purchase options, including HPE Education Learning Credits .

    Lab outline

    Lab 1: WEKA Cluster Deployment and Initialization


    • Deploy compute nodes to form a WEKA cluster
    • Configure networking, validate connectivity, and ensure node discovery
    • Initialize the WEKA cluster, join nodes, and verify cluster health
    • Mount the WEKA filesystem on the control node and confirm accessibility

    Lab 2: Cluster Management and Monitoring


    • Explore WEKA’s management console and CLI for monitoring performance metrics
    • Add and remove nodes dynamically to simulate scaling operations
    • Create and restore snapshots to demonstrate data protection
    • Monitor throughput, latency, and disk usage

    Lab 3: Client Integration and Workload Simulation


    • Install WEKA client drivers on compute nodes
    • Mount the WEKA filesystem for application access.
    • Run sample I/O tests to measure read/write throughput
    • Simulate machine learning workload performance over the WEKA filesystem

    Lab 4: Tiering and Object Store Configuration

    • Connect the WEKA cluster to an object store for tiering
    • Configure automatic data movement policies for cold and hot data
    • Move datasets between tiers and verify retrieval using rehydration commands

    Lab 5: Security and Access Control Implementation

    • Configure role-based access control (RBAC) for users and groups
    • Enable data encryption
    • Set up audit logging for administrative actions and access attempt

    Lab 6: Automation, API Integration, and Performance Tuning

    • Use WEKA CLI and REST API for automated management tasks
    • Script cluster health checks, capacity monitoring, and scaling workflows
    • Run diagnostic tools to identify I/O bottlenecks
    • Apply performance tuning parameters and validate improvements through benchmarking

    Recommended for you