HPE Slingshot AIOps Software QuickSpecs

Shape the Future of QuickSpecs - Your Input Matters

Understanding how a large complex interconnected computing system behaves is a daunting task.

There are thousands of data sources and millions of data points that offer input to the process of gaining insight and understanding of system performance. Customers consistently ask for advanced automated tools to better understand their system and workloads.


Drawing on raw telemetry data from multiple telemetry sources in an HPC/AI system including the unique telemetry from HPE Slingshot’s premier congestion management capability, the HPE Slingshot AIOps Software provides insights into the HPE Slingshot interconnect operations.

  • Overview

    By informing system administrators of congestion hot-spots, anomalies, and patterns in the interconnect traffic, HPE Slingshot AIOps empowers them to more rapidly detect potential issues, draw correlations, take preventive action, improve operations, and improve system workload and application performance. For HPC administrative staff, end users, and developers, the HPE Slingshot AIOps Software enables visibility into system and application operation and performance to improve the return on investment while reducing the day-to-day burden and costs of operating a complex system.


    What’s New

    • − Name change from HPE Cray Supercomputing Fabric AIOps Software to HPE Slingshot AIOPs Software
    • − HPE Slingshot AIOps Software new version.
  • Standard Features

    Job Summary Analysis

    • – Monitor the status of all the jobs to get a broader view of what jobs are running on the cluster.
    • – Synthesize and summarize jobs based on:
      • • Summary of Jobs completed, failed, jobs running and their runtime, queued
      • • Type of jobs failed (related to network, specific exit code patterns for jobs, jobs failed due to network)
      • • Cause of job failures
      • • Correlations between ports, switches and links for failed jobs (network capabilities, components, etc.
    • – Summary of Resource Utilization
      • • Number of idle resources in a network
      • • Resources with jobs running (with runtime) in a network
      • • Resource requirements to run specific jobs in a network

    Job Impact Visualization

    • – Job-based correlations explaining what caused a job failure, to understand which ports, links, and switches have been impacted by the job failure
    • – Visualize when a job isn't running and understand why that specific job isn't running
    • – Nodes on which the jobs are running (important to understand what network is used)

    Ability to use PBSPro for Job Data

    • – Functionality to use PBSPro for Job Data in addition to Slurm

    With HPE Slingshot AIOps Software, you can easily:

    • – Visualize network utilization heat map, providing real-time insights into bandwidth and network activity levels.
    • – Quickly identify and assess any network hotspots or bottlenecks
    • – Monitor job failures and execution delays, enabling informed decisions on whether to terminate underperforming jobs
    • – Toggle to isolate and view a specific job, providing detailed visibility into its traffic flow across the network
    • – Drill down into a specific job and analyze overall network utilization, pinpointing the causes of congestion.
    • – Identify usage patterns across bandwidth and congestion counters to uncover trends and potential bottlenecks.
    • – Monitor the status of all jobs for a comprehensive view of active cluster jobs and their resource utilization
    • – Receive job-based correlation insights that clarify the cause of job failures, detailing the affected ports, links, and switches
    • – Summarize applications and identify the reason for failed applications
    • – Single switch view to understand their impact on application failure

    Supported Hardware

    HPE Slingshot AIOps Software can be installed and used on HPE Slingshot interconnected systems.

    Many factors including system scale, telemetry data sampling rate, telemetry data retention period etc. must be considered in sizing and defining the hardware required to support HPE Slingshot Software. Please work with the HPE Slingshot Product Management team to develop system specific hardware requirements.


    Supported Operating Systems

    HPE Cray Operating System, Red Hat® Enterprise Linux (RHEL®); SUSE® Linux Enterprise Server (SLES®)

  • Service and Support

    HPE Services

    No matter where you are in your digital transformation journey, you can count on HPE Services to deliver the expertise you need when, where and how you need it. From planning to deployment, ongoing operations and beyond, our experts can help you realize your digital ambitions.

    https://www.hpe.com/services


    Consulting Services

    No matter where you are in your journey to hybrid cloud, experts can help you map out your next steps. From determining what workloads should live where, to handling governance and compliance, to managing costs, our experts can help you optimize your operations.

    https://www.hpe.com/services/consulting


    HPE Managed Services

    HPE runs your IT operations, providing services that monitor, operate, and optimize your infrastructure and applications, delivered consistently and globally to give you unified control and let you focus on innovation. HPE Managed Services | HPE


    Operational services

    Optimize your entire IT environment and drive innovation. Manage day-to-day IT operational tasks while freeing up valuable time and resources. Meet service-level targets and business objectives with features designed to drive better business outcomes.

    https://www.hpe.com/services/operational


    HPE Complete Care Service

    HPE Complete Care Service is a modular, edge-to-cloud IT environment service designed to help optimize your entire IT environment and achieve agreed upon IT outcomes and business goals through a personalized experience. All delivered by an assigned team of HPE Services experts. HPE Complete Care Service provides:

    • − A complete coverage approach -- edge to cloud
    • − An assigned HPE team
    • − Modular and fully personalized engagement
    • − Enhanced Incident Management experience with priority access
    • − Digitally enabled and AI driven customer experience

    https://www.hpe.com/services/completecare


    HPE Tech Care Service

    HPE Tech Care Service is the operational support service experience for HPE products. The service goes beyond traditional support by providing access to product specific experts, an AI driven digital experience, and general technical guidance to not only reduce risk but constantly search for ways to do things better. HPE Tech Care Service delivers a customer-centric, AI driven, and digitally enabled customer experience to move your business forward. HPE Tech Care Service is available in three response levels. Basic, which provides 9x5 business hour availability and a 2-hour response time. Essential which provides a 15-minute response time 24x7 for most enterprise level customers, and Critical which includes a 6-hour repair commitment where available and outage management response for severity 1 incidents. https://www.hpe.com/services/techcare


    For a list of the most frequently purchased services using service credits, see theHPE Service Credits Menu

HPE Lifecycle Services

HPE Lifecycle Services provide a variety of options to help maintain your HPE systems and solutions at all stages of the product lifecycle. A few popular examples include:

  • − Lifecycle Install and Startup Services: Various levels for physical installation and power on, remote access setup, installation and startup, and enhanced installation services with the operating system.
  • − HPE Firmware Update Analysis Service: Recommendations for firmware revision levels for selected HPE products, taking into account the relevant revision dependencies within your IT environment.
  • − HPE Firmware Update Implementation Service: Implementation of firmware updates for selected HPE server, storage, and solution products, taking into account the relevant revision dependencies within your IT environment.
  • − Implementation assistance services: Highly trained technical service specialists to assist you with a variety of activities, ranging from design, implementation, and platform deployment to consolidation, migration, project management, and onsite technical forums.
  • − HPE Service Credits: Access to prepaid services for flexibility to choose from a variety of specialized service activities, including assessments, performance maintenance reviews, firmware management, professional services, and operational best practices.

Notes: To review the list of Lifecycle Services available for your product go to: https://www.hpe.com/services/lifecycle


For a list of the most frequently purchased services using service credits, see theHPE Service Credits Menu


Other Related Services from HPE

Services: HPE Education Services

Training and certification designed for IT and business professionals across all industries. Broad catalogue of course offerings to expand skills and proficiencies in topics ranging from cloud and cybersecurity to AI and DevOps. Create learning paths to expand proficiency in a specific subject. Schedule training in a way that works best for your business with flexible continuous learning options.

https://www.hpe.com/services/training


Defective Media Retention

An option available with HPE Complete Care Service and HPE Tech Care Service and applies only to Disk or eligible SSD/Flash Drives replaced by HPE due to malfunction.


Consult your HPE Sales Representative or Authorized Channel Partner of choice for any additional questions and services options.

Parts and Materials

HPE will provide HPE-supported replacement parts and materials necessary to maintain the covered hardware product in operating condition, including parts and materials for available and recommended engineering improvements.


Parts and components that have reached their maximum supported lifetime and/or the maximum usage limitations as set forth in the manufacturer's operating manual, product quick-specs, or the technical product data sheet will not be provided, repaired, or replaced as part of these services.


How to Purchase Services

Services are sold by Hewlett Packard Enterprise and Hewlett Packard Enterprise Authorized Service Partners:

Services for customers purchasing from HPE or an enterprise reseller are quoted using HPE order configuration tools.

Customers purchasing from a commercial reseller can find services at https://ssc.hpe.com/portal/site/ssc/


AI Powered and Digitally Enabled Support Experience

Achieve faster time to resolution with access to product-specific resources and expertise through a digital and data driven customer experience


Sign into the HPE Support Center experience, featuring streamlined self-serve case creation and management capabilities with inline knowledge recommendations. You will also find personalized task alerts and powerful troubleshooting support through an intelligent virtual agent with seamless transition when needed to a live support agent. https://support.hpe.com/hpesc/public/home/signin


Consume IT On Your Terms

HPE GreenLakeedge-to-cloud platform brings the cloud experience directly to your apps and data wherever they are—the edge, colocations, or your data center. It delivers cloud services for on-premises IT infrastructure specifically tailored to your most demanding workloads. With a pay-per-use, scalable, point-and-click self-service experience that is managed for you, HPE GreenLake edge-to-cloud platform accelerates digital transformation in a distributed, edge-to-cloud world.

  • − Get faster time to market
  • − Save on TCO, align costs to business
  • − Scale quickly, meet unpredictable demand
  • − Simplify IT operations across your data centers and clouds

To learn more about HPE Services, please contact your Hewlett Packard Enterprise sales representative or Hewlett Packard

Enterprise Authorized Channel Partner. Contact information for a representative in your area can be found at "Contact HPE" https://www.hpe.com/us/en/contact-hpe.html


For more information http://www.hpe.com/services

  • Configuration Information

HPE Slingshot AIOps Software is licensed on a per-HPE Slingshot (Slingshot) endpoint connection for the duration of the stated term. A license is required for each endpoint to switch connection in the system that will be analyzed. For example, a license is required for each L0 cable connection for system using a DragonFly topology.

Description

SKU

HPE Slingshot AIOps Software 1-year E-RTU

S3K63AAE

HPE Slingshot AIOps Software 3-year E-RTU

S3K64AAE

HPE Slingshot AIOps Software 4-year E-RTU

S3K65AAE

HPE Slingshot AIOps Software 5-year E-RTU

S3K66AAE

HPE Slingshot AIOps Software FIO

S3K68A

  • Summary of Changes

Date

Version History

Action

Description of Change

02-Mar-2026

Changed

Rebranding update applied to QuickSpecs

16-Dec-2024

Updated

Overview and Standard Features sections were updated.

08-Jan-2024

New

New QuickSpecs

Recommended for you