Back

HPE Performance Cluster Manager is a fully integrated system management solution offering all the functionalities you need to manage your Linux®-based high performance computing (HPC) clusters all day, everyday. The software provides system setup, hardware monitoring and management, image management and software updates as well as power management for systems of any scale - up to 100,000 nodes. The HPE Performance Cluster Manager reduces the time and resources spent administering HPC systems - lowering total cost of ownership, increasing productivity and providing a better return on your hardware investments.

What's new

  • Cluster Health Management feature
  • Active-Active High Availability (HA) setup for more resiliency

Features

Fast System Setup

Guided setup helps to easily install the software, discover hardware components for the cluster nodes and provision operating system for all compute and service nodes in the cluster.

HPE Performance Cluster Manager can quickly provision a cluster with thousands of nodes from bare metal – typically within an hour.

Adding new cluster nodes to the system does not require system shutdown.

Comprehensive Hardware Monitoring and Management

HPE Performance Cluster Manager provides fine-grained central monitoring and management of all aspects of your cluster hardware (CPU, memory, GPU, networking, cooling, …)

When issues are detected, alerts are sent to the attention of the system administrator via the console (GUI, CLI) and by email. The sofware also offers setup of automatic reactions to specific alerts.

Additional analyses of the hardware metrics can be done by visualizing the metrics at a specific point in time or over a historical period in a user-friendly GUI. Alternatively, users can also monitor and analyze metrics and view alerts via Ganglia, Nagios Core or ELK.

The software supports integrated firmware flashing including flashing of BIOS, BMC/iLO, CMC, network adapters and switches. The installed software, including the BIOS on the cluster nodes, can be compared and flagged for any inconsistencies with versions or missing items.

To protect systems from security breaches, system administrator tasks are kept on the administrative nodes are secured from end-user access.

Flexible Software Management to Accommodate a Wide Range of Requirements

With HPE Performance Cluster Manager any software image can be provisioned on all or select cluster nodes to accommodate different user requirements.

The secure software image repository can store multiple versions of the Linux operating system, libraries, tools and applications. The software supports multiple formats (RPM, ISO, gold image).

Version control allows software changes to roll forward or backward as required and keeps track of changes for accountability.

Lower Operating Costs with Advanced Power Management

HPE Performance Cluster Manager offers tools for accurate measurement and prediction of power usage for better capacity planning.

HPE Performance Cluster Manager collects system power metrics for cluster nodes and liquid cooling infrastructure and offers tools for analysis and future capacity planning.

The software supports advanced power management features for power capping and power resource management for jobs.

Step-by-step topology and protocol-aware Power On/Off feature allows controlled start of the system as well as isolation of incidents caused by a failure and faster system recovery.

  • 1.
    Available for HPE SGI 8600 systems.
  • Linux is the registered trademark of Linus Torvalds in the U.S. and other countries. All other third-party trademark(s) is/are property of their respective owner(s).