HPE EZMERAL MARKETPLACE

Explore | Learn | Engage | Deploy

Apache Spark 2.2.1 with Jupyter Notebook

Open source analytics engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing. HPE Ezmeral Container platform automates deployment of a Spark cluster on a Jupyter Notebook.

Product Name

Apache Spark 2.2.1 with Jupyter Notebook

Product Version

2.2.1

HPE Ezmeral Container Platform Version

5.2

Overview

Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for incremental computation and stream processing. 

Spark is designed to support a wide range of data analytics tasks, ranging from simple data loading and SQL queries to machine learning and streaming computation, over the same computing engine and with a consistent set of APIs. The main insight behind this goal is that real-world data analytics tasks — whether they are interactive analytics in a tool, such as a Jupyter notebook, or traditional software development for production applications — tend to combine many different processing types and libraries. Spark’s unified nature makes these tasks both easier and more efficient to write. 

 

  • Developer-friendly platform for large-scale SQL, batch processing, stream processing, and machine learning 
  • Lightning-fast unified analytics engine for big data and machine learning 
  • Easy-to-use APIs for operating on large datasets 
  • Packaged libraries increase developer productivity and seamlessly combined to create complex workflow 
     

HPE Ezmeral Container Platform makes it easy to deploy a Spark 2.2.1 cluster on a Jupyter Notebook for data scientists to create and share their work in a browser-based graphical interface.  This Spark cluster can be directly deployed from HPE Ezmeral Container Platform’s application UI.

Documentation Additional Resources

Explore the industry’s first enterprise-grade container platform for cloud-native and distributed non-cloud native applications, HPE Ezmeral Container Platform.   

Interested in learning more about the HPE Ezmeral Container Platform and Apache Spark? Please contact us to learn more.  

Explore other featured applications