To show how the HPE Ezmeral Data Fabric Event Store concepts fit together, here is an example of the flow of one message from a producer to a consumer.
Suppose that you are using HPE Ezmeral Data Fabric Event Store as part of a system to monitor traffic in San Francisco. Your producers are sensors in streets, freeways, bridges, overpasses, and other infrastructure, as well as sensors reporting the weather in many different locations. Your consumers are various analytical and reporting tools.
In a volume in a data-fabric
cluster, you create the stream /somepath/traffic_monitoring. In that
stream, you create the topics traffic, infrastructure, and
weather_conditions.
Of all of the sensors (producers) that your system uses to monitor traffic, let us choose a
sensor that is under the pavement of Market Street and follow a message that it generates.
We will follow a message that is generated by this sensor and published in the
traffic topic.
Suppose that, when you created this topic, you created several partitions within it to help
spread the load among the different nodes in your data-fabric cluster and to help improve the performance of your
consumers. For simplicity, we will assume that the traffic topic has only
one partition.
For a moment, suppose that this example used more than one partition. In that case, the sensor could influence how the HPE Ezmeral Data Fabric Event Store server determines which messages go to which partition. In the example that we are following, the sensor could include a key with each message. The HPE Ezmeral Data Fabric Event Store server would hash the key to determine the partition to place the messages received from the sensor. More information about how partitions are selected if there are more than one in a topic is explained later in this documentation.
traffic topic assigns the offset 001030 to the message
that we are following, and replicates the message to replica containers (replication rules
are controlled at the volume level) within the data-fabric cluster.traffic
traffic topic. Many more consumers could subscribe to
it, too.
Back in the cluster in San Francisco, messages are being continuously published to the
partition in the traffic topic. Message 001030 is much further in the
partition. More recent messages have filled the partition ahead of it.
When you created the stream, you set the time-to-live for messages to be six months. Message 001030 and messages around it have now been in the partition for that period, and are now expired. An automatic process eventually reclaims the disk space that message 001030 and the other expired messages are using.