Describes how replication works, and how to configure the replication factor.
Volumes are stored as pieces called containers that contain files, directories, and other
data. By default, the maximum container size is 32 GB. The data-fabric administrator sets the maximum
container size using the cldb.container.sizemb parameter (see the config commands). Containers are replicated to protect data.
Normally, each container has three copies stored on separate nodes to provide uninterrupted
access to all data, even if a node fails.
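The arithmetic behind these defaults can be sketched as follows. This is a rough illustration, not a data-fabric API: the function names are hypothetical, the container count is only a lower bound (containers are not necessarily filled to the maximum size), and the failure count assumes that every copy sits on a different node and that one surviving copy is enough to read the data.

```python
import math

MAX_CONTAINER_GB = 32   # default maximum container size stated above
COPIES = 3              # usual number of copies per container


def min_containers(volume_gb: float, max_container_gb: float = MAX_CONTAINER_GB) -> int:
    """Lower bound on the number of containers needed to hold volume_gb of data."""
    return math.ceil(volume_gb / max_container_gb)


def raw_storage_gb(volume_gb: float, copies: int = COPIES) -> float:
    """Raw disk space consumed when every container is stored `copies` times."""
    return volume_gb * copies


def tolerated_node_failures(copies: int = COPIES) -> int:
    """Node failures that still leave one copy, assuming one copy per node."""
    return copies - 1


if __name__ == "__main__":
    print(min_containers(100))        # at least 4 containers for 100 GB
    print(raw_storage_gb(100))        # 300 GB of raw storage
    print(tolerated_node_failures())  # 2
```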
For each volume, you can specify a desired and minimum data replication factor, and a desired and minimum namespace (name container) replication factor.
When enabled, the CLDB manages namespace container replication separately from data container replication. Use this capability when a volume has a low data replication factor but you want a higher namespace replication factor.
The re-replication timeout is set with the cldb.fs.mark.rereplicate.sec parameter (configurable
using the config API). The minimum replication factor is 1 and the maximum
is 6 (default: 3).
If the enforceminreplicationforio property (configurable at the volume level) is set to
true, writes succeed only when the minimum replication factor requirements are met. If this
property is set to true and the minimum number of copies is not available, the client is
asked to retry. Note that when the enforceminreplicationforio property is set to true, the
requirement to maintain a minimum number of copies is not enforced during writes until new
copies of all containers associated with the volume are created.
If any containers in the CLDB volume fall below the minimum replication factor,
the cluster is inaccessible until aggressive re-replication restores the minimum level of
replication. If a disk failure is detected, any data stored on the failed disk is
re-replicated without regard to the timeout specified in the
cldb.fs.mark.rereplicate.sec parameter.
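The write rule tied to enforceminreplicationforio can be pictured with a small model. This is a simplified sketch, not data-fabric code: the class, field, and function names are hypothetical, and the model deliberately ignores the initial period in which the minimum is not yet enforced because new copies of the volume's containers are still being created.

```python
from dataclasses import dataclass


@dataclass
class VolumeState:
    """Hypothetical snapshot of the values that matter for a write."""
    min_replication: int                   # minimum replication factor of the volume
    enforce_min_replication_for_io: bool   # stands in for enforceminreplicationforio
    available_copies: int                  # copies of the target container currently available


def write_outcome(volume: VolumeState) -> str:
    """Return 'retry' when the minimum is enforced and not met, else 'succeed'."""
    if volume.enforce_min_replication_for_io and volume.available_copies < volume.min_replication:
        return "retry"
    return "succeed"


if __name__ == "__main__":
    print(write_outcome(VolumeState(2, True, 1)))   # retry
    print(write_outcome(VolumeState(2, False, 1)))  # succeed
    print(write_outcome(VolumeState(2, True, 2)))   # succeed
```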
If all copies of a container that is neither under-replicated nor over-replicated are on the same rack, data-fabric automatically detects this and, after 12 hours, redistributes the copies so that they are not all on the same rack. If a container is under-replicated and data-fabric cannot find a different rack for the new copy, creation of the copy is deferred. If another rack is still unavailable after 3 hours, data-fabric creates the copy on the same rack; if this leaves all copies of the container on the same rack, data-fabric redistributes the copies after 12 hours. During replication, data-fabric also tries to defer placements that would leave all copies on the same rack.
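The rack placement timeline described above can be sketched as a pair of decision functions. This is an illustrative model only: the function names and return strings are hypothetical, the 3-hour and 12-hour values are taken from the text, and the exact boundary handling (for example, treating exactly 3 hours as the point at which the same-rack copy is created) is an assumption.

```python
from typing import Sequence

SAME_RACK_FALLBACK_HOURS = 3    # wait before accepting a copy on the same rack
REDISTRIBUTE_AFTER_HOURS = 12   # wait before spreading same-rack copies


def new_copy_action(other_rack_available: bool, hours_waiting: float) -> str:
    """Decide what to do with the missing copy of an under-replicated container."""
    if other_rack_available:
        return "create copy on a different rack"
    if hours_waiting < SAME_RACK_FALLBACK_HOURS:
        return "defer"
    return "create copy on the same rack"


def needs_redistribution(copy_racks: Sequence[str], hours_on_same_rack: float) -> bool:
    """True when all copies share one rack and the 12-hour wait has elapsed."""
    all_on_one_rack = len(copy_racks) > 1 and len(set(copy_racks)) == 1
    return all_on_one_rack and hours_on_same_rack >= REDISTRIBUTE_AFTER_HOURS


if __name__ == "__main__":
    print(new_copy_action(False, 1))                              # defer
    print(new_copy_action(False, 3))                              # create copy on the same rack
    print(needs_redistribution(["rack1", "rack1", "rack1"], 12))  # True
```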
If you do not set the namespace (NS) replication and minimum namespace replication values
explicitly, they take the same values as the (data) replication and minimum replication
values, respectively. This means that all changes to the (data) replication and
minreplication parameters are also reflected in
nsreplication and nsminreplication. Once
nsreplication or nsminreplication is modified, or
specified during volume creation, nsreplication and
nsminreplication no longer follow replication and
minreplication and keep their own values.
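The defaulting behavior of the namespace values can be modeled with optional overrides. This is a minimal sketch, not the CLDB implementation: the class and property names are hypothetical, the numbers in the usage lines are illustrative, and treating nsreplication and nsminreplication as independently overridable is an assumption made to keep the model small.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VolumeReplication:
    """Hypothetical model of a volume's four replication settings."""
    replication: int
    minreplication: int
    nsreplication: Optional[int] = None      # None means "not set explicitly"
    nsminreplication: Optional[int] = None   # None means "not set explicitly"

    @property
    def effective_nsreplication(self) -> int:
        # Follows the data replication factor until an explicit value is set.
        return self.replication if self.nsreplication is None else self.nsreplication

    @property
    def effective_nsminreplication(self) -> int:
        # Follows the minimum data replication factor until an explicit value is set.
        return self.minreplication if self.nsminreplication is None else self.nsminreplication


if __name__ == "__main__":
    vol = VolumeReplication(replication=3, minreplication=2)
    vol.replication = 4
    print(vol.effective_nsreplication)   # 4, still follows replication
    vol.nsreplication = 5
    vol.replication = 2
    print(vol.effective_nsreplication)   # 5, no longer follows replication
```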
This section describes the advantages of both Table Replication and Mirroring, to let you determine the best option for your use case.
Advantages of Table Replication
For tables and streams, table replication is usually the right choice. However, there are exceptions where mirroring is the best choice.
Advantages of Mirroring