Copying Data Using the webhdfs:// Protocol

Describes how to copy data from an HDFS cluster to an HPE Ezmeral Data Fabric cluster using the webhdfs:// protocol.

Before you can copy data from an HDFS cluster to an HPE Ezmeral Data Fabric cluster using the webhdfs:// protocol, you must configure the HPE Ezmeral Data Fabric cluster to access the HDFS cluster. To do this, complete the steps listed in Configuring a HPE Ezmeral Data Fabric Cluster to Access an HDFS Cluster for the security scenario that best describes your HDFS and HPE Ezmeral Data Fabric clusters. Then complete the steps listed under Verifying Access to an HDFS Cluster.

The HDFS cluster must have WebHDFS enabled. Verify that the following parameter exists in the hdfs-site.xml file and that the value is set to true.

<property>
  <name>dfs.webhdfs.enabled</name>
  <value>true</value>
</property>
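
Besides inspecting hdfs-site.xml by hand, you can probe the WebHDFS REST endpoint directly. The following is a minimal sketch, not part of the documented procedure. It assumes an unsecured HDFS cluster, that curl is installed, and that the NameNode host and HTTP port are known. The function names webhdfs_list_url and probe_webhdfs are hypothetical helpers introduced here for illustration.

```shell
#!/usr/bin/env bash

# Build the WebHDFS REST URL that lists a directory (op=LISTSTATUS).
# Arguments: NameNode host, NameNode HTTP port, absolute HDFS path.
webhdfs_list_url() {
  printf 'http://%s:%s/webhdfs/v1%s?op=LISTSTATUS\n' "$1" "$2" "$3"
}

# Print only the HTTP status code returned by the endpoint.
# -s silences progress output, -o /dev/null discards the response body,
# -m 10 gives up after 10 seconds, -w prints the status code.
probe_webhdfs() {
  curl -s -o /dev/null -m 10 -w '%{http_code}\n' \
    "$(webhdfs_list_url "$1" "$2" "$3")"
}

# Example calls (host and port are placeholders taken from this page's example):
#   hdfs getconf -confKey dfs.webhdfs.enabled   # run on an HDFS cluster node
#   probe_webhdfs nn2 50070 /
```

A status code of 200 from probe_webhdfs suggests that WebHDFS is serving requests for that path. On a Kerberos-secured cluster the request likely needs SPNEGO authentication (for example, curl's --negotiate -u : options), so treat a 401 response as an authentication issue rather than proof that WebHDFS is disabled. Note that hdfs getconf reports the value resolved from the configuration on the node where you run it.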
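
You also need the following information: the NameNode host of the HDFS cluster, the NameNode HTTP port, the HDFS path of the data to copy, and the destination path in the HPE Ezmeral Data Fabric filesystem.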

To copy data from HDFS to the HPE Ezmeral Data Fabric filesystem using the webhdfs:// protocol, complete the following step:

Run the following command from a node in the HPE Ezmeral Data Fabric cluster:
hadoop distcp webhdfs://<NameNode>:<NameNode HTTP Port>/<HDFS path> maprfs:///<MapR filesystem path>
Example
hadoop distcp webhdfs://nn2:50070/user/sara maprfs:///user/sara 
Note: The triple slashes in maprfs:///... are required.
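
The command above can be wrapped in a small script so that the source and destination URIs are assembled consistently. This is a sketch under stated assumptions rather than a supported tool: the function names webhdfs_uri, maprfs_uri, and copy_from_hdfs, and the DRY_RUN variable, are hypothetical and introduced here for illustration. It assumes that both paths are absolute (they start with a slash) and that the hadoop command is on the PATH of the HPE Ezmeral Data Fabric node.

```shell
#!/usr/bin/env bash

# Source URI. Arguments: NameNode host, NameNode HTTP port, absolute HDFS path.
webhdfs_uri() {
  printf 'webhdfs://%s:%s%s\n' "$1" "$2" "$3"
}

# Destination URI. Argument: absolute destination path.
# The authority part after "maprfs://" is left empty, so an absolute
# path yields the required triple slash (maprfs:///...).
maprfs_uri() {
  printf 'maprfs://%s\n' "$1"
}

# Run distcp, or only print the command when DRY_RUN=1.
# Arguments: NameNode host, HTTP port, HDFS source path, destination path.
copy_from_hdfs() {
  local src dst
  src="$(webhdfs_uri "$1" "$2" "$3")"
  dst="$(maprfs_uri "$4")"
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "hadoop distcp $src $dst"
  else
    hadoop distcp "$src" "$dst"
  fi
}

# Example calls (values taken from the example on this page):
#   DRY_RUN=1 copy_from_hdfs nn2 50070 /user/sara /user/sara
#   copy_from_hdfs nn2 50070 /user/sara /user/sara
```

Running with DRY_RUN=1 first lets you confirm the exact command before any data moves. After the copy finishes, listing the destination with hadoop fs -ls maprfs:///user/sara is one way to check that the files arrived.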