Creating a New DataTap

Tenant Administrators have the ability to create DataTaps. Clicking the Create button in the DataTaps screen opens the Create New DataTap screen.



DataTaps are created on a per-tenant basis. This means that a DataTap created in Tenant A is not available to Tenant B. You may, however, choose to create DataTaps in different tenants that point to the same storage path; in this situation, jobs in different tenants can access the same storage simultaneously. Also, multiple jobs within a tenant may use a given DataTap simultaneously. While such sharing can be useful, be aware that the same cautions and restrictions apply to these use cases as for other types of shared storage: multiple jobs modifying files at the same location may lead to file access errors and/or unexpected job results.

CAUTION:
CREATING MULTIPLE DATATAPS TO THE SAME DIRECTORY CAN LEAD TO CONFLICTS AND POTENTIAL DATA LOSS.
Note:

This article contains generic instructions for creating a DataTap. Please see the following for more specific examples:

To create a DataTap:

  1. Please see About DataTaps for important limitations on where you can create DataTaps.
  2. Enter a name for the DataTap in the Name field. This name may contain letters (A-Z or a-z), digits (0-9), and hyphens (-), but may not contain spaces.
  3. Enter a brief description for the DataTap in the Description field.
  4. You can make a DataTap read only by checking the Read Only check box. Clearing this check box allows read/write access.
  5. Select the file system type using the Select Type pull-down menu. The available options are:
  6. Review the entries you made in Steps 1-6 to make sure they are accurate.

When you have finished modifying the parameters for the DataTap, click Submit to create the new DataTap.

Note: If you need to configure wire encryption and/or Transparent Data Encryption (TDE), then please see HDFS DataTap Wire Encryption and/or HDFS DataTap TDE Configuration, as appropriate.

MAPR Parameters

If you selected MAPR in Step 5, above, then enter the following parameters:

HDFS Parameters

If you selected HDFS in Step 5, above, then enter the following parameters:

Continue from Step 6, above, after entering the HDFS parameters.

NFS Parameters

Note: This option is not available for Kubernetes tenants.

If you selected NFS in Step 5, above, then enter the following parameters:

Also, be sure to configure the storage device to allow access from each host and each Controller and Worker that will using this DataTap.

Continue from Step 6, above, after entering the NFS parameters.

GCS Parameters

An GCS DataTap is configured as follows: