DB Alias tab for Hadoop

Use the DB Alias tab to define the parameters needed for the Hadoop loader. The availability of certain options and fields might be dependent upon your entries for related options and fields on the tab.

Example of the DB Alias tab for Hadoop, described as follows.

Target URL

Specify the http URL of the Hadoop cluster’s name node. Here is an example: http://somehost:50070.

User ID

Specify the user name for the Hadoop user.

Password

Specify the password for the Hadoop user.

Work Path for Interm Files

Specify a default directory path for storing the temporary loader files.

Load on Row Count

Specify the frequency at which you want rows loaded into a CSV file and sent to the Hadoop server to load into Hadoop. For example, if you specify a row count of 1000, each time a thousand rows are loaded into a CSV file, the file is sent to the Hadoop server to load into Hadoop. This process is repeated, in 1000 row increments, until all required rows are processed.

For best performance, specify a commit frequency of 1000 - 1000000 (from one thousand to one million). You can specify a frequency of less than 1000 or more than 1000000, but doing so will adversely affect performance.



Feedback