Prerequisites

Topics:

Review the prerequisites in this section before installing and using iWay Big Data Integrator (iBDI).

JDBC Drivers

JDBC drivers are not shipped with iBDI and must be obtained from a relevant JDBC vendor (for example, Cloudera and Simba). Depending on your usage requirements, there are several open source Hive JDBC drivers available online. For additional security and availability options, commercial drivers are also available. Each driver includes a list of dependencies for the specific driver. Consult the driver provider for the list of .jar files that must be used with the specific driver version.

Server Deployment

A Hadoop Edge node (also referred to as a gateway node) is the interface between a Hadoop cluster and an external network. An Edge node is used to run client applications and administration tools for clusters. An Edge node is often used as a staging environment for data that is being transferred into a Hadoop cluster.

The iBDI Hadoop or Spark environment must be provisioned for the product to function. This requires installing the iBDI runtime Java .jar files into a folder on the Edge node.

For more information, see the iWay Big Data Integrator Provisioning Guide.

Client Workbench

This section provides prerequisites for the iBDI client workbench.

  • 8GB RAM on either Windows (Version 7, 8, or 10), Mac (OS X El Capitan), or Linux (Ubuntu or Centos).
  • Java Version 8 (1.8_31 and higher).
  • Download one of the following packaged archive files based on your operating system:
    • Linux: bdi-1.5.2.I201702211956-linux.gtk.x86_64.tar.gz
    • Mac OS X: bdi-1.5.2.I201702211956-macosx.cocoa.x86_64.tar.gz
    • Windows: bdi-1.5.2.I201702211956-win32.win32.x86_64.zip

    Note: This documentation references the Windows version of iBDI.

  • Depending on the database, download the required JDBC driver and save the driver to your local system:
    • Postgres: postgresql-9.4-1201.jdbc4.jar
    • MySQL: mysql-connector-java-5.1.28-bin.jar
    • SQL Server: sqljdbc4.jar
    • Appropriate driver for Hadoop distribution.