Sqoop Connectivity for Relational Sources and Targets
Effective in version 10.1, you can use Sqoop to import and export data between relational databases and HDFS through MapReduce programs. When you use Sqoop, you do not need to install the relational database client software on any node in the Hadoop cluster.
To use Sqoop, you must configure Sqoop properties in a JDBC connection and run the mapping in the Hadoop environment. You can configure Sqoop connectivity for relational data objects, customized data objects, and logical data objects that are based on a JDBC-compliant database. For example, you can configure Sqoop connectivity for the following databases:
Aurora
IBM DB2
IBM DB2 for z/OS
Greenplum
Microsoft SQL Server
Netezza
Oracle
Teradata
You can also run a profile on data objects that use Sqoop in the Hive run-time environment.
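As a rough illustration of what a Sqoop import from a relational database to HDFS involves, the following Python sketch assembles the argument list for a standalone sqoop import command. It is not how the product is configured, because Sqoop properties are set in the JDBC connection as described above. The helper name build_sqoop_import, the host name, the table name, the user name, and the HDFS path are all hypothetical placeholders. The sketch only builds and prints the command. It does not run Sqoop.

```python
import shlex


def build_sqoop_import(jdbc_url, table, target_dir, username, num_mappers=4):
    """Return the argument list for a standalone `sqoop import` call.

    Hypothetical helper for illustration only. The flags used here
    (--connect, --username, --table, --target-dir, --num-mappers)
    are standard Sqoop 1 import arguments.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC URL of the source database
        "--username", username,             # database user (placeholder)
        "--table", table,                   # source table to import
        "--target-dir", target_dir,         # HDFS directory for the output
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


# All values below are assumed placeholders, not values from the product.
args = build_sqoop_import(
    jdbc_url="jdbc:oracle:thin:@//dbhost.example.com:1521/ORCL",
    table="CUSTOMERS",
    target_dir="/data/staging/customers",
    username="etl_user",
)

# Print a shell-safe rendering of the command without executing it.
print(shlex.join(args))
```

Sqoop connects through JDBC, so the only database-specific parts of the command are the JDBC URL and the driver behind it. This is consistent with not needing the database client software on the cluster nodes.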