Column Profiles for Sqoop Data Sources

You can run a column profile on data objects that use Sqoop. To run these column profiles, you must select the Hive run-time environment.
To run a column profile on a relational data object that uses Sqoop, you must set the Sqoop argument m to 1. Use the following syntax:
-m 1
To run a column profile on a logical data object or customized data object that uses Sqoop, you do not need to set the Sqoop argument m to 1. You can configure the num-mappers argument to achieve parallelism and improve performance. If you configure the num-mappers argument, you must also configure the split-by argument to specify the column that Sqoop uses to split the work units. If you do not configure the split-by argument, the value of the num-mappers argument defaults to 1.
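As an illustration of the num-mappers and split-by arguments, the following fragment requests four mappers and splits the work units on a single column. The value 4 and the column name CUSTOMER_ID are hypothetical placeholders. The argument names are written in the standard Sqoop long form.

```
--num-mappers 4 --split-by CUSTOMER_ID
```

If the split-by argument were omitted from this fragment, the value of the num-mappers argument would default to 1.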


Updated July 03, 2018