Preface
Understanding PowerExchange for Hadoop
PowerExchange for Hadoop Configuration
PowerExchange for Hadoop Sources and Targets
- PowerExchange for Hadoop Sources and Targets Overview
PowerExchange for Hadoop Sessions
Data Type Reference
- Data Type Reference Overview
- Flat File and Transformation Data Types

PowerExchange for Hadoop User Guide for PowerCenter

10.5.6
- 10.5.7
- 10.5.4
- 10.5
- 10.4.0

Back Next

Understanding Hadoop

Hadoop provides a framework for distributed processing of large data sets across multiple computers. It depends on applications rather than hardware for high availability.

Hadoop applications use HDFS as the primary storage system. HDFS replicates data blocks and distributes them across nodes in a cluster.

Hive is a data warehouse system for Hadoop. You can use Hive to add structure to datasets stored in file systems that are compatible with Hadoop.

Understanding PowerExchange for Hadoop

Download Guide

Watch

Comments

Communities

Knowledge Base

Success Portal

0 COMMENTS

We’d like to hear from you! Log in to comment.

Rename Saved Search

Table of Contents

PowerExchange for Hadoop User Guide for PowerCenter

PowerExchange for Hadoop User Guide for PowerCenter

Understanding Hadoop

Understanding Hadoop