Greenplum Database 4.2
Greenplum Database 4.2 includes language and compatibility enhancements for fast migrations to Greenplum; an extension framework and turnkey in-database analytics; and targeted performance optimization. It enables high-performance parallel import and export of all data (compressed and uncompressed) from Hadoop using gNet for Hadoop, a parallel communications transport. Features include advanced integration with EMC Data Domain deduplication storage systems via EMC Data Domain Boost. This integration distributes parts of the deduplication process to Greenplum database servers, enabling them to send only unique data to the Data Domain system, thus, dramatically increasing aggregate throughput, reducing the amount of data transferred over the network, and eliminating the need to create and manage virtual drives.