Informatica and MapR Technologies are working together to boost the performance of moving and parsing data in Hadoop. As part of the initiative, a community edition of Informatica’s HParser will be available as a component of the MapR distribution.
HParser provides out-of-the-box Hadoop parsing capabilities for data sources, such as logs, call data records, industry standards, documents and binary or hierarchical data.
The joint solution will be able to “move many terabytes per hour of transaction, interaction and streaming data into MapR, as well as parse and process a broad range of structured and unstructured data natively in Hadoop, all without coding.”
MapR has always focused on performance, with a Hadoop distribution that replaces the traditional HDFS storage layer with its own file system, which can be accessed over the Unix-origin Network File System (NFS) protocol. MapR’s lockless file storage layer also aligns well with Informatica’s messaging technology for streaming large amounts of data into Hadoop.
Informatica is also working with Cloudera, EMC’s Greenplum unit and Teradata’s Aster Data to provide integration with those companies’ big data offerings.