Syncsort, a global leader in high-performance data integration solutions, today announced a milestone contribution in its ongoing commitment to the open source community, with a new feature that strengthens Apache Hadoop’s Big Data integration & ETL capabilities.
The new feature is now committed to Apache Hadoop 2.0.3-alpha and has received broad-based support from leading Hadoop organizations. The key improvement is a new feature that allows external sort implementations within the Hadoop MapReduce framework, helping organizations to accelerate development, build complex ETL flows and MapReduce jobs without coding and seamlessly optimize Hadoop. The patch also simplifies use cases that are currently challenging in MapReduce so they can be implemented faster and more efficiently.
“Hadoop is a rapidly evolving ecosystem that is emerging as the operating system for Big Data,” said Josh Rogers, senior vice president, data integration business, Syncsort. “Our focus is to help build out Hadoop’s data integration & ETL capabilities, removing barriers that undermine its potential and helping organizations ramp-up their Big Data initiatives.”
Syncsort has worked with the Apache Hadoop community on enhancements and fixes and will continue to collaborate on future projects. The additional flexibility provided by the new feature will help the emerging ecosystem as well as current Hadoop users tackle a broader set of use cases for Big Data analytics. In addition, Syncsort will leverage the feature by delivering a pluggable version of its leading high-performance sort solution, DMExpress this spring, which is currently in beta test with select customers.
At the O’Reilly Strata conference in Santa Clara, California this week, in booth #900, Syncsort will highlight how the feature helps Hadoop MapReduce users. Syncsort will also demonstrate how DMExpress can help organizations have the most current, accurate data available for business analysis, while reducing the cost and complexity of processing increasingly large amounts of data.
For more information about the feature, read our blog at http://blog.syncsort.com/. To learn how Syncsort helps organizations reduce data integration TCO and complexity, visit http://bit.ly/12HrJZu to get our white paper.