Topic: parquet

SD Times news digest: Cisco to acquire Perspica, Google’s Data Loss Prevention API, and Google Play Console updates

Cisco has announced its intent to acquire Perspica to support and accelerate its vision for AppDynamics, which Cisco purchased earlier this year. The company hopes that adding Perspica will let customers use more machine learning capabilities to analyze large amounts of data. “With the addition of … continue reading

Apache Kudu becomes top-level project

The Apache Kudu Project is, as of today, a top-level project within the Apache Software Foundation. Originally contributed by Cloudera, the project is an effort to build a fast, efficient analytics platform for rapidly changing data, such as streams. In practice, Kudu is a columnar storage manager for Hadoop. The system is … continue reading

Spark 1.6 is released

Apache Spark 1.6, which shipped yesterday, offers performance enhancements that range from faster processing of the Parquet data format to better overall performance for streaming state management. As a large-scale data processing platform, Apache Spark has untethered itself from the Hadoop platform. As a result, Spark can be used against key-value stores and other types … continue reading
