Confluent this week introduced its first commercial product, Confluent Control Center, as part of the newly released Confluent Platform 3.0 and Apache Kafka 0.10.0. The combined package is aimed at operationalizing Kafka-based streaming applications and near real-time data processing efforts. Neha Narkhede, cofounder and CTO of Confluent (and one of the creators of Kafka), said, … continue reading
GitLab is strongly recommending that users upgrade to the newest release of GitLab 8.2 through 8.7, for both Community Edition (CE) and Enterprise Edition (EE), because these releases contain security fixes. One of the fixes addresses a critical privilege escalation. GitLab said that during an internal code review, it discovered a critical security flaw … continue reading
MapR, a converged data platform, and Pluralsight, an online learning platform, both announced new curriculum offerings for developers who want to learn new skills in either Apache Kafka or Java. MapR is now offering stream processing training on MapR Academy’s free On-Demand Training program. This is designed to teach developers how they can extend their … continue reading
The Apache Apex project has been promoted from the Apache Incubator to a top-level project as of today. This open-source stream- and batch-processing platform works with YARN and HDFS, runs in memory, and can handle event processing and fault tolerance. Apex started out as the real-time streaming core of DataTorrent. The company contributed its platform … continue reading
Apple has created a new section of its App Store for developers, where they can share how they succeeded on the store and show other developers what they learned in the process. On Developer Insights, the section of the App Store designed for developers, there is a planning section that helps developers plan and … continue reading
The Apache Software Foundation today announced the general availability of Apache Fortress Project version 1.0. Fortress provides Java users with standards-based access management through a Java SDK, a security plug-in for Tomcat, REST wrappers for APIs, and all the relevant Web pages for such a system. Apache Fortress sprang out of the larger Apache Directory Project, … continue reading
After more than three years of development, the Apache PDFBox team has announced the release of Apache PDFBox 2.0.0. The Apache PDFBox library is an open-source Java tool for working with PDF documents. The project allows creation and manipulation of PDF documents, and the ability to extract content from them. Apache PDFBox also includes several … continue reading
Hortonworks, one of the three major Hadoop vendors, announced yesterday that it has been collaborating with HP to improve Apache Spark. The work has already yielded faster sort and in-memory computation for the project, as well as improved performance and usage for scalability. Hortonworks also announced the inclusion of Apache Kafka and Storm in its … continue reading
The database revolution happened a few years back, when NoSQL options stormed the world. Today, however, a second wave of innovation in data storage has been unleashed in the form of Apache Arrow. Arrow builds a standard for columnar in-memory analytics, and will provide a unified data structure, algorithms and cross-language bindings. The overall goal … continue reading
After the December announcement made by Mozilla’s senior vice president of connected devices, Ari Jaaksi, Mozilla shared its decisions about Firefox OS, with changes to Marketplace, foxfooding, and the product innovation process. Mozilla will be focusing on exploring new product innovations in Internet of Things, according to an e-mail that was shared to the Firefox … continue reading
Databricks has introduced Spark Datasets, an extension of the DataFrames API that provides a type-safe, object-oriented programming interface. Spark 1.6 includes an API preview of Datasets, and they will be a development focus for the next several versions of Spark. Databricks was founded out of UC Berkeley’s AMPLab by the creators of Apache Spark, and … continue reading
Apache Spark 1.6, which shipped yesterday, offers performance enhancements that range from faster processing of the Parquet data format to better overall performance for streaming state management. As a large-scale data processing platform, Apache Spark has untethered itself from the Hadoop platform. As a result, Spark can be used against key-value stores and other types … continue reading