Hadoop has, for the most part, moved beyond the proof-of-concept phase and the initial chasm of adoption. More and more organizations are putting the open-source framework to work on mountains of complex Big Data. The next step in Hadoop’s evolution is getting a handle on governance. To that end, Hortonworks—the enterprise data platform provider and … continue reading
Two more projects are graduating from the Apache Software Foundation’s (ASF) incubator this week. The organization has announced that both Apache BookKeeper and Apache Samza have become Top-Level Projects (TLPs). Apache Samza is an open-source Big Data distributed streaming processing framework designed to handle stateful processing, message durability, fault tolerance and scalability. Samza was originally … continue reading
iRobot is starting its own venture capital firm in order to invest in robotic companies, TechCrunch has reported. The company is in the midst of looking for a West Coast investor to head up its firm and is looking to make five to 10 investments a year, according to TechCrunch. “There’s a lot going on … continue reading
The Apache Software Foundation has announced that Apache Falcon has graduated from the Apache Incubator to a Top-Level Project. Falcon is a data processing and management solution for Apache Hadoop with a focus on data motion, data discovery, coordination of data pipelines, and life-cycle management. “Apache Falcon solves a very important and critical problem in … continue reading
The Apache Software Foundation has announced Apache Drill as a Top-Level Project. According to the Foundation, Apache Drill is a schema-free SQL query engine for Hadoop and NoSQL. By removing the constraint of building and maintaining schemas before data can be analyzed, Drill users can run interactive ANSI SQL queries on complex or constantly evolving … continue reading
The National Security Agency has released the first in a series of software products by its Technology Transfer Program (TTP) to the open-source community. Niagarafiles, also known as NiFi, automates high-volume data flows among computer networks, even if data formats and protocols differ. The technology “provides a way to prioritize data flows more effectively and … continue reading
After four years of development, the Apache Software Foundation (ASF) has announced version 3.1 of its open-source Java framework for object-relational mapping (ORM), persistence and caching: Apache Cayenne. “With the launch of version 3.1, Apache Cayenne has continued to evolve its mature 12 year-old library by introducing 125 new features,” said Andrus Adamchik, vice president … continue reading