Apache DolphinScheduler, the distributed Big Data visual workflow scheduler system, is the latest project to gain Top-Level Project status at the Apache Software Foundation (ASF). This means that the project’s community and products have been well-governed under the Apache Software Foundation’s meritocratic process and principles. The scheduler was first created in December 2017 at Analysys … continue reading
The Apache Software Foundation (ASF) announced that Apache Gobblin, the open-source distributed Big Data integration framework, has reached top-level project status. According to the foundation, achieving top-level status means that the project graduated from the Apache Incubator and has demonstrated that it’s community and products have been well-governed under the ASF’s meritocratic process and principles. … continue reading
Swap Detector is an open-source checker that detects API usage errors. The project was released this week by GrammaTech, and originally created by The Department of Homeland Security, Science and Technology Directorate, and Static Tool Analysis Modernization Project. “Traditional static-analysis techniques do not take advantage of the vast wealth of information on what represents error-free … continue reading
JetBrains announced that Big Data Tools is now available as EAP for DataGrip and PyCharm Professional. The news aims to address problems that involve both code and data. The company first announced plans to support more big data tools last year when it announced a preview of the IntelliJ IDEA Ultimate plugin with Apache Zeppelin … continue reading
The Apache Software Foundation announced that Apache ShardingSphere, a distributed Big Data middleware ecosystem, has now graduated to a Top-Level Project. “Since entering the Apache Incubator, ShardingSphere has evolved from a JDBC driver for sharding into a distributed ecosystem,” said Liang Zhang, the vice president of Apache ShardingSphere. The Apache ShardingSphere ecosystem has 3 sub-projects … continue reading
The Apache Software Foundation (ASF) graduated Rya, a scalable open-source Big Data database, to its list of Top-Level Projects (TLP). The project was submitted to the Apache Incubator in September 2015. The database is triple store (subject-predicate-object) database, and is capable of storing billions of linked information sets by using novel storage methods, indexing schemes … continue reading
HPCC Systems (High Performance Computing Cluster), a dba of LexisNexis Risk Solutions, is an open-source big-data computing platform. Flavio Villanustre, vice president technology and CISO at LexisNexis Risk Solutions, explained HPCC Systems’s evolution came as a necessity. “In 2000 we were getting into data analytics, using the platforms, databases, and data integration tools that were … continue reading
Roughly half of testing time is wasted on test data management activities, which prompted time-saving solutions like service virtualization to gain traction. One company that’s pointing its virtualization tools toward test data generation and comprehensive API test automation is Parasoft. The company this week announced the latest releases of its SOAtest and Virtualize tools, adding … continue reading
Databricks announced a massive Series E round of funding for $250 million, bringing the company’s total amount raised to $489.5 million. According to the company, this could not be possible without its Unified Analytics Platform. The platform is designed to help enterprises build data pipelines across siloed data storage systems and make data science on … continue reading
The Cloudera and Hortonworks merger that was first announced in October officially completed this month, paving the way for a new Cloudera. As part of the merger, the former rivals will live under the Cloudera name and offer an enterprise data cloud capable of supporting hybrid and multi-cloud deployments as well as provide machine learning … continue reading
Pepperdata has announced that it is now offering a free version of its Application Spotlight APM solution. The free version can be used in a single cluster of up to 20 nodes. According to Pepperdata, the solution enables developers to quickly identify and solve issues. “This free offering gives developers a powerful introduction to the … continue reading