Topic: hadoop

The building blocks of SQL

SQL is the prime building block of the modern enterprise. All those exciting applications, nifty mobile apps and massive back-end projects are, essentially, useless without the data behind them. That data may not be so important at runtime if the application is just saving logs or form information, but at the end of the day, … continue reading

Security takes center stage at Strata conference

The Strata and Hadoop World Conference in San Jose kicked off this morning with tutorial sessions covering all manner of Apache projects and Big Data challenges. While the attendees learned about techniques, the vendors were busy announcing and detailing their latest and greatest Big Data software products. Security was a popular topic as Hadoop implementations … continue reading

RSA conference has machine-learning, cloud-based solutions on display

It’s no secret that computer security is a difficult area of expertise. At the annual RSA Conference in San Francisco this week, attendees were treated to a host of solutions to solve their security woes. As usual, however, software development is not the focus. Robert Griffin, chief security architect at RSA, agreed that developing secure … continue reading

Hortonworks collaborates with HP to improve Spark

Hortonworks, one of the three major Hadoop vendors, announced yesterday that it has been collaborating with HP to improve Apache Spark. The work has already yielded faster sort and in-memory computation for the project, as well as improved performance and usage for scalability. Hortonworks also announced the inclusion of Apache Kafka and Storm in its … continue reading

Guest View: The cool kids all use polyglot persistence, it’s time your enterprise does, too

If you haven’t heard of polyglot persistence before, simply put, it means using different database technologies to handle specific needs. The term was derived from something else you may have heard: polyglot programming, which expresses the idea that applications should be written in a mix of languages to take advantage of the fact that different … continue reading

Yahoo brings deep learning framework to Spark

Yahoo has released CaffeOnSpark, which brings the fruits of two University of California, Berkeley projects together: vision-focused deep learning framework Caffe, and Big Data processing engine Apache Spark. Without the aid of Spark, Caffe can process up to 60 million images per day. Those numbers come from benchmarks on a single NVIDIA GPU, so the … continue reading

Spark Summit East: News from Databricks, IBM, MapR, MemSQL, SAP and Wikibon

Wikibon, a community formed to help solve technology problems, has released what it says is the first-ever Spark forecast that shows how Spark is changing the industry. “Our report is the first to show how Spark is remaking the marketplace, presenting not just numbers of users, but also data about what vendors are building and … continue reading

Alphabet passes Apple, researchers create auto-bug-repair system, and Syncfusion’s Big Data Platform released—SD Times news digest: Feb. 2, 2016

Shares of Alphabet, Google’s holding company, opened nearly three percent higher today, pushing it past Apple. This makes Alphabet the most valuable public company, according to a report by CNBC. Alphabet has a market cap of US$547.1 billion, which is higher than Apple’s $529.3 billion. The last time Google was more valuable than Apple was … continue reading

Is Spark replacing Hadoop?

The Apache Hadoop project took off in enterprises over a fairly short period of time. Four or five years ago, Hadoop was just becoming a “thing” for enterprise data processing and experimentation. MapReduce was at the heart of that thing, and Spark was still only a research project at the University of California at Berkeley. … continue reading

Forecast 2016: Predictions from around the industry

To paraphrase that great thinker, Ferris Bueller: “Technology moves pretty fast. It you don’t look around once in a while, you could miss it.” So, to get 2016  rolling, we’ve asked luminaries and thought leaders in the software development space to look around and tell us what they expect from the field this year. Kelly … continue reading

Spark 1.6

Spark 1.6 is released

Apache Spark 1.6, which shipped yesterday, offers performance enhancements that range from faster processing of the Parquet data format to better overall performance for streaming state management. As a large-scale data processing platform, Apache Spark has untethered itself from the Hadoop platform. As a result, Spark can be used against key-value stores and other types … continue reading

MapR introduces publish/subscribe streaming for Hadoop

While many in the Big Data space are talking about stream processing, MapR today announced the availability of Streams, a new product in its Hadoop stack that can be used to stream events across clusters distributed around the world. The new product offers a publish-and-subscribe model for event-driven data access and decision-making. While MapR Streams … continue reading

DMCA.com Protection Status