Microsoft is preparing to offer Hadoop, a Java software framework for data-intensive distributed applications, to Windows Azure customers.
Hadoop offers a massive data store upon which developers can run map/reduce jobs. It also manages clusters and distributed file systems. Microsoft will provide Hadoop within a “few months,” said a Microsoft executive who wished to remain anonymous.
The technology makes it possible for applications to analyze petabytes of both structured and unstructured data. Data is stored in clusters, and applications work on it programmatically.
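To give a rough sense of what a map/reduce job looks like, here is a minimal in-memory sketch of the classic word-count example. It is plain Python and does not use Hadoop's actual API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are made up for illustration, and a real job would run the same phases in parallel across a cluster rather than in a single process.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, the step a framework performs between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values collected for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce over an iterable of text records."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "Big clusters"]))
```

The developer writes only the map and reduce functions. In a framework such as Hadoop, splitting the input, grouping the intermediate pairs and distributing the work across machines are handled by the platform.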
“They are probably seeing Hadoop adoption trending up, and possibly have some large customers demanding it,” said Forrester principal analyst Jeffrey Hammond.
“Microsoft is all about money first; PHP support with IIS and the Web PI initiative were all about numbers and creating platform demand. If Hadoop support helps create platform demand for Azure, why not support it? Easiest way to lead a parade is to find one and get in front of it.”
Microsoft’s map/reduce solution, codenamed “Dryad,” is still a reference architecture and not a production technology.