LexisNexis is a provider of legal, government, business and high-tech information sources. The group is also responsible for HPCC Systems (High Performance Computer Cluster), a massive, open-source parallel-processing computing platform for the world of Big Data. It’s also made its way into this week’s featured GitHub project.
The Big Data tool was open-sourced in 2011 to provide developers a less complicated way to deploy. Since there are various branches of Hadoop, getting it ready to deliver business insights could take months for a team. The platform offers analysis of Big Data by using less code and less nodes. Additionally, it offers a single programming language, platform and architecture for efficient processing, according to its GitHub page.
The architecture of this GitHub project incorporates Thor (the data refinery cluster) and Roxie (the query cluster). The clusters are joined by common middleware components, an external communications layer, client interfaces, and auxiliary components to support monitoring. The HPCC environment can only include Thor clusters or both Thor and Roxie clusters, according to the GitHub page.
A few organizations have used HPCC to help them with their own data problems, including Clemson University, RNET Technologies, North Carolina State University, Florida Atlantic University, and Engauge.
Developers that have a Git repository setup can install and build HPCC. The build process requires additional packages like Node.js and Linux. Developers can check out the full list of prerequisites here.
Top 5 projects trending on GitHub this week
#2. Felony: An open-source PGP keychain built on the modern web with Electron, React and Redux.
#3. Weex: A framework for building mobile cross-platform UI.
#4. Free Programming Books: Learning resources, for free!
As a side note, this is the first time in a hundred weeks FreeCodeCamp is not in the Top 5. Truly the end of an era.