Intel has released beta open source software, called GraphBuilder, to help data scientists in industry and academia to rapidly develop new applications that draw insights from Big Data. Developed by Intel Labs, GraphBuilder is the first scalable open source library to take large data sets and construct them into “Graphs,” web-like structures that outline relationships among data.
GraphBuilder not only constructs large-scale graphs fast but also offloads many of the complexities of graph construction, including graph formation, cleaning, compression, partitioning and serialization. This makes it easy for just about anyone to build graphs for interesting research and commercial applications. In fact, GraphBuilder makes it possible for a Java programmer to build an Internet-scale graph for PageRank in about 100 lines of code and a Wikipedia-sized graph for LDA in about 130.
GraphBuilder is available for download at https://01.org/graphbuilder under Apache 2 license.
For additional details, visit the GraphBuilder blog at: http://blogs.intel.com/intellabs/2012/12/06/graphbuilder/
GraphBuilder whitepaper: “GraphBuilder – A Scalable Graph Construction Library for ApacheHadoop” https://01.org/graphbuilder/sites/default/files/documentation/graphbuilder-whitepaper.pdf