NoSQL: The Rise of Graph Databases

By Dick Weisinger

Graph databases manage maps of interconnections between objects. With the rise of Social Media, Graph databases are increasingly being used because they are a natural fit for modeling things like interconnected web links, recommendations, tags, and friend and contact relationships. Traditional databases just aren’t able to store this kind of information easily in tables and rows of data.

Core standard relational databases are the concepts of entities and relationships. In graph databases, the corresponding concepts are nodes and edges. A node holds the property and data for an object. Diagrams which show the relationships maintained in graph databases resemble the diagrams used in object-oriented programming, and because of the similarity, graph databases easily can model the relationships between the objects used when programming an application.

Graph database vendors include Neo Technology (Neo4j), Objectivity (InfiniteGraph), and Sparsity (dex*). But there are many other less well known options too. Neo has recently raised another $11 million in funding.

Speed is another big reason graph databases are gaining some traction. Emil Eifrem, CEO of Graph Database company Neo, reported on Mashable some tests that he ran which compared the speed of relational to graph databases. He created a “friends of friends” query and found that when the query of relationships went three levels deep that the graph database beat the relational one by a factor of 150, and when the query depth was increased to four the graph database bested the relational one by a factor of 1000.

Philip Howard, analyst at Bloor Research, told InfoWorld that “Graph databases are critical when the degree of separation [ie, I know x who knows y who is related to z who used to live in the same house as w etc.] between entities becomes too great to handle using conventional technology. Oracle or DB2, for example, can reasonably handle up to three degrees of separation but not the six or seven degrees you need in, say, telco fraud.”

Howard commented on the limitations of graph databases, saying that “The major limitation is that while these are technically NoSQL databases, in practice they cannot be implemented across a low-cost cluster (at least not a present) but have to run on a single machine, the reason being that performance degrades rapidly across a network. Another potential drawback is that either you have to write your own queries using Java or whatever — which means employing expensive programmers — or you use SparcQL or one of the other query languages that have been developed to support graph databases, but this means learning a new skill.”

December 11th, 2012

Category: NoSQL

Leave a Reply Cancel reply

Legal Terms & Disclaimers

This blog site is accessed from the website of Formtek, Inc. All visitors to or users of this blog site are subject to the terms and conditions and privacy policy that govern the Formtek website, links for which are provided above.

Some of the individuals posting to this blog site, including the moderators, work for Formtek. Postings by these individuals are the personal opinions of these individuals, not of Formtek. Their posted content is provided for informational purposes only and is not meant to be an endorsement or representation by Formtek or any other party. Postings to this blog site may be outdated, invalid or inaccurate by the time you read them. Individuals posting to this blog site make no statements, representations or warranties as to the timing, validity, accuracy or reliability of their postings.

This blog site may contain links to third party sites. Access to any third party site linked to this blog site is at your own risk. None of Formtek, the blog site moderator(s) and the individuals posting on this blog site that work for Formtek is responsible for the timing, validity, accuracy or reliability of any information, data, opinions, advice or statements made on these third party sites. These links are provided merely as a convenience and do not imply any endorsement.

Postings to this blog site are available to the public. You should not post, link to or otherwise upload any information considered confidential to this blog site. All postings to this blog site are moderated. Postings will appear if and when they are approved by the moderator. Notwithstanding any approval by the moderator, by posting information to this blog site, you agree to be solely responsible for the information you post, link to, or otherwise upload to the blog site. You agree to release Formtek from any liability related to that information or to your use of the blog site. You grant Formtek a worldwide, perpetual, irrevocable, royalty-free, fully-paid, and transferable (including rights to sublicense) right to exercise all copyright, publicity, and moral rights with respect to any information you post, link to or otherwise upload to this blog site.

NoSQL: The Rise of Graph Databases

Leave a Reply Cancel reply

Company

Products and Services

News

Resources

Legal Terms & Disclaimers