Transcript of "Graph Databases and the Future of Large-Scale Knowledge Management"

2.
Abstract
Modern day open source and commercial graph databases can store on the
order of 1 billion relationships with some databases reaching the 10 billion
mark. These developments are making the graph database practical for
applications that require large-scale knowledge structures. Moreover, with
the Web of Data standards set forth by the Linked Data community, it is
possible to interlink graph databases across the web into a giant global
knowledge structure. This talk will discuss graph databases, their
underlying data model, their querying mechanisms, and the beneﬁts of the
graph data structure for modeling and analysis.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

8.
Extending our Make Believe World
• Marko is a human and Fluﬀy is a dog.
• Marko and Fluﬀy are good friends.
• Human and dog are a subclass of mammal.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

11.
Extending our Extended Make Believe World
• Marko is a human and Fluﬀy is a dog.
• Marko and Fluﬀy are good friends.
• Human and dog are a subclass of mammal.
• Fluﬀy peed on the carpet.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

20.
The Uniform Resource Locator
• The set of all URLs is the address space of all resources that can be
located and retrieved on the Web. URLs denote where a resource is.
http://markorodriguez.com/index.html
∗ Domain name server (DNS): markorodriguez.com → 216.251.43.6
∗ http:// means GET at port 80,
∗ /index.html means the resource to get at that Internet location.
Web Server
index.html
markorodriguez.com
216.251.43.6
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

21.
The Uniform Resource Name
• The set of all URNs is the address space of all resources within the urn:
namespace.
urn:uuid:bd93def0-8026-11dd-842be54955baa12
urn:issn:0892-3310
urn:doi:10.1016/j.knosys.2008.03.030
• Named resources need not be retrievable through the Web.
• URNs denote what a resource is.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

23.
The “Uniform Resource Graph”
• We can denote where something is, what something is, but how do we
denote how something relates to something else?
• How can we denote what something means, where meaning is determined
by its place within a larger relational structure?
URIs are like words. They denote things in the real or imaginary world.
Linking URIs is like deﬁning words. Similar to how a dictionary deﬁnes
words in terms of other words.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

25.
The Web of Data
• The Web of Data is primarily concerned with URIs. If the World Wide
Web is the web of ﬁles, the Web of Data is the web of data. In other
words, for the World Wide Web, the level of granularity is the retrievable
ﬁle. For the Web of Data, it is the information in that ﬁle. Moreover,
this information is not necessarily contained in a ﬁle. There existence is
predicated on their URI. Their meaning is predicated on their relationship
to other URIs. The web of URIs is the Web of Data.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009

33.
Cultural Diﬀerences that are Leading to a New World of
Large-Scale Knowledge Management
• Relational databases tend to not maintain public access points.
• Relational database users tend to not publish their schemas.
• Web of Data graph databases maintain public access points called
SPARQL end-points.
• Web of Data graph databases tend to reuse and extend public schemas
called ontologies.
Risk Symposium – Santa Fe, New Mexico – April 8, 2009