Rya: a scalable RDF triple store for the clouds

Roshan Punnoose, Proteus Technologies, roshanp@gmail.com
Adina Crainiceanu, US Naval Academy, adina@usna.edu
David Rapp, Laboratory for Telecommunication Sciences, rapp@ltsnet.net

ABSTRACT

Resource Description Framework (RDF) was designed with the initial goal of developing metadata for the Internet. While the Internet is a conglomeration of many interconnected networks and computers, most of today's best RDF storage solutions are confined to a single node. Working on a single node has significant scalability issues, especially considering the magnitude of modern-day data. In this paper we introduce a scalable RDF data management system that uses Accumulo, a Google Bigtable variant. We introduce storage methods, indexing schemes, and query processing techniques that scale to billions of triples across multiple nodes, while providing fast and easy access to the data through conventional query mechanisms such as SPARQL. Our performance evaluation shows that in most cases, our system outperforms existing distributed RDF solutions, even systems much more complex than ours.

Categories and Subject Descriptors: H.3.2 Information Storage; H.3.3 Information Search and Retrieval; H.3.4 Systems and Software - Distributed Systems; H.2.4 Systems - Distributed Databases, Query Processing

General Terms: Algorithms, Management, Performance.

Keywords: RDF triple store, distributed, scalable.