Keyword search over XML data has attracted considerable research effort in the last decade. One of the fundamental research problems is how to efficiently answer a given keyword query with respect to a certain query semantics. We found that the key factor behind the inefficiency of existing methods is that they all suffer heavily from the common-ancestor-repetition problem. In this paper, we propose a novel form of inverted list, namely the IDList; the IDList for keyword $k$ consists of the ordered nodes that directly or indirectly contain $k$. We then show that finding keyword query results based on the smallest lowest common ancestor (SLCA) and exclusive lowest common ancestor (ELCA) semantics can be reduced to the ordered set intersection problem, which has been heavily optimized because of its applications in areas such as information retrieval and database systems. We propose several algorithms that exploit set intersection in different directions, with or without additional indexes. We further propose several hash-search-based algorithms that simplify the operation of finding the common nodes of all involved IDLists. We have conducted an extensive set of experiments using many state-of-the-art algorithms and several large-scale datasets. The results demonstrate that our proposed methods outperform existing methods by up to two orders of magnitude in many cases.
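The reduction described in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm or index layout: it assumes node ids assigned in document order, a plain parent map, and hypothetical helper names (`build_idlists`, `intersect_sorted`, `slca`). Each IDList holds, in sorted order, every node that contains the keyword directly or through a descendant. Intersecting the lists yields the nodes that contain all query keywords, and a smallest lowest common ancestor (SLCA) result is such a node with no child in the same intersection.

```python
from functools import reduce
from typing import Dict, List, Optional, Set

# Hypothetical toy document: node id -> parent id (ids follow document order).
PARENT: Dict[int, Optional[int]] = {1: None, 2: 1, 3: 2, 4: 2, 5: 1, 6: 5, 7: 1}
# Keywords occurring directly in each node (illustrative data only).
KEYWORDS_AT: Dict[int, Set[str]] = {
    3: {"xml"}, 4: {"search"}, 6: {"xml"}, 7: {"search"},
}


def build_idlists(parent: Dict[int, Optional[int]],
                  keywords_at: Dict[int, Set[str]]) -> Dict[str, List[int]]:
    """For each keyword, collect the sorted ids of all nodes that contain it
    directly or indirectly (the node itself plus all of its ancestors)."""
    sets: Dict[str, Set[int]] = {}
    for node, words in keywords_at.items():
        for word in words:
            seen = sets.setdefault(word, set())
            cur: Optional[int] = node
            # Stop early: once a node is present, so are all its ancestors.
            while cur is not None and cur not in seen:
                seen.add(cur)
                cur = parent[cur]
    return {word: sorted(ids) for word, ids in sets.items()}


def intersect_sorted(a: List[int], b: List[int]) -> List[int]:
    """Two-pointer intersection of two ascending id lists."""
    i = j = 0
    out: List[int] = []
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return out


def slca(parent: Dict[int, Optional[int]],
         idlists: Dict[str, List[int]],
         query: List[str]) -> List[int]:
    """Nodes containing every query keyword that have no such descendant."""
    if not query or any(word not in idlists for word in query):
        return []
    common = reduce(intersect_sorted, (idlists[word] for word in query))
    # The common set is closed under ancestors, so a common node has a
    # common descendant exactly when it is the parent of a common node.
    parents_of_common = {parent[v] for v in common}
    return [v for v in common if v not in parents_of_common]


IDLISTS = build_idlists(PARENT, KEYWORDS_AT)
print(slca(PARENT, IDLISTS, ["xml", "search"]))  # prints [2]
```

In this toy tree, nodes 1 and 2 both contain "xml" and "search", but node 1 is the parent of node 2, so only node 2 is reported. The merge-style intersection here is the simplest choice; the abstract notes that the paper also considers intersection in different directions, additional indexes, and hash search, none of which this sketch attempts.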
The VLDB Journal – Springer Journals
Published: May 1, 2013