The VLDB Journal (2013) 22:1–2
Special issue on best papers of VLDB 2011
Wolfgang Lehner · Sunita Sarawagi
Published online: 4 December 2012
© Springer-Verlag Berlin Heidelberg 2012
The VLDB conference is one of the most renowned venues
for presenting advances in the research and practice of data
management. The VLDB 2011 conference took place from
August 29 to September 3 in Seattle (WA, USA) with more
than 750 participants.
Out of 553 submissions to the research track, 100 papers
got accepted for presentation at the VLDB 2011 conference.
Of these, the following ﬁve papers were invited to deliver
an extended version to be published in this special issue of
– Efﬁciently Adapting Graphical Models for Selectivity
by Kostas Tzoumas (TU Berlin), Amol Deshpande,
Christian S. Jensen.
– RemusDB: Transparent High Availability for Database
by Umar Farooq Minhas (University of Waterloo),
Shriram Rajagopalan, Brendan Cully, Ashraf Aboulnaga,
Kenneth Salem, Andrew Warﬁeld.
– OXPATH: A Language for Scalable Data Extraction,
Automation, and Crawling on the Deep Web.
by Tim Furche (Oxford University), Georg Gottlob,
Giovanni Grasso, Christian Schallhart, Andrew Sellers.
– Automating the Database Schema Evolution Process.
by Carlo A Curino (Microsoft), Hyun J Moon, Alin
Deutsch, Carlo Zaniolo.
W. Lehner (
Technische Universität Dresden, Nöthnitzer Str. 46,
01187 Dresden, Germany
Computer Science and Engineering, IIT Bombay, Kanwal Rekhi
Building, Powai, Mumbai 400076, India
– Keyword Search on Form Results.
by Aditya Ramesh, S. Sudarshan (IIT Bombay), Purva
Joshi, Manisha Naik Gaonkar
The selection covers a wide spectrum of database system-
related topics, ranging from query optimization to informa-
tion retrieval and model management. Also, the papers come
from research groups all over the world: Redmond, Waterloo,
Oxford, Berlin, and Bombay; a truly international mix! All
contributions are signiﬁcantly extended and improved with
respect to the original conference version.
We hope you enjoy these “Best of VLDB 2011” papers—
may the papers spark novel research ideas!!!
The ﬁrst paper “Efficiently Adapting Graphical
Models for Selectivity Estimation” of this special
issue focuses on a classical problem within database query
optimization: how to increase the accuracy of a cost model?
Since traditional query optimization techniques are based on
the independence assumption of individual columns, even
small correlations in the database may results in signiﬁcant
estimation errors and yield a non-optimal plan for a given
query. The paper takes a principled and practical approach to
the problem by relying on joint probability distributions over
pairs of columns within a database. The work also includes
an implementation within the PostgreSQL DBMS. Experi-
ments show a signiﬁcant improvement of query plan quality
(with respect to the query runtime) by a moderate increase
in optimization time.
The second paper on “RemusDB: Transparent High
Availability for Database Systems” presents a novel
approach to achieve high availability (HA) for database
systems using techniques from operating system. The pre-
sented solution is based on Remus, a virtualization layer with
core functionality to offer high availability using efﬁcient
state replication of virtual machine images. With RemusDB,