Data warehousing systems integrate information from operational data sources into a central repository to enable analysis and mining of the integrated information. During the integration process, source data typically undergoes a series of transformations, which may vary from simple algebraic operations or aggregations to complex “data cleansing” procedures. In a warehousing environment, the data lineage problem is that of tracing warehouse data items back to the original source items from which they were derived. We formally define the lineage tracing problem in the presence of general data warehouse transformations, and we present algorithms for lineage tracing in this environment. Our tracing procedures take advantage of known structure or properties of transformations when present, but also work in the absence of such information. Our results can be used as the basis for a lineage tracing tool in a general warehousing setting, and also can guide the design of data warehouses that enable efficient lineage tracing.
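As a rough illustration of the problem statement above, the following Python sketch contrasts tracing with and without knowledge about a transformation. It is not taken from the article: the function names, the sample rows, and the `clean_orders` transformation are all hypothetical, and the "item-wise" property (each input item is processed independently of the others) is an assumption chosen for the example.

```python
from typing import Callable, Hashable, List

Item = Hashable
Transformation = Callable[[List[Item]], List[Item]]


def trace_black_box(inputs: List[Item], output_item: Item) -> List[Item]:
    """With no knowledge of the transformation, conservatively report
    every input item as possibly contributing to the output item."""
    return list(inputs)


def trace_item_wise(
    transform: Transformation, inputs: List[Item], output_item: Item
) -> List[Item]:
    """Assumes (hypothetically) that `transform` handles each input item
    independently. Under that assumption, an input item is in the lineage
    if applying `transform` to it alone reproduces the output item."""
    return [item for item in inputs if output_item in transform([item])]


def clean_orders(rows: List[tuple]) -> List[tuple]:
    """Hypothetical cleansing step: drop rows with a non-positive
    quantity and upper-case the product code."""
    return [(oid, product.upper(), qty) for (oid, product, qty) in rows if qty > 0]


source_rows = [(1, "ab-1", 3), (2, "cd-2", 0), (3, "ef-3", 5)]
warehouse_rows = clean_orders(source_rows)

# Trace one warehouse row back to the source rows it was derived from.
target = (3, "EF-3", 5)
lineage = trace_item_wise(clean_orders, source_rows, target)
fallback = trace_black_box(source_rows, target)
```

In this sketch the item-wise trace narrows the lineage of the target row to a single source row, while the black-box fallback can only return the whole input. That difference is the practical reason a tracing procedure would want to exploit known properties of a transformation when they are available.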
The VLDB Journal – Springer Journals
Published: May 1, 2003