Database Theory Column ' An Overview of Semistructured Dat a Dan Suci u AT&T Lab s suciu©aresearch .att .cotn 1 Introductio n Research on semistructured data started from the observation that much of today's electronic dat a does not conform to traditional relational, or object oriented data models . Several applications store their data in non-standard data formats ; legacy systems . structured documents . like HTML or SGML etc . Another instance is the integration of heterogeneous data sources : often thes e sources belong to external organizations . or partners . not under the application's control, and thei r structure is only partially known . and may change without notice . Data in these applications could be modeled as an object-oriented data . but its structure is irregular : some objects may have missing attributes . others may have multiple occurrences of the same attribute . the same attribut e may have different types in different objects, semantically related information may be represente d differently in various objects . Data with these characteristics has been called semistructured data . Recent research has aimed at extending database management techniques to semistructure d data . The result of
/lp/association-for-computing-machinery/an-overview-of-semistructured-data-SPGdN9DQeK