A n O v e r v i e w of M u l t i T e x t Charles L. A. Clarke 2 Gordon V. Cormack1 Christopher R. Palmer ] 1 Department of Computer Science, University of Waterloo, Canada 2 Department of Electrical and Computer Engineering, University of Toronto, Canada mt ©plg. uwat erloo, ca h t t p ://mult itext, uwaterloo, ca The research focus of the MultiText project is the development and prototyping of scalable technologies for distributed information retrieval systems. The MultiText system is based on the network-of-workstations architecture shown in figure 1. The system is composed of several major components: The index engines maintain the index file structures and provide search capabilities. The text servers are specialized by document type and provide retrieval capabilities for arbitrary text passages specified at the word level. Finally, the marshaller/dispatcher interacts with clients and coordinates query and update activities. Research issues are addressed in the context of this distributed architecture. Issues of concern to the MultiText Project include data distribution, load balancing, fast update, compression, fault tolerance, document structure, relevance ranking, and user interaction. Support for document structure is a particular feature of the MultiText system.
/lp/association-for-computing-machinery/an-overview-of-multitext-yTCY8T19s0