Options
1994
Conference Paper
Titel
Lost in Hyperspace? Free Text Searches in the Web
Abstract
The World Wide Web (WWW) is a distributed hypermedia system for information discovery, retrieval, and collaboration. The hypertext paradigm has proven its usefulness for browsing large, distributed document structures. The ease of use provided by this paradigm is one of the reasons for the great popularity which the World Wide Web has gained through the last months. However, as the amount of information available through the World Wide Web grows, it becomes more and more important to provide additional tools and techniques for finding servers or documents which contain relevant information on a given topic. Bibliographic and literature text searching is a difficult task, the provision of a free text search mechanism can greatly facilitate this task. By supplying a pre-computed index of keywords, a fully indexed server eliminates the need for automatic indexers (such as web robots spiders) to walk the entire server tree, which is an unnecessary waste of resources. Some World Wide Web s ervers already implement keyword searches via an interface to WAISINDEX. However, this approach lacks many important features that free text search engines provide, and does not support remapping of physical directory structures to virtual paths. In this document, we will present the ICE indexing server extension which has recently been developed at the Fraunhofer Institute for Computer Graphics. This freely available software package provides a set of routines which allow for sophisticated free text searches on a World Wide Web archive.
Language
English
Tags