Design of a Parallel and Distributed Web Search Engine

2004-07-21

Salvatore Orlando (1), Raffaele Perego (2), Fabrizio Silvestri (1 and 3) ((1) Dipartimento di Informatica, Università di Venezia - Mestre, Italy, (2) Istituto di Scienza e Tecnologia per l'Informazione (A. Faedo) - Pisa, Italy, (3) Dipartimento di Informatica, Università di Pisa, Italy)

arXiv_CV

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

This paper describes the architecture of MOSE (My Own Search Engine), a scalable parallel and distributed engine for searching the web. MOSE was specifically designed to efficiently exploit affordable parallel architectures, such as clusters of workstations. Its modular and scalable architecture can easily be tuned to fulfill the bandwidth requirements of the application at hand. Both task-parallel and data-parallel approaches are exploited within MOSE in order to increase the throughput and efficiently use communication, storing and computational resources. We used a collection of html documents as a benchmark, and conducted preliminary experiments on a cluster of three SMP Linux PCs.

Design of a Parallel and Distributed Web Search Engine

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments