Design of a Parallel and Distributed Web Search Engine

2004-07-21
Salvatore Orlando (1), Raffaele Perego (2), Fabrizio Silvestri (1 and 3) ((1) Dipartimento di Informatica, Università di Venezia - Mestre, Italy, (2) Istituto di Scienza e Tecnologia per l'Informazione (A. Faedo) - Pisa, Italy, (3) Dipartimento di Informatica, Università di Pisa, Italy)

Abstract

This paper describes the architecture of MOSE (My Own Search Engine), a scalable parallel and distributed engine for searching the web. MOSE was specifically designed to efficiently exploit affordable parallel architectures, such as clusters of workstations. Its modular and scalable architecture can easily be tuned to fulfill the bandwidth requirements of the application at hand. MOSE exploits both task-parallel and data-parallel approaches to increase throughput and to use communication, storage, and computational resources efficiently. We used a collection of HTML documents as a benchmark and conducted preliminary experiments on a cluster of three SMP Linux PCs.
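
To make the data-parallel idea concrete, the following is a minimal sketch of one common way to organize it: the document collection is split into partitions, each partition has its own small inverted index, and a broker sends the query to every partition and merges the local top-k lists. It is not MOSE's code. The function names (`build_index`, `search_partition`, `broker_search`), the term-frequency scoring, and the use of threads in place of cluster nodes are all assumptions made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
import heapq


def build_index(docs):
    """Build a tiny inverted index {term: Counter({doc_id: tf})} for one partition."""
    index = {}
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index.setdefault(term, Counter())[doc_id] += 1
    return index


def search_partition(index, query, k):
    """Score one partition's documents by summed term frequency; return local top-k."""
    scores = Counter()
    for term in query.lower().split():
        scores.update(index.get(term, {}))
    # Tuples are (score, doc_id) so that nlargest orders by score first.
    return heapq.nlargest(k, ((s, d) for d, s in scores.items()))


def broker_search(partitions, query, k=3):
    """Send the query to every partition in parallel, then merge the local results."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        local = list(pool.map(lambda idx: search_partition(idx, query, k), partitions))
    merged = heapq.nlargest(k, (hit for hits in local for hit in hits))
    return [(doc_id, score) for score, doc_id in merged]


# Hypothetical two-partition collection (threads stand in for cluster nodes).
partitions = [
    build_index({"a1": "parallel web search engine", "a2": "cooking pasta"}),
    build_index({"b1": "distributed search search engine", "b2": "web crawler"}),
]
print(broker_search(partitions, "search engine", k=2))
```

Because each partition returns only its own top-k hits, the broker merges at most k results per partition, which keeps the communication cost independent of the partition sizes.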

URL

https://arxiv.org/abs/cs/0407053

PDF

https://arxiv.org/pdf/cs/0407053

