next up previous
Next: About this document


tabular37

Dear committee members:

Many of you are aware that our department's ``compute engine'', Erwin, is starting to show its age. When Erwin was bought, about four years ago, it was quite a fast machine. Computer technology has made a lot of progress since then, and Erwin is now somewhat slower than the average desktop PC. For comparison, a 200 MHz Pentium Pro PC is 1.3 to 2 times as fast as Erwin. In addition, Erwin has begun to have hardware failures. We have seen several mysterious crashes during the last year.

With this in mind, the main item in our HEET request this year is a successor for Erwin. For this purpose we would like to purchase a cluster of PCs (``nodes'') which will form a parallel supercomputer providing our department with over 1 GigaFlop of computing power. The technology behind this cluster was developed at NASA's Center of Excellence in Space Data and Information Sciences (CESDIS) and is currently in use at a number of educational and research institutions. The cluster we propose would increase our department's computing power by a factor of 20 to 30.

This system, called ``Beowulf,'' has several advantages over conventional supercomputers. First, it's cheap: a beowulf cluster at CESDIS has equaled the performance of an IBM SP2 with a comparable number of nodes, at less than one tenth the price. The use of commodity components allows one to shop around for the best price among a number of vendors.

Secondly, these systems are easily upgraded. New, faster, nodes can simply be dropped into place when they become available, and old nodes can be re-used as desktop PCs.

Besides NASA, other institutions with Beowulf clusters include Los Alamos National Lab, Caltech, Drexel University and Clemson University. Similar clusters, not based on Beowulf, are in use at the Max-Planck-Institut, DESY, Purdue, Cornell, Fermilab and the University of Mannheim. The widespread and apparently growing use of PC clusters provides a large base of experience from which we can draw. (See the references at the end of this memo.)

All of the users in our department would immediately be able to take advantage of such a cluster. Even tasks which are not easily parallelizeable would benefit, since each individual node is at least 30% faster than Erwin. Theorists with special interest in this cluster include a number of people who already use PVM (Parallel Virtual Machine), one of the components of Beowulf. Hank Thacker is very interested in these clusters because one of his collaborators at Fermilab has ported the CANOPY package to Beowulf. Hank already has a lot of code written for CANOPY, and would like to be able to run it here in our department. Experimentalists, such as Ralph Minehart, would benefit from the ability to analyze several large data sets in parallel.

We also ask that you consider these further improvements to our computing infrastructure during this HEET cycle:

In the following pages, I list the items requested, along with individual descriptions and cost justifications. The total cost of all items is $75,829.00 .

REFERENCES:




next up previous
Next: About this document

Bryan K. Wright
Wed Jul 9 09:15:44 EDT 1997