Building a Budget-Friendly Linux Cluster: Tips and Tricks from Hak5

  • Thread starter Thread starter Gunthi
  • Start date Start date
  • Tags Tags
    Home Linux
Click For Summary
SUMMARY

This discussion focuses on building a budget-friendly Beowulf cluster using old computers, specifically targeting distributed computing tasks. Participants recommend using Debian Linux, DRBL, and the Sun Grid Engine (SGE) for software configuration. Essential components include a reliable network switch, preferably one with good response time, and a master node setup for booting other nodes via Red Hat. Key software tools mentioned are MPI for parallel processing and LINPACK for linear algebra computations.

PREREQUISITES
  • Familiarity with Beowulf Clusters and distributed computing concepts
  • Basic knowledge of Linux server administration, particularly with Debian or Ubuntu
  • Understanding of network hardware requirements, specifically network switches
  • Experience with MPI (Message Passing Interface) for parallel processing
NEXT STEPS
  • Research how to set up a Beowulf cluster using Debian Linux and DRBL
  • Learn to configure MPI on a Linux cluster for parallel processing
  • Explore tutorials on setting up essential services like DHCP, SSH, and NTP on Linux
  • Investigate low-cost hardware options for building a cluster, including recycled components
USEFUL FOR

This discussion is beneficial for hobbyists, computer enthusiasts, and developers interested in building low-cost computing clusters for distributed tasks, particularly those with limited budgets and older hardware.

Gunthi
Messages
65
Reaction score
1
I have some old computers at home and I want to put them to work by making a cluster, maibe run SETI or other distributed computing related stuff.

I've searched the Internets, found out about Beowulf Clusters, but I can't seem to find any useful tutorial on how to make one.

So I'm calling out for some help.

1.Has anyone here made a cluster?
2.If so, what bibliography/references did you use?
3.What's the best Linux distribution for it?
4.What network hardware do i need?

Thanks in advance!
 
Computer science news on Phys.org
I have helped building a cluster, but mostly the hardware part.

Software-wise there was a master node on one server from which all other servers booted redhat over the network. The master node hosted an NTP server which synchronized time with slave servers, and also hosted a bunch of other server stuff like SSH, FTP, DHCP I think.

The software was running MPI implementation in C or C++, and LINPACK to do linear algebra. But how they configured it I have no idea. But it's definitely on my to do list.

Also the network switch that was used to connect up all the servers was expensive with really good response time. Cheap network switches are slow when it comes to switching lots of short and chatty bursts.
 
what said:
I have helped building a cluster, but mostly the hardware part.

Software-wise there was a master node on one server from which all other servers booted redhat over the network. The master node hosted an NTP server which synchronized time with slave servers, and also hosted a bunch of other server stuff like SSH, FTP, DHCP I think.

The software was running MPI implementation in C or C++, and LINPACK to do linear algebra. But how they configured it I have no idea. But it's definitely on my to do list.

Also the network switch that was used to connect up all the servers was expensive with really good response time. Cheap network switches are slow when it comes to switching lots of short and chatty bursts.

I don't intend to invest a lot of money in this, the pc's are all bellow pentium III so I'm not expecting to build a Deep Blue.

So, regarding the hardware, to connect all the nodes and server, a network switch will do right?

Also, do you know any good books about this?

Thanks ;)
 
Here is a mini guide on making a cluster
http://www.mini-itx.com/projects/cluster/

So, regarding the hardware, to connect all the nodes and server, a network switch will do right?

yes, any cheap 10/100 switch will do
 
what said:
Here is a mini guide on making a cluster
http://www.mini-itx.com/projects/cluster/



yes, any cheap 10/100 switch will do

Thanks for the link ;)

The hardware part is ok but it gets a little bit too advanced for me in the software part...
Do you know tutorials that are a little more accessible?
 
Gunthi said:
Thanks for the link ;)

The hardware part is ok but it gets a little bit too advanced for me in the software part...
Do you know tutorials that are a little more accessible?

I haven't found any more in-depth guide on the net, but there are dozens of books on amazon. But I suppose they assume a more in-depth linux/unix knowledge.

If you are not familiar how to set up a DHCP, SSH, RSH, NFS, NTP, FTP, or Apache on linux it would be a good idea to learn those individually before attempting to actually building the cluster. I would recommend setting up ubuntu server edition on the computers since ubuntu has very good step by step guides.

Secondly, once you have the computers connected and talking to each other, google around on how to setup up MPI on them.

Thirdly, what would you like to run on it? SETI at home doesn't support parallel computing. You could set up SETI on the computers but they would be working on completely independent data sets.
 
Last edited:
what said:
I haven't found any more in-depth guide on the net, but there are dozens of books on amazon. But I suppose they assume a more in-depth linux/unix knowledge.

If you are not familiar how to set up a DHCP, SSH, RSH, NFS, NTP, FTP, or Apache on linux it would be a good idea to learn those individually before attempting to actually building the cluster. I would recommend setting up ubuntu server edition on the computers since ubuntu has very good step by step guides.

Secondly, once you have the computers connected and talking to each other, google around on how to setup up MPI on them.

Ok, I'll look into those.

Thirdly, what would you like to run on it? SETI at home doesn't support parallel computing. You could set up SETI on the computers but they would be working on completely independent data sets.

I didn't know that. Thanks for the help and patience! ;)
 
I don't know how much it would help to reply to a year-old post, but I recently built a homemade Beowulf cluster using Debian Linux, DRBL, and the Sun Grid Engine (SGE). I've documented the hardware and software configurations http://sites.google.com/site/woojay/helmer" .
 
Last edited by a moderator:
Recent example of building a small cluster without spending money on frills

http://revision3.com/hak5

episode 823.

To keep cost down, for example, he didn't buy cases but just used the cardboard boxes that the motherboards came in as the cases.

Sometimes the Hak5 episodes seem like they are just giving hints and perhaps keywords that you will have to go do some digging on your own, or perhaps send them a polite question or two, to figure out how to use that in your own situation.

I'd suggest buying a few Fry's loss leader motherboard/cpu combos that will provide perhaps 100x the performance of the PIII, get a few recycled old cases, power supplies and old low performance pci video cards, leave the sides off, aim cheap house fans at the boards to keep them cool.
 
Last edited by a moderator:

Similar threads

  • · Replies 18 ·
Replies
18
Views
2K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 43 ·
2
Replies
43
Views
5K
Replies
10
Views
4K
Replies
10
Views
5K
  • · Replies 123 ·
5
Replies
123
Views
19K
Replies
6
Views
2K
Replies
46
Views
7K
  • · Replies 36 ·
2
Replies
36
Views
12K
  • · Replies 2 ·
Replies
2
Views
2K