Building a Budget-Friendly Linux Cluster: Tips and Tricks from Hak5

  • Thread starter: Gunthi
  • Tags: Home, Linux

Discussion Overview

The discussion revolves around building a budget-friendly Linux cluster using old computers, with a focus on hardware and software configurations. Participants share their experiences, seek advice on suitable Linux distributions, network hardware, and accessible tutorials, and explore potential applications for the cluster, such as distributed computing projects like SETI.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Homework-related

Main Points Raised

  • One participant inquires about building a cluster and asks for references, the best Linux distribution, and necessary network hardware.
  • Another participant shares their experience with a cluster setup, mentioning the use of a master node, NTP server, and various software tools like MPI and LINPACK, but expresses uncertainty about the configuration details.
  • There is a suggestion that a network switch is sufficient for connecting nodes and servers, with some advocating for a cheap 10/100 switch.
  • Some participants express a need for more accessible tutorials on software setup, noting that existing resources may assume advanced knowledge of Linux/Unix.
  • One participant points out that SETI at Home does not support parallel computing, suggesting that while it can run on the cluster, the computers would work on independent data sets.
  • A later reply mentions a personal experience of building a Beowulf cluster using Debian Linux and provides a link to documentation of their setup.
  • Another participant references a Hak5 episode that discusses building a small cluster on a budget, highlighting cost-saving measures like using cardboard boxes for cases.

Areas of Agreement / Disagreement

Participants generally agree on the feasibility of building a cluster with old computers and the importance of network hardware. However, there are multiple competing views on the best software setup and the appropriateness of using SETI for distributed computing, indicating that the discussion remains unresolved in these areas.

Contextual Notes

Some participants note limitations in their knowledge of specific software configurations and the need for foundational understanding of various Linux services before attempting to build the cluster.

Gunthi
I have some old computers at home and I want to put them to work by making a cluster, maybe to run SETI or other distributed computing projects.

I've searched the Internets, found out about Beowulf Clusters, but I can't seem to find any useful tutorial on how to make one.

So I'm calling out for some help.

1. Has anyone here made a cluster?
2. If so, what bibliography/references did you use?
3. What's the best Linux distribution for it?
4. What network hardware do I need?

Thanks in advance!
 
I have helped build a cluster, but mostly the hardware part.

Software-wise, there was a master node on one server, from which all the other servers booted Red Hat over the network. The master node hosted an NTP server that kept the slave servers' clocks synchronized, and it also hosted a bunch of other services such as SSH, FTP and, I think, DHCP.

The software was an MPI implementation in C or C++, plus LINPACK for linear algebra. I have no idea how they configured it, but finding out is definitely on my to-do list.

Also, the network switch that connected all the servers was an expensive one with a really good response time. Cheap network switches are slow when it comes to switching lots of short, chatty bursts.
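
As a rough illustration of why response time matters for chatty traffic, here is a back-of-the-envelope sketch. It assumes a simple model where each message costs a fixed latency plus its size divided by the link bandwidth. The function names and all the figures are made up for illustration; they are not measurements of any real switch.

```python
def transfer_time(message_bytes, latency_s, bandwidth_bytes_per_s):
    """Estimated time for one message: fixed latency plus time on the wire."""
    return latency_s + message_bytes / bandwidth_bytes_per_s


def total_time(total_bytes, message_bytes, latency_s, bandwidth_bytes_per_s):
    """Time to move total_bytes as a series of messages of message_bytes each.

    The message count is rounded up, and every message is charged at full size.
    """
    n_messages = -(-total_bytes // message_bytes)  # ceiling division
    return n_messages * transfer_time(message_bytes, latency_s, bandwidth_bytes_per_s)


# Hypothetical figures for illustration only.
BANDWIDTH = 100e6 / 8    # a 100 Mbit/s link, in bytes per second
LOW_LATENCY = 20e-6      # 20 microseconds per message
HIGH_LATENCY = 200e-6    # 200 microseconds per message
ONE_MB = 1_000_000

for msg in (100, 100_000):
    fast = total_time(ONE_MB, msg, LOW_LATENCY, BANDWIDTH)
    slow = total_time(ONE_MB, msg, HIGH_LATENCY, BANDWIDTH)
    print(f"{msg:>7} B messages: low latency {fast:.3f} s, high latency {slow:.3f} s")
```

With these made-up numbers, sending 1 MB as 100-byte messages takes roughly seven times longer on the high-latency switch, while sending it as 100 kB messages takes almost the same time on both. Real switches and network stacks are more complicated than this model.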
 
I don't intend to invest a lot of money in this. The PCs are all below Pentium III, so I'm not expecting to build a Deep Blue.

So, regarding the hardware, to connect all the nodes and the server, a network switch will do, right?

Also, do you know any good books about this?

Thanks ;)
 
Here is a mini guide on making a cluster
http://www.mini-itx.com/projects/cluster/

Gunthi said:
So, regarding the hardware, to connect all the nodes and the server, a network switch will do, right?

Yes, any cheap 10/100 switch will do.
 
Thanks for the link ;)

The hardware part is OK, but the software part gets a little too advanced for me...
Do you know of any tutorials that are a little more accessible?
 
I haven't found a more in-depth guide on the net, but there are dozens of books on Amazon. I suppose they assume deeper Linux/Unix knowledge, though.

If you are not familiar with how to set up DHCP, SSH, RSH, NFS, NTP, FTP, or Apache on Linux, it would be a good idea to learn those individually before attempting to build the cluster. I would recommend installing Ubuntu Server Edition on the computers, since Ubuntu has very good step-by-step guides.

Secondly, once you have the computers connected and talking to each other, google around on how to set up MPI on them.
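
One small piece of that setup is telling MPI which machines to use. As a sketch only, assuming the Open MPI hostfile style of one `hostname slots=N` line per machine, a list of nodes could be turned into such a file like this. The node names, slot counts and the `make_hostfile` helper are hypothetical, and other MPI implementations use different formats.

```python
def make_hostfile(nodes):
    """Return hostfile text with one 'hostname slots=N' line per node."""
    lines = []
    for hostname, slots in nodes:
        if slots < 1:
            raise ValueError(f"{hostname}: slots must be at least 1")
        lines.append(f"{hostname} slots={slots}")
    return "\n".join(lines) + "\n"


# Hypothetical node names and the number of processes each should run.
cluster = [("node01", 1), ("node02", 1), ("node03", 2)]
print(make_hostfile(cluster), end="")
```

With Open MPI, the resulting file would typically be passed to the launcher with something like `mpirun --hostfile hosts -np 4 ./program`, where `hosts` and `./program` are placeholder names.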

Thirdly, what would you like to run on it? SETI at home doesn't support parallel computing. You could set up SETI on the computers, but each one would be working on a completely independent data set.
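
To make the "independent data sets" point concrete, here is a toy sketch. The data, the `analyze` function and the use of threads to stand in for separate PCs are all assumptions for illustration; this is not how SETI at home itself is implemented. Each work unit is processed without looking at any other unit.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_work_units(samples, unit_size):
    """Cut the data into independent chunks, one per work unit."""
    return [samples[i:i + unit_size] for i in range(0, len(samples), unit_size)]


def analyze(work_unit):
    """Stand-in analysis: report the strongest sample in this unit.

    It reads only its own chunk, so no unit ever waits on another.
    """
    return max(work_unit)


# Hypothetical signal data; each worker plays the role of one PC.
samples = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]
units = split_into_work_units(samples, unit_size=4)

with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(analyze, units))

print(results)
```

Because the workers never exchange data, this kind of job runs fine on separate PCs without any cluster software. An MPI program is different: its processes pass messages to each other during the computation, which is where the network between the nodes starts to matter.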
 
what said:
I haven't found a more in-depth guide on the net, but there are dozens of books on Amazon. I suppose they assume deeper Linux/Unix knowledge, though.

If you are not familiar with how to set up DHCP, SSH, RSH, NFS, NTP, FTP, or Apache on Linux, it would be a good idea to learn those individually before attempting to build the cluster. I would recommend installing Ubuntu Server Edition on the computers, since Ubuntu has very good step-by-step guides.

Secondly, once you have the computers connected and talking to each other, google around on how to set up MPI on them.

Ok, I'll look into those.

what said:
Thirdly, what would you like to run on it? SETI at home doesn't support parallel computing. You could set up SETI on the computers, but each one would be working on a completely independent data set.

I didn't know that. Thanks for the help and patience! ;)
 
I don't know how much it helps to reply to a year-old post, but I recently built a homemade Beowulf cluster using Debian Linux, DRBL, and the Sun Grid Engine (SGE). I've documented the hardware and software configurations at http://sites.google.com/site/woojay/helmer.
 
Recent example of building a small cluster without spending money on frills

http://revision3.com/hak5, episode 823.

To keep cost down, for example, he didn't buy cases but just used the cardboard boxes that the motherboards came in as the cases.

Hak5 episodes sometimes give only hints and keywords, so you may have to do some digging on your own, or send them a polite question or two, to work out how to apply them to your own situation.

I'd suggest buying a few Fry's loss-leader motherboard/CPU combos, which will provide perhaps 100x the performance of a PIII. Get a few recycled old cases, power supplies and old low-performance PCI video cards, leave the sides off, and aim cheap house fans at the boards to keep them cool.
 