I'm not sure if I have a satisfactory answer for you. I have always thought of this as simply a form of generalizations. The world we live in is a metric world and many things are covered by school math. We measure distances, open sets are simply spaces like intervals without the boundary, all is somehow dense and uncountable and so on. Not that there aren't enough weird topological facts in it despite of that, e.g. Cantor sets. However, if we start to state a topological theorem, it is for mathematicians somehow self evident to ask whether all this metric induced properties are really needed. Mathematicians love counterexamples and exceptions. So it is only natural to make the least instead of the most convenient assumptions when proving a theorem - and it's expanding its usage!
Since any union of open sets is open again but only a finite intersection is and vice versa for closed sets, and compactness is defined by the coverage with open sets, it's natural to speak of different countables, i.e. how many open sets it takes for something.
The same for separation properties. With a metric it's easy to see whether something is disjoint or not. Just measure it. But without a metric? What do we actually have to separate? Points, sets, which sets, points from sets and so on. Therefore the distinction between the various separation axioms is needed. It condenses theorems to a list of conditions which are really needed to proof a statement and therefore defines its applicability. When you deal with special topologies and all of a sudden you find out that you have only a ##T_1## space, you might get nervous and need to have a closer look on what you might take for granted and what not. Hausdorff is sometimes already a luxury.