Well, you wouldn't think it would be purely combinatorial if the result is a continuous distribution.
If you want to know where the normal distribution comes from, intuitively speaking, my answer is that it's like an exponentiated version of the old summation formula, 1 + 2 + 3 + ...+n = n(n+1)/2.
Notice that if n is large, asymptotically, this is just n^2/2, approximately, so you can start to see what I mean by an exponentiated version of this.
The idea is to find the peak of the binomial probability mass function and then do some kind of approximation around that point, using the relationship between binomial coefficients (which can be seen combinatorially). For an explanation of how this works, along with other nice stuff about the CLT, see here:
http://stats.stackexchange.com/ques...nation-is-there-for-the-central-limit-theorem
When I first encountered the CLT, I thought there should be some sort of graphical proof, but, as the post points out, the exact shape of the curve seems mysterious without at least a small amount of calculation. However, if you have a good intuition for convolution (when adding two independent rv's, the density function of the sum is given by convolution), such as the intuition I gained by taking a signal processing class, you can can actually picture something like the normal distribution forming through purely visual reasoning, although, again, the exact shape of the curve remains mysterious. A nice feature of this argument is that it's vaguely suggestive of the rigorous proof which runs along similar lines, using convolution, but in the frequency domain (using characteristic functions/Fourier transforms).
It also seems possible to arrive at the normal distribution through some physical reasoning, along with a little optimization problem (entropy maximization). This isn't at the forefront of my consciousness, so it would take me a while to dig it back up (well, to be honest, I never fully figured it out in the first place), so I'll point you to where I got this idea (Susskind lectures in statistical mechanics):
You might have to be sort of astute to catch what I'm talking about, though.
If you want the most general version, with the fewest assumptions (notably, the identically distributed hypothesis can be weakened), look up the Lindeberg CLT. It's not a simple proof, though.