If You Like Simplicity, Which Mathematicians Do, You Can't Avoid Pi
Mindscrape said:
This has probably already been brought up at some point, but does anyone else think it is strange how much pi, the ratio of circumference to diameter, occurs in so much that has nothing to do with circles?
This is really a FAQ, but it's a good question nonetheless.
"Why" questions can be problematic if you are not expecting the kind of explanation likely to be offered by mathematicians--- which brings us to another FAQ, "What is mathematics?". The standard answer is that the definining characteristic of mathematical discourse is the notion of proof, and I don't disagree, but let me toss another idea into the mix. I like to define mathematics as
the art of reliable reasoning about simple situations without getting confused. According to this definition, proof is merely a means to the end of reliable reasoning while avoiding confusion.
If you accept this, it follows that mathematically simple situations will turn up more often in the most successful mathematical theories, which are the ones most likely to appear in undergraduate math curricula. And be it noted: one component of what we call "mathematical genius" is the ability to recognize a simple phenomenon masquerading as an apparently complicated phenomenon.
Given this, the explanations I prefer are information theoretic: we define appropriate notions of complexity of differential equations and then show that the simplest differential equations are those which give rise to familiar trig, hyperbolic trig, exp, and (at the next stage) the best known special functions, such as Legendre, Bessel, elliptic, and hypergeometric functions. But of course we should not be too surprised if \pi turn ups when we study circular trig functions (for example when integrating over the circle). And we can expect circular trig functions to turn up more than hyperbolic trig functions because the circle is compact.
(A principle which is challenged by the importance of the Lorentz group in physics, but in the interests of simplicity, let's ignore that. And one could turn this reasoning around and adduce the fact that by our definitions the ODE \ddot{x} + x = 0 is particularly simple as evidence that our definitions of "complexity" are reasonable, which might cause some to doubt that we are not simply talking in circles--- no pun intended!)
We should recall here the principle that the simplest differential equations tend to model more than one situation. This is the well known "paucity of low dimensional models" phenomenon. Also, this style of reasoning is not limited to differential equations; similar remarks hold for algebraic plane curves.
I note too that probability concerns measures, and when we normalize Lebesque measure on the unit circle, we introduce a factor of 1/\pi, so we shouldn't be terribly surprised that this constant is involved in the computation of various probability problems in the real plane. Indeed, the most efficient way to derive the most often encountered probability distributions (Gaussian, Poisson, and so on) is via the Principle of Maximal Entropy. Many of these involve \pi in some way, if you like because these distributions are expressed using the "usual suspects" such as the exponential function.
Trig functions also arise in higher dimensions because n-dimensional unit spheres contain (n-1)-dimensional unit spheres. Very often in mathematics we are interested in how functions decay with distance from "the origin", and analyzing such behavior leads naturally to
harmonic decompositions in which integrations over spheres play a central role. (This is one way to explain the phenomenon mentioned by Halls, that spheres are everywhere in mathematics.) So here too we should expect \pi to play a role, and it does.
All in all, it is not very surprising that the same special functions, and therefore the same constants associated with special functions, arise all over the mathematical map. It all comes down to the unity of mathematics, and ultimately one way of explaining the happy meeting of algebra, analysis, and geometry is the information theoretic principle that leads us to expect a preference for simple models, of which there are not many, hence these models are familiar friends to undergraduate math students.
If you read German, you can look in the papers of Hurwitz for some famous examples of this kind of reasoning.
As for Euler's formula, this is less mysterious if you know about Cayley-Dickson algebras, where the starting point is factoring a simple partial differential equation. In the simplest nontrivial cases ("elliptic, parabolic, and hyperbolic"), we obtain respectively circular trig, parabolic trig, and hyperbolic trig together with the appropriate "adjoined unit" (not a real number) obeying e^2 = -1, \, e^2 = 0, \, e^2 = 1 respectively, and with isotropy groups SO(2), \, R, \, SO(1,1). Only the first is compact, so again by the principle of analysis which says that compactness tends to be associated with simplicity, we should expect circular trig to be more important than parabolic or hyperbolic trig. And it is.
(The fact that parabolic trig is associated with Galilean relativity and hyperbolic trig with special relativity challenges this notion, but overall, compactness wins, in my estimation.)