There's a more intuitive definition. If A and B are curves such that A(0)=B(0)=p, and x is a coordinate system, then we can say that A and B are equivalent if x\circ A and x\circ B have the same derivative at 0. (These are curves in \mathbb R^n, so this doesn't involve differential geometry). Then we define the tangent space at p as the set of equivalence classes of curves that pass through p at parameter value 0. It's easy to show that the equivalence relation doesn't depend on which coordinate system is used in its definition.
That set can be given a vector space structure, again using coordinate systems, and then proving that the definition is independent of the coordinate systems used. If we call that vector space T
pM, and the vector space of derivations D
pM, we can show that the function \phi:T_pM\rightarrow D_pM defined by \phi([A])(f)=(f\circ A)'(0) is a vector space isomorphism. Of course, first we have to prove that it's a well-defined function, by showing that the right-hand side doesn't depend on which member A we choose from the equivalence class [A].
See
Isham for details. (It also has a nice proof of the (non-trivial) claim that the set of partial derivative operators associated with a coordinate system is a basis for D
pM).