There's a more intuitive definition. If A and B are curves such that A(0)=B(0)=p, and x is a coordinate system, then we can say that A and B are equivalent if [itex]x\circ A[/itex] and [itex]x\circ B[/itex] have the same derivative at 0. (These are curves in [itex]\mathbb R^n[/itex], so this doesn't involve differential geometry). Then we define the tangent space at p as the set of equivalence classes of curves that pass through p at parameter value 0. It's easy to show that the equivalence relation doesn't depend on which coordinate system is used in its definition.
That set can be given a vector space structure, again using coordinate systems, and then proving that the definition is independent of the coordinate systems used. If we call that vector space T
pM, and the vector space of derivations D
pM, we can show that the function [itex]\phi:T_pM\rightarrow D_pM[/itex] defined by [itex]\phi([A])(f)=(f\circ A)'(0)[/itex] is a vector space isomorphism. Of course, first we have to prove that it's a well-defined function, by showing that the right-hand side doesn't depend on which member A we choose from the equivalence class [A].
See
Isham for details. (It also has a nice proof of the (non-trivial) claim that the set of partial derivative operators associated with a coordinate system is a basis for D
pM).