First, you should think carefully about what the other posters have said.
In post #7, you define the derivative of a function as a limit. This limit results in another function. Contrast this with your original post, where covariant differentiation changes a vector to a second rank tensor.
The covariant directional derivative of a vector results in another vector, and can be defined in terms of parallel transport and limits. If you want covariant directional derivatives to be defined in terms parallel transport, then parallel transport must first be defined. This can be done, but just as the definition of covariant differentiation (at first) seems non-intuitive, the definition of parallel transport may seem non-intuitive.
The covariant derivative of A in the direction of V is given by
V^{\sigma} \nabla_{\sigma}A^{\mu} = V^{\sigma} \left( \partial_{\sigma}A^{\mu} + \Gamma^{\mu}{}_{\sigma\lambda} A^{\lambda} \right).
If \gamma is a curve, then a notion of parallel transport along \gamma is defined by a collection on invertible linear maps - one for every pair of points in the image of the curve - such that for every a, b, and c in the curve's image:
\Gamma_{a,b}: T_{a} M \rightarrow T_{b} M;
\Gamma_{a,b} \Gamma_{b,c} = \Gamma_{a,c};
suitable smoothness conditions are satisfied.
This is a fairly general notion of parallel transport - you are probably more interested in metric-compatible parallel transport.
Once a notion of parallel transport is chosen, differentiation can be defined as
\nabla_{V}A = \lim_{\epsilon \rightarrow 0} \frac{1}{\epsilon} \left[ A_{||} \left( s + \epsilon \right) - A \left( s \right) \right],
where V is the tangent vector to \gamma at curve parameter s, and where A_{||} \left( s + \epsilon \right) means A \left( s + \epsilon \right) parallel transported to \gamma \left( s \right).
Regards,
George