This thread got me thinking about how one might arrive conceptually at the definition of the covariant derivative of a 1 form given an affine connection on the tangent bundle. Here are some thoughts. I apologize in advance for any errors.
One can perhaps motivate the covariant derivative of a 1 form by first taking the case where the affine connection is compatible with a Riemannian metric.
If ##w## is parallel along a curve fitting ##X## and the vector field ##Y## is also parallel then one would want ##w(Y)## to be constant. So the derivative ##X⋅<w_{*},Y>## must equal zero and metric compatibility implies that ##<∇_{X}w_{*},Y>## is also zero. So the dual of ##w## under the Riemannian metric must be parallel along the curve. This suggests that a good definition of the covariant derivative of ##w## is the metric dual of the covariant derivative of its metric dual vector field.
This yields the formula ## (∇_{X}w)(Y)= X⋅w(Y)-w(∇_{X}Y)##
Since this formula does not involve a metric it suggests a definition of covariant derivative of a 1 form for any affine connection.
A similar line of thought might be to start with a set of coordinates for the tangent space at a point and ask how does one extend these coordinates along a curve so that measurement of coordinates of a vector field will not depend on changes in the coordinate 1 forms themselves but only on changes in the vector field. As in standard coordinates in Euclidean space, this can be done by setting the coordinates of the derivative of a vector field to be the derivative of its coordinates. Formally, ##X.w(Y)=w(∇_{X}Y)## for each coordinate 1 form.
When ##w## is not parallel ##X⋅w(Y) - w(∇_{X}Y)## is a 1 form in ##Y## and this again suggests the formula ## (∇_{X}w)(Y)= X⋅w(Y)-w(∇_{X}Y)##
To justify this, it must be shown that ##∇_{X}w## satisfies the definition of a covariant derivative.
Definition
A covariant derivative at a point ##p## on the cotangent bundle of a manifold assigns for each tangent vector ##X_{p}## and each 1 form ##w## a cotangent vector at ##p##. This assignment is linear in both tangent vectors at ##p## and in 1 forms and satisfies the Leibniz rule, ##∇_(X_{p})fw = (X_{p}.f)w_{p} + f∇_{X_{p}}w##. An affine connection is a covariant derivative at each point and is smooth in the sense that the covariant derivatives of a smooth 1 form with respect to a smooth vector field is also a smooth 1 form.
These properties are easily verified.
E.g. by the Leibniz rule, ##(∇_{X}fw)(Y) =X⋅(fw)(Y)- fw(∇_{X}Y)= (X⋅f)w(Y)+f(X⋅w(Y) - fw(∇_{X}Y##
##∇_{X}w## also determines an affine connection on the cotangent bundle.